My521's picture
Upload 8 files
ac4602e
[2023-12-25 09:03:53,677][distributed_c10d.py][INFO] Added key: store_based_barrier_key:1 to store for rank: 5
[2023-12-25 09:03:53,687][distributed_c10d.py][INFO] Rank 4: Completed store-based barrier for key:store_based_barrier_key:1 with 8 nodes.
[2023-12-25 09:04:20,079][model5_pretrain.py][INFO] Epoch:[0/1](0/112548) loss:11.448 lr:0.0000000 epoch_Time:5160.0min:
[2023-12-25 09:04:20,082][distributed.py][INFO] Reducer buckets have been rebuilt in this iteration.
[2023-12-25 09:04:58,031][model5_pretrain.py][INFO] Epoch:[0/1](100/112548) loss:7.868 lr:0.0000300 epoch_Time:755.0min:
[2023-12-25 09:05:35,666][model5_pretrain.py][INFO] Epoch:[0/1](200/112548) loss:7.416 lr:0.0000600 epoch_Time:730.0min:
[2023-12-25 09:06:13,474][model5_pretrain.py][INFO] Epoch:[0/1](300/112548) loss:6.188 lr:0.0000900 epoch_Time:723.0min:
[2023-12-25 09:06:51,226][model5_pretrain.py][INFO] Epoch:[0/1](400/112548) loss:6.891 lr:0.0001200 epoch_Time:717.0min:
[2023-12-25 09:07:28,887][model5_pretrain.py][INFO] Epoch:[0/1](500/112548) loss:5.332 lr:0.0001500 epoch_Time:714.0min:
[2023-12-25 09:08:06,560][model5_pretrain.py][INFO] Epoch:[0/1](600/112548) loss:5.096 lr:0.0001800 epoch_Time:712.0min:
[2023-12-25 09:08:44,245][model5_pretrain.py][INFO] Epoch:[0/1](700/112548) loss:5.120 lr:0.0002100 epoch_Time:710.0min:
[2023-12-25 09:09:21,965][model5_pretrain.py][INFO] Epoch:[0/1](800/112548) loss:4.798 lr:0.0002400 epoch_Time:708.0min:
[2023-12-25 09:09:59,676][model5_pretrain.py][INFO] Epoch:[0/1](900/112548) loss:4.646 lr:0.0002700 epoch_Time:707.0min:
[2023-12-25 09:10:37,397][model5_pretrain.py][INFO] Epoch:[0/1](1000/112548) loss:4.587 lr:0.0003000 epoch_Time:706.0min:
[2023-12-25 09:11:15,085][model5_pretrain.py][INFO] Epoch:[0/1](1100/112548) loss:4.548 lr:0.0003000 epoch_Time:705.0min:
[2023-12-25 09:11:52,785][model5_pretrain.py][INFO] Epoch:[0/1](1200/112548) loss:4.478 lr:0.0003000 epoch_Time:704.0min:
[2023-12-25 09:12:30,484][model5_pretrain.py][INFO] Epoch:[0/1](1300/112548) loss:4.533 lr:0.0003000 epoch_Time:703.0min:
[2023-12-25 09:13:08,204][model5_pretrain.py][INFO] Epoch:[0/1](1400/112548) loss:4.661 lr:0.0003000 epoch_Time:702.0min:
[2023-12-25 09:13:45,901][model5_pretrain.py][INFO] Epoch:[0/1](1500/112548) loss:4.775 lr:0.0003000 epoch_Time:701.0min:
[2023-12-25 09:14:23,591][model5_pretrain.py][INFO] Epoch:[0/1](1600/112548) loss:4.498 lr:0.0003000 epoch_Time:700.0min:
[2023-12-25 09:15:01,277][model5_pretrain.py][INFO] Epoch:[0/1](1700/112548) loss:3.631 lr:0.0002999 epoch_Time:700.0min:
[2023-12-25 09:15:38,957][model5_pretrain.py][INFO] Epoch:[0/1](1800/112548) loss:3.503 lr:0.0002999 epoch_Time:698.0min:
[2023-12-25 09:16:16,636][model5_pretrain.py][INFO] Epoch:[0/1](1900/112548) loss:4.775 lr:0.0002999 epoch_Time:698.0min:
[2023-12-25 09:16:54,324][model5_pretrain.py][INFO] Epoch:[0/1](2000/112548) loss:4.013 lr:0.0002999 epoch_Time:697.0min:
[2023-12-25 09:17:31,998][model5_pretrain.py][INFO] Epoch:[0/1](2100/112548) loss:3.858 lr:0.0002999 epoch_Time:696.0min:
[2023-12-25 09:18:09,678][model5_pretrain.py][INFO] Epoch:[0/1](2200/112548) loss:3.645 lr:0.0002998 epoch_Time:696.0min:
[2023-12-25 09:18:47,351][model5_pretrain.py][INFO] Epoch:[0/1](2300/112548) loss:4.378 lr:0.0002998 epoch_Time:695.0min:
[2023-12-25 09:19:25,021][model5_pretrain.py][INFO] Epoch:[0/1](2400/112548) loss:3.961 lr:0.0002998 epoch_Time:694.0min:
[2023-12-25 09:20:02,689][model5_pretrain.py][INFO] Epoch:[0/1](2500/112548) loss:4.001 lr:0.0002997 epoch_Time:694.0min:
[2023-12-25 09:20:40,352][model5_pretrain.py][INFO] Epoch:[0/1](2600/112548) loss:4.306 lr:0.0002997 epoch_Time:692.0min:
[2023-12-25 09:21:18,035][model5_pretrain.py][INFO] Epoch:[0/1](2700/112548) loss:4.191 lr:0.0002997 epoch_Time:691.0min:
[2023-12-25 09:21:55,707][model5_pretrain.py][INFO] Epoch:[0/1](2800/112548) loss:3.702 lr:0.0002996 epoch_Time:691.0min:
[2023-12-25 09:22:33,376][model5_pretrain.py][INFO] Epoch:[0/1](2900/112548) loss:4.223 lr:0.0002996 epoch_Time:690.0min:
[2023-12-25 09:23:11,050][model5_pretrain.py][INFO] Epoch:[0/1](3000/112548) loss:4.147 lr:0.0002995 epoch_Time:690.0min:
[2023-12-25 09:23:48,725][model5_pretrain.py][INFO] Epoch:[0/1](3100/112548) loss:4.070 lr:0.0002995 epoch_Time:689.0min:
[2023-12-25 09:24:26,396][model5_pretrain.py][INFO] Epoch:[0/1](3200/112548) loss:3.090 lr:0.0002994 epoch_Time:688.0min:
[2023-12-25 09:25:04,078][model5_pretrain.py][INFO] Epoch:[0/1](3300/112548) loss:3.437 lr:0.0002994 epoch_Time:688.0min:
[2023-12-25 09:25:41,760][model5_pretrain.py][INFO] Epoch:[0/1](3400/112548) loss:3.836 lr:0.0002993 epoch_Time:687.0min:
[2023-12-25 09:26:19,442][model5_pretrain.py][INFO] Epoch:[0/1](3500/112548) loss:4.028 lr:0.0002993 epoch_Time:686.0min:
[2023-12-25 09:26:57,130][model5_pretrain.py][INFO] Epoch:[0/1](3600/112548) loss:3.261 lr:0.0002992 epoch_Time:686.0min:
[2023-12-25 09:27:34,779][model5_pretrain.py][INFO] Epoch:[0/1](3700/112548) loss:4.117 lr:0.0002992 epoch_Time:685.0min:
[2023-12-25 09:28:12,457][model5_pretrain.py][INFO] Epoch:[0/1](3800/112548) loss:4.117 lr:0.0002991 epoch_Time:685.0min:
[2023-12-25 09:28:50,123][model5_pretrain.py][INFO] Epoch:[0/1](3900/112548) loss:3.766 lr:0.0002990 epoch_Time:684.0min:
[2023-12-25 09:29:27,785][model5_pretrain.py][INFO] Epoch:[0/1](4000/112548) loss:3.485 lr:0.0002990 epoch_Time:683.0min:
[2023-12-25 09:30:05,463][model5_pretrain.py][INFO] Epoch:[0/1](4100/112548) loss:3.641 lr:0.0002989 epoch_Time:683.0min:
[2023-12-25 09:30:43,127][model5_pretrain.py][INFO] Epoch:[0/1](4200/112548) loss:3.077 lr:0.0002988 epoch_Time:682.0min:
[2023-12-25 09:31:20,794][model5_pretrain.py][INFO] Epoch:[0/1](4300/112548) loss:3.516 lr:0.0002988 epoch_Time:681.0min:
[2023-12-25 09:31:58,473][model5_pretrain.py][INFO] Epoch:[0/1](4400/112548) loss:3.751 lr:0.0002987 epoch_Time:681.0min:
[2023-12-25 09:32:36,140][model5_pretrain.py][INFO] Epoch:[0/1](4500/112548) loss:3.204 lr:0.0002986 epoch_Time:679.0min:
[2023-12-25 09:33:13,808][model5_pretrain.py][INFO] Epoch:[0/1](4600/112548) loss:3.512 lr:0.0002985 epoch_Time:679.0min:
[2023-12-25 09:33:51,469][model5_pretrain.py][INFO] Epoch:[0/1](4700/112548) loss:3.664 lr:0.0002984 epoch_Time:678.0min:
[2023-12-25 09:34:29,147][model5_pretrain.py][INFO] Epoch:[0/1](4800/112548) loss:3.222 lr:0.0002983 epoch_Time:677.0min:
[2023-12-25 09:35:06,817][model5_pretrain.py][INFO] Epoch:[0/1](4900/112548) loss:3.977 lr:0.0002983 epoch_Time:677.0min:
[2023-12-25 09:35:44,496][model5_pretrain.py][INFO] Epoch:[0/1](5000/112548) loss:3.618 lr:0.0002982 epoch_Time:676.0min:
[2023-12-25 09:36:22,163][model5_pretrain.py][INFO] Epoch:[0/1](5100/112548) loss:3.887 lr:0.0002981 epoch_Time:675.0min:
[2023-12-25 09:36:59,839][model5_pretrain.py][INFO] Epoch:[0/1](5200/112548) loss:3.761 lr:0.0002980 epoch_Time:675.0min:
[2023-12-25 09:37:37,502][model5_pretrain.py][INFO] Epoch:[0/1](5300/112548) loss:3.315 lr:0.0002979 epoch_Time:674.0min:
[2023-12-25 09:38:15,173][model5_pretrain.py][INFO] Epoch:[0/1](5400/112548) loss:3.479 lr:0.0002978 epoch_Time:674.0min:
[2023-12-25 09:38:52,844][model5_pretrain.py][INFO] Epoch:[0/1](5500/112548) loss:2.999 lr:0.0002977 epoch_Time:673.0min:
[2023-12-25 09:39:30,526][model5_pretrain.py][INFO] Epoch:[0/1](5600/112548) loss:3.832 lr:0.0002976 epoch_Time:672.0min:
[2023-12-25 09:40:08,204][model5_pretrain.py][INFO] Epoch:[0/1](5700/112548) loss:3.473 lr:0.0002975 epoch_Time:672.0min:
[2023-12-25 09:40:45,860][model5_pretrain.py][INFO] Epoch:[0/1](5800/112548) loss:3.407 lr:0.0002974 epoch_Time:671.0min:
[2023-12-25 09:41:23,535][model5_pretrain.py][INFO] Epoch:[0/1](5900/112548) loss:3.253 lr:0.0002973 epoch_Time:670.0min:
[2023-12-25 09:42:01,189][model5_pretrain.py][INFO] Epoch:[0/1](6000/112548) loss:3.847 lr:0.0002971 epoch_Time:670.0min:
[2023-12-25 09:42:38,862][model5_pretrain.py][INFO] Epoch:[0/1](6100/112548) loss:3.551 lr:0.0002970 epoch_Time:669.0min:
[2023-12-25 09:43:16,531][model5_pretrain.py][INFO] Epoch:[0/1](6200/112548) loss:3.246 lr:0.0002969 epoch_Time:669.0min:
[2023-12-25 09:43:54,212][model5_pretrain.py][INFO] Epoch:[0/1](6300/112548) loss:3.516 lr:0.0002968 epoch_Time:668.0min:
[2023-12-25 09:44:31,881][model5_pretrain.py][INFO] Epoch:[0/1](6400/112548) loss:3.571 lr:0.0002967 epoch_Time:667.0min:
[2023-12-25 09:45:09,553][model5_pretrain.py][INFO] Epoch:[0/1](6500/112548) loss:3.691 lr:0.0002965 epoch_Time:667.0min:
[2023-12-25 09:45:47,232][model5_pretrain.py][INFO] Epoch:[0/1](6600/112548) loss:3.781 lr:0.0002964 epoch_Time:666.0min:
[2023-12-25 09:46:24,903][model5_pretrain.py][INFO] Epoch:[0/1](6700/112548) loss:3.396 lr:0.0002963 epoch_Time:665.0min:
[2023-12-25 09:47:02,582][model5_pretrain.py][INFO] Epoch:[0/1](6800/112548) loss:3.499 lr:0.0002962 epoch_Time:665.0min:
[2023-12-25 09:47:40,265][model5_pretrain.py][INFO] Epoch:[0/1](6900/112548) loss:4.133 lr:0.0002960 epoch_Time:664.0min:
[2023-12-25 09:48:17,966][model5_pretrain.py][INFO] Epoch:[0/1](7000/112548) loss:3.885 lr:0.0002959 epoch_Time:663.0min:
[2023-12-25 09:48:55,670][model5_pretrain.py][INFO] Epoch:[0/1](7100/112548) loss:2.886 lr:0.0002958 epoch_Time:663.0min:
[2023-12-25 09:49:33,370][model5_pretrain.py][INFO] Epoch:[0/1](7200/112548) loss:3.441 lr:0.0002956 epoch_Time:662.0min:
[2023-12-25 09:50:11,061][model5_pretrain.py][INFO] Epoch:[0/1](7300/112548) loss:3.878 lr:0.0002955 epoch_Time:662.0min:
[2023-12-25 09:50:48,760][model5_pretrain.py][INFO] Epoch:[0/1](7400/112548) loss:3.380 lr:0.0002953 epoch_Time:661.0min:
[2023-12-25 09:51:26,464][model5_pretrain.py][INFO] Epoch:[0/1](7500/112548) loss:3.334 lr:0.0002952 epoch_Time:660.0min:
[2023-12-25 09:52:04,169][model5_pretrain.py][INFO] Epoch:[0/1](7600/112548) loss:3.551 lr:0.0002950 epoch_Time:660.0min:
[2023-12-25 09:52:41,861][model5_pretrain.py][INFO] Epoch:[0/1](7700/112548) loss:3.822 lr:0.0002949 epoch_Time:659.0min:
[2023-12-25 09:53:19,567][model5_pretrain.py][INFO] Epoch:[0/1](7800/112548) loss:3.424 lr:0.0002947 epoch_Time:658.0min:
[2023-12-25 09:53:57,268][model5_pretrain.py][INFO] Epoch:[0/1](7900/112548) loss:3.482 lr:0.0002946 epoch_Time:658.0min:
[2023-12-25 09:54:34,941][model5_pretrain.py][INFO] Epoch:[0/1](8000/112548) loss:3.434 lr:0.0002944 epoch_Time:657.0min:
[2023-12-25 09:55:12,635][model5_pretrain.py][INFO] Epoch:[0/1](8100/112548) loss:2.995 lr:0.0002943 epoch_Time:657.0min:
[2023-12-25 09:55:50,317][model5_pretrain.py][INFO] Epoch:[0/1](8200/112548) loss:3.274 lr:0.0002941 epoch_Time:656.0min:
[2023-12-25 09:56:28,005][model5_pretrain.py][INFO] Epoch:[0/1](8300/112548) loss:3.621 lr:0.0002939 epoch_Time:655.0min:
[2023-12-25 09:57:05,698][model5_pretrain.py][INFO] Epoch:[0/1](8400/112548) loss:3.380 lr:0.0002938 epoch_Time:655.0min:
[2023-12-25 09:57:43,396][model5_pretrain.py][INFO] Epoch:[0/1](8500/112548) loss:3.840 lr:0.0002936 epoch_Time:654.0min:
[2023-12-25 09:58:21,079][model5_pretrain.py][INFO] Epoch:[0/1](8600/112548) loss:3.825 lr:0.0002934 epoch_Time:653.0min:
[2023-12-25 09:58:58,787][model5_pretrain.py][INFO] Epoch:[0/1](8700/112548) loss:3.484 lr:0.0002933 epoch_Time:653.0min:
[2023-12-25 09:59:36,492][model5_pretrain.py][INFO] Epoch:[0/1](8800/112548) loss:3.754 lr:0.0002931 epoch_Time:652.0min:
[2023-12-25 10:00:14,191][model5_pretrain.py][INFO] Epoch:[0/1](8900/112548) loss:3.495 lr:0.0002929 epoch_Time:652.0min:
[2023-12-25 10:00:51,882][model5_pretrain.py][INFO] Epoch:[0/1](9000/112548) loss:3.392 lr:0.0002927 epoch_Time:651.0min:
[2023-12-25 10:01:29,579][model5_pretrain.py][INFO] Epoch:[0/1](9100/112548) loss:3.491 lr:0.0002925 epoch_Time:650.0min:
[2023-12-25 10:02:07,277][model5_pretrain.py][INFO] Epoch:[0/1](9200/112548) loss:4.133 lr:0.0002924 epoch_Time:650.0min:
[2023-12-25 10:02:44,972][model5_pretrain.py][INFO] Epoch:[0/1](9300/112548) loss:3.342 lr:0.0002922 epoch_Time:649.0min:
[2023-12-25 10:03:22,653][model5_pretrain.py][INFO] Epoch:[0/1](9400/112548) loss:3.409 lr:0.0002920 epoch_Time:648.0min:
[2023-12-25 10:04:00,344][model5_pretrain.py][INFO] Epoch:[0/1](9500/112548) loss:3.553 lr:0.0002918 epoch_Time:648.0min:
[2023-12-25 10:04:38,048][model5_pretrain.py][INFO] Epoch:[0/1](9600/112548) loss:2.951 lr:0.0002916 epoch_Time:647.0min:
[2023-12-25 10:05:15,732][model5_pretrain.py][INFO] Epoch:[0/1](9700/112548) loss:3.427 lr:0.0002914 epoch_Time:647.0min:
[2023-12-25 10:05:53,438][model5_pretrain.py][INFO] Epoch:[0/1](9800/112548) loss:3.828 lr:0.0002912 epoch_Time:646.0min:
[2023-12-25 10:06:31,136][model5_pretrain.py][INFO] Epoch:[0/1](9900/112548) loss:3.682 lr:0.0002910 epoch_Time:645.0min:
[2023-12-25 10:07:08,835][model5_pretrain.py][INFO] Epoch:[0/1](10000/112548) loss:3.593 lr:0.0002908 epoch_Time:645.0min:
[2023-12-25 10:07:46,499][model5_pretrain.py][INFO] Epoch:[0/1](10100/112548) loss:3.510 lr:0.0002906 epoch_Time:644.0min:
[2023-12-25 10:08:24,203][model5_pretrain.py][INFO] Epoch:[0/1](10200/112548) loss:3.644 lr:0.0002904 epoch_Time:643.0min:
[2023-12-25 10:09:01,890][model5_pretrain.py][INFO] Epoch:[0/1](10300/112548) loss:3.415 lr:0.0002902 epoch_Time:643.0min:
[2023-12-25 10:09:39,596][model5_pretrain.py][INFO] Epoch:[0/1](10400/112548) loss:3.621 lr:0.0002900 epoch_Time:642.0min:
[2023-12-25 10:10:17,294][model5_pretrain.py][INFO] Epoch:[0/1](10500/112548) loss:3.574 lr:0.0002898 epoch_Time:642.0min:
[2023-12-25 10:10:54,991][model5_pretrain.py][INFO] Epoch:[0/1](10600/112548) loss:3.539 lr:0.0002896 epoch_Time:641.0min:
[2023-12-25 10:11:32,690][model5_pretrain.py][INFO] Epoch:[0/1](10700/112548) loss:3.539 lr:0.0002893 epoch_Time:640.0min:
[2023-12-25 10:12:10,389][model5_pretrain.py][INFO] Epoch:[0/1](10800/112548) loss:3.658 lr:0.0002891 epoch_Time:640.0min:
[2023-12-25 10:12:48,079][model5_pretrain.py][INFO] Epoch:[0/1](10900/112548) loss:3.451 lr:0.0002889 epoch_Time:639.0min:
[2023-12-25 10:13:25,785][model5_pretrain.py][INFO] Epoch:[0/1](11000/112548) loss:3.075 lr:0.0002887 epoch_Time:638.0min:
[2023-12-25 10:14:03,485][model5_pretrain.py][INFO] Epoch:[0/1](11100/112548) loss:3.515 lr:0.0002885 epoch_Time:638.0min:
[2023-12-25 10:14:41,176][model5_pretrain.py][INFO] Epoch:[0/1](11200/112548) loss:3.180 lr:0.0002882 epoch_Time:637.0min:
[2023-12-25 10:15:18,869][model5_pretrain.py][INFO] Epoch:[0/1](11300/112548) loss:3.476 lr:0.0002880 epoch_Time:636.0min:
[2023-12-25 10:15:56,566][model5_pretrain.py][INFO] Epoch:[0/1](11400/112548) loss:3.186 lr:0.0002878 epoch_Time:636.0min:
[2023-12-25 10:16:34,264][model5_pretrain.py][INFO] Epoch:[0/1](11500/112548) loss:3.294 lr:0.0002875 epoch_Time:635.0min:
[2023-12-25 10:17:11,960][model5_pretrain.py][INFO] Epoch:[0/1](11600/112548) loss:3.662 lr:0.0002873 epoch_Time:635.0min:
[2023-12-25 10:17:49,654][model5_pretrain.py][INFO] Epoch:[0/1](11700/112548) loss:3.092 lr:0.0002871 epoch_Time:634.0min:
[2023-12-25 10:18:27,354][model5_pretrain.py][INFO] Epoch:[0/1](11800/112548) loss:3.717 lr:0.0002868 epoch_Time:633.0min:
[2023-12-25 10:19:05,058][model5_pretrain.py][INFO] Epoch:[0/1](11900/112548) loss:3.214 lr:0.0002866 epoch_Time:633.0min:
[2023-12-25 10:19:42,752][model5_pretrain.py][INFO] Epoch:[0/1](12000/112548) loss:3.665 lr:0.0002863 epoch_Time:632.0min:
[2023-12-25 10:20:20,453][model5_pretrain.py][INFO] Epoch:[0/1](12100/112548) loss:3.574 lr:0.0002861 epoch_Time:631.0min:
[2023-12-25 10:20:58,156][model5_pretrain.py][INFO] Epoch:[0/1](12200/112548) loss:3.534 lr:0.0002859 epoch_Time:631.0min:
[2023-12-25 10:21:35,860][model5_pretrain.py][INFO] Epoch:[0/1](12300/112548) loss:3.488 lr:0.0002856 epoch_Time:630.0min:
[2023-12-25 10:22:13,572][model5_pretrain.py][INFO] Epoch:[0/1](12400/112548) loss:2.777 lr:0.0002854 epoch_Time:630.0min:
[2023-12-25 10:22:51,240][model5_pretrain.py][INFO] Epoch:[0/1](12500/112548) loss:3.611 lr:0.0002851 epoch_Time:629.0min:
[2023-12-25 10:23:28,947][model5_pretrain.py][INFO] Epoch:[0/1](12600/112548) loss:3.201 lr:0.0002848 epoch_Time:628.0min:
[2023-12-25 10:24:06,651][model5_pretrain.py][INFO] Epoch:[0/1](12700/112548) loss:3.245 lr:0.0002846 epoch_Time:628.0min:
[2023-12-25 10:24:44,345][model5_pretrain.py][INFO] Epoch:[0/1](12800/112548) loss:3.558 lr:0.0002843 epoch_Time:627.0min:
[2023-12-25 10:25:22,056][model5_pretrain.py][INFO] Epoch:[0/1](12900/112548) loss:3.345 lr:0.0002841 epoch_Time:626.0min:
[2023-12-25 10:25:59,766][model5_pretrain.py][INFO] Epoch:[0/1](13000/112548) loss:3.465 lr:0.0002838 epoch_Time:626.0min:
[2023-12-25 10:26:37,470][model5_pretrain.py][INFO] Epoch:[0/1](13100/112548) loss:3.813 lr:0.0002835 epoch_Time:625.0min:
[2023-12-25 10:27:15,175][model5_pretrain.py][INFO] Epoch:[0/1](13200/112548) loss:3.181 lr:0.0002833 epoch_Time:625.0min:
[2023-12-25 10:27:52,866][model5_pretrain.py][INFO] Epoch:[0/1](13300/112548) loss:2.879 lr:0.0002830 epoch_Time:624.0min:
[2023-12-25 10:28:30,573][model5_pretrain.py][INFO] Epoch:[0/1](13400/112548) loss:3.288 lr:0.0002827 epoch_Time:623.0min:
[2023-12-25 10:29:08,268][model5_pretrain.py][INFO] Epoch:[0/1](13500/112548) loss:2.932 lr:0.0002825 epoch_Time:623.0min:
[2023-12-25 10:29:45,965][model5_pretrain.py][INFO] Epoch:[0/1](13600/112548) loss:3.387 lr:0.0002822 epoch_Time:622.0min:
[2023-12-25 10:30:23,667][model5_pretrain.py][INFO] Epoch:[0/1](13700/112548) loss:3.095 lr:0.0002819 epoch_Time:621.0min:
[2023-12-25 10:31:01,361][model5_pretrain.py][INFO] Epoch:[0/1](13800/112548) loss:3.321 lr:0.0002816 epoch_Time:621.0min:
[2023-12-25 10:31:39,091][model5_pretrain.py][INFO] Epoch:[0/1](13900/112548) loss:3.250 lr:0.0002813 epoch_Time:620.0min:
[2023-12-25 10:32:16,815][model5_pretrain.py][INFO] Epoch:[0/1](14000/112548) loss:2.916 lr:0.0002811 epoch_Time:620.0min:
[2023-12-25 10:32:54,539][model5_pretrain.py][INFO] Epoch:[0/1](14100/112548) loss:3.191 lr:0.0002808 epoch_Time:619.0min:
[2023-12-25 10:33:32,250][model5_pretrain.py][INFO] Epoch:[0/1](14200/112548) loss:3.126 lr:0.0002805 epoch_Time:618.0min:
[2023-12-25 10:34:09,977][model5_pretrain.py][INFO] Epoch:[0/1](14300/112548) loss:3.248 lr:0.0002802 epoch_Time:618.0min:
[2023-12-25 10:34:47,705][model5_pretrain.py][INFO] Epoch:[0/1](14400/112548) loss:3.093 lr:0.0002799 epoch_Time:617.0min:
[2023-12-25 10:35:25,422][model5_pretrain.py][INFO] Epoch:[0/1](14500/112548) loss:3.257 lr:0.0002796 epoch_Time:616.0min:
[2023-12-25 10:36:03,149][model5_pretrain.py][INFO] Epoch:[0/1](14600/112548) loss:3.408 lr:0.0002793 epoch_Time:616.0min:
[2023-12-25 10:36:40,844][model5_pretrain.py][INFO] Epoch:[0/1](14700/112548) loss:2.778 lr:0.0002790 epoch_Time:615.0min:
[2023-12-25 10:37:18,575][model5_pretrain.py][INFO] Epoch:[0/1](14800/112548) loss:3.346 lr:0.0002787 epoch_Time:614.0min:
[2023-12-25 10:37:56,295][model5_pretrain.py][INFO] Epoch:[0/1](14900/112548) loss:3.483 lr:0.0002784 epoch_Time:614.0min:
[2023-12-25 10:38:34,009][model5_pretrain.py][INFO] Epoch:[0/1](15000/112548) loss:3.169 lr:0.0002781 epoch_Time:613.0min:
[2023-12-25 10:39:11,736][model5_pretrain.py][INFO] Epoch:[0/1](15100/112548) loss:3.078 lr:0.0002778 epoch_Time:613.0min:
[2023-12-25 10:39:49,459][model5_pretrain.py][INFO] Epoch:[0/1](15200/112548) loss:2.988 lr:0.0002775 epoch_Time:612.0min:
[2023-12-25 10:40:27,197][model5_pretrain.py][INFO] Epoch:[0/1](15300/112548) loss:2.867 lr:0.0002772 epoch_Time:611.0min:
[2023-12-25 10:41:04,910][model5_pretrain.py][INFO] Epoch:[0/1](15400/112548) loss:3.263 lr:0.0002769 epoch_Time:611.0min:
[2023-12-25 10:41:42,641][model5_pretrain.py][INFO] Epoch:[0/1](15500/112548) loss:3.125 lr:0.0002766 epoch_Time:610.0min:
[2023-12-25 10:42:20,361][model5_pretrain.py][INFO] Epoch:[0/1](15600/112548) loss:3.032 lr:0.0002762 epoch_Time:609.0min:
[2023-12-25 10:42:58,092][model5_pretrain.py][INFO] Epoch:[0/1](15700/112548) loss:3.016 lr:0.0002759 epoch_Time:609.0min:
[2023-12-25 10:43:35,809][model5_pretrain.py][INFO] Epoch:[0/1](15800/112548) loss:3.324 lr:0.0002756 epoch_Time:608.0min:
[2023-12-25 10:44:13,538][model5_pretrain.py][INFO] Epoch:[0/1](15900/112548) loss:2.720 lr:0.0002753 epoch_Time:608.0min:
[2023-12-25 10:44:51,252][model5_pretrain.py][INFO] Epoch:[0/1](16000/112548) loss:3.429 lr:0.0002750 epoch_Time:607.0min:
[2023-12-25 10:45:28,983][model5_pretrain.py][INFO] Epoch:[0/1](16100/112548) loss:3.555 lr:0.0002746 epoch_Time:606.0min:
[2023-12-25 10:46:06,719][model5_pretrain.py][INFO] Epoch:[0/1](16200/112548) loss:2.867 lr:0.0002743 epoch_Time:606.0min:
[2023-12-25 10:46:44,445][model5_pretrain.py][INFO] Epoch:[0/1](16300/112548) loss:3.313 lr:0.0002740 epoch_Time:605.0min:
[2023-12-25 10:47:22,164][model5_pretrain.py][INFO] Epoch:[0/1](16400/112548) loss:2.614 lr:0.0002736 epoch_Time:604.0min:
[2023-12-25 10:47:59,892][model5_pretrain.py][INFO] Epoch:[0/1](16500/112548) loss:3.297 lr:0.0002733 epoch_Time:604.0min:
[2023-12-25 10:48:37,629][model5_pretrain.py][INFO] Epoch:[0/1](16600/112548) loss:2.595 lr:0.0002730 epoch_Time:603.0min:
[2023-12-25 10:49:15,362][model5_pretrain.py][INFO] Epoch:[0/1](16700/112548) loss:3.915 lr:0.0002726 epoch_Time:603.0min:
[2023-12-25 10:49:53,089][model5_pretrain.py][INFO] Epoch:[0/1](16800/112548) loss:3.587 lr:0.0002723 epoch_Time:602.0min:
[2023-12-25 10:50:30,774][model5_pretrain.py][INFO] Epoch:[0/1](16900/112548) loss:3.243 lr:0.0002720 epoch_Time:601.0min:
[2023-12-25 10:51:08,499][model5_pretrain.py][INFO] Epoch:[0/1](17000/112548) loss:3.072 lr:0.0002716 epoch_Time:601.0min:
[2023-12-25 10:51:46,217][model5_pretrain.py][INFO] Epoch:[0/1](17100/112548) loss:2.936 lr:0.0002713 epoch_Time:600.0min:
[2023-12-25 10:52:23,928][model5_pretrain.py][INFO] Epoch:[0/1](17200/112548) loss:2.446 lr:0.0002709 epoch_Time:599.0min:
[2023-12-25 10:53:01,642][model5_pretrain.py][INFO] Epoch:[0/1](17300/112548) loss:3.783 lr:0.0002706 epoch_Time:599.0min:
[2023-12-25 10:53:39,357][model5_pretrain.py][INFO] Epoch:[0/1](17400/112548) loss:2.821 lr:0.0002702 epoch_Time:598.0min:
[2023-12-25 10:54:17,069][model5_pretrain.py][INFO] Epoch:[0/1](17500/112548) loss:2.442 lr:0.0002699 epoch_Time:598.0min:
[2023-12-25 10:54:54,786][model5_pretrain.py][INFO] Epoch:[0/1](17600/112548) loss:2.982 lr:0.0002695 epoch_Time:597.0min:
[2023-12-25 10:55:32,499][model5_pretrain.py][INFO] Epoch:[0/1](17700/112548) loss:3.030 lr:0.0002692 epoch_Time:596.0min:
[2023-12-25 10:56:10,219][model5_pretrain.py][INFO] Epoch:[0/1](17800/112548) loss:3.489 lr:0.0002688 epoch_Time:596.0min:
[2023-12-25 10:56:47,939][model5_pretrain.py][INFO] Epoch:[0/1](17900/112548) loss:3.018 lr:0.0002685 epoch_Time:595.0min:
[2023-12-25 10:57:25,652][model5_pretrain.py][INFO] Epoch:[0/1](18000/112548) loss:2.521 lr:0.0002681 epoch_Time:594.0min:
[2023-12-25 10:58:03,370][model5_pretrain.py][INFO] Epoch:[0/1](18100/112548) loss:2.989 lr:0.0002677 epoch_Time:594.0min:
[2023-12-25 10:58:41,082][model5_pretrain.py][INFO] Epoch:[0/1](18200/112548) loss:3.448 lr:0.0002674 epoch_Time:593.0min:
[2023-12-25 10:59:18,795][model5_pretrain.py][INFO] Epoch:[0/1](18300/112548) loss:3.020 lr:0.0002670 epoch_Time:592.0min:
[2023-12-25 10:59:56,511][model5_pretrain.py][INFO] Epoch:[0/1](18400/112548) loss:3.368 lr:0.0002667 epoch_Time:592.0min:
[2023-12-25 11:00:34,230][model5_pretrain.py][INFO] Epoch:[0/1](18500/112548) loss:3.081 lr:0.0002663 epoch_Time:591.0min:
[2023-12-25 11:01:11,961][model5_pretrain.py][INFO] Epoch:[0/1](18600/112548) loss:3.365 lr:0.0002659 epoch_Time:591.0min:
[2023-12-25 11:01:49,683][model5_pretrain.py][INFO] Epoch:[0/1](18700/112548) loss:3.282 lr:0.0002655 epoch_Time:590.0min:
[2023-12-25 11:02:27,391][model5_pretrain.py][INFO] Epoch:[0/1](18800/112548) loss:2.923 lr:0.0002652 epoch_Time:589.0min:
[2023-12-25 11:03:05,103][model5_pretrain.py][INFO] Epoch:[0/1](18900/112548) loss:2.789 lr:0.0002648 epoch_Time:589.0min:
[2023-12-25 11:03:42,818][model5_pretrain.py][INFO] Epoch:[0/1](19000/112548) loss:2.694 lr:0.0002644 epoch_Time:588.0min:
[2023-12-25 11:04:20,559][model5_pretrain.py][INFO] Epoch:[0/1](19100/112548) loss:2.977 lr:0.0002640 epoch_Time:587.0min:
[2023-12-25 11:04:58,292][model5_pretrain.py][INFO] Epoch:[0/1](19200/112548) loss:3.344 lr:0.0002637 epoch_Time:587.0min:
[2023-12-25 11:05:36,055][model5_pretrain.py][INFO] Epoch:[0/1](19300/112548) loss:3.387 lr:0.0002633 epoch_Time:586.0min:
[2023-12-25 11:06:13,766][model5_pretrain.py][INFO] Epoch:[0/1](19400/112548) loss:3.078 lr:0.0002629 epoch_Time:586.0min:
[2023-12-25 11:06:51,479][model5_pretrain.py][INFO] Epoch:[0/1](19500/112548) loss:2.918 lr:0.0002625 epoch_Time:585.0min:
[2023-12-25 11:07:29,199][model5_pretrain.py][INFO] Epoch:[0/1](19600/112548) loss:2.968 lr:0.0002621 epoch_Time:584.0min:
[2023-12-25 11:08:06,922][model5_pretrain.py][INFO] Epoch:[0/1](19700/112548) loss:3.243 lr:0.0002617 epoch_Time:584.0min:
[2023-12-25 11:08:44,651][model5_pretrain.py][INFO] Epoch:[0/1](19800/112548) loss:3.307 lr:0.0002613 epoch_Time:583.0min:
[2023-12-25 11:09:22,368][model5_pretrain.py][INFO] Epoch:[0/1](19900/112548) loss:2.781 lr:0.0002609 epoch_Time:582.0min:
[2023-12-25 11:10:00,094][model5_pretrain.py][INFO] Epoch:[0/1](20000/112548) loss:3.191 lr:0.0002605 epoch_Time:582.0min:
[2023-12-25 11:10:43,527][model5_pretrain.py][INFO] Epoch:[0/1](20100/112548) loss:2.538 lr:0.0002601 epoch_Time:581.0min:
[2023-12-25 11:11:21,251][model5_pretrain.py][INFO] Epoch:[0/1](20200/112548) loss:3.081 lr:0.0002597 epoch_Time:580.0min:
[2023-12-25 11:11:58,980][model5_pretrain.py][INFO] Epoch:[0/1](20300/112548) loss:2.833 lr:0.0002593 epoch_Time:580.0min:
[2023-12-25 11:12:36,729][model5_pretrain.py][INFO] Epoch:[0/1](20400/112548) loss:3.408 lr:0.0002589 epoch_Time:579.0min:
[2023-12-25 11:13:14,462][model5_pretrain.py][INFO] Epoch:[0/1](20500/112548) loss:2.923 lr:0.0002585 epoch_Time:579.0min:
[2023-12-25 11:13:52,179][model5_pretrain.py][INFO] Epoch:[0/1](20600/112548) loss:2.764 lr:0.0002581 epoch_Time:578.0min:
[2023-12-25 11:14:29,902][model5_pretrain.py][INFO] Epoch:[0/1](20700/112548) loss:3.423 lr:0.0002577 epoch_Time:577.0min:
[2023-12-25 11:15:07,620][model5_pretrain.py][INFO] Epoch:[0/1](20800/112548) loss:2.797 lr:0.0002573 epoch_Time:577.0min:
[2023-12-25 11:15:45,332][model5_pretrain.py][INFO] Epoch:[0/1](20900/112548) loss:2.367 lr:0.0002569 epoch_Time:576.0min:
[2023-12-25 11:16:23,050][model5_pretrain.py][INFO] Epoch:[0/1](21000/112548) loss:2.966 lr:0.0002565 epoch_Time:575.0min:
[2023-12-25 11:17:00,762][model5_pretrain.py][INFO] Epoch:[0/1](21100/112548) loss:3.032 lr:0.0002561 epoch_Time:575.0min:
[2023-12-25 11:17:38,452][model5_pretrain.py][INFO] Epoch:[0/1](21200/112548) loss:3.296 lr:0.0002557 epoch_Time:574.0min:
[2023-12-25 11:18:16,175][model5_pretrain.py][INFO] Epoch:[0/1](21300/112548) loss:3.288 lr:0.0002553 epoch_Time:574.0min:
[2023-12-25 11:18:53,899][model5_pretrain.py][INFO] Epoch:[0/1](21400/112548) loss:3.012 lr:0.0002548 epoch_Time:573.0min:
[2023-12-25 11:19:31,615][model5_pretrain.py][INFO] Epoch:[0/1](21500/112548) loss:3.191 lr:0.0002544 epoch_Time:572.0min:
[2023-12-25 11:20:09,338][model5_pretrain.py][INFO] Epoch:[0/1](21600/112548) loss:3.764 lr:0.0002540 epoch_Time:572.0min:
[2023-12-25 11:20:47,064][model5_pretrain.py][INFO] Epoch:[0/1](21700/112548) loss:3.462 lr:0.0002536 epoch_T[2023-12-25 11[2023-12-25 11:21:24,787][model5_pretrain.py][INFO] Epoch:[0/1](21800/112548) loss:3.328 lr:0.0002532 epoch_Time:570.0min:
[2023-12-25 11:22:02,500][model5_pretrain.py][INFO] Epoch:[0/1](21900/112548) loss:3.182 lr:0.0002527 epoch_Time:570.0min:
[2023-12-25 11:22:40,228][model5_pretrain.py][INFO] Epoch:[0/1](22000/112548) loss:3.622 lr:0.0002523 epoch_Time:569.0min:
[2023-12-25 11:23:17,945][model5_pretrain.py][INFO] Epoch:[0/1](22100/112548) loss:3.608 lr:0.0002519 epoch_Time:568.0min:
[2023-12-25 11:23:55,659][model5_pretrain.py][INFO] Epoch:[0/1](22200/112548) loss:3.103 lr:0.0002515 epoch_Time:568.0min:
[2023-12-25 11:24:33,383][model5_pretrain.py][INFO] Epoch:[0/1](22300/112548) loss:2.716 lr:0.0002510 epoch_Time:567.0min:
[2023-12-25 11:25:11,088][model5_pretrain.py][INFO] Epoch:[0/1](22400/112548) loss:3.219 lr:0.0002506 epoch_T[2023-12-25 11[2023-12-25 11:25:48,803][model5_pretrain.py][INFO] Epoch:[0/1](22500/112548) loss:3.227 lr:0.0002502 epoch_Time:566.0min:
[2023-12-25 11:26:26,513][model5_pretrain.py][INFO] Epoch:[0/1](22600/112548) loss:3.507 lr:0.0002497 epoch_Time:565.0min:
[2023-12-25 11:27:04,232][model5_pretrain.py][INFO] Epoch:[0/1](22700/112548) loss:3.241 lr:0.0002493 epoch_Time:565.0min:
[2023-12-25 11:27:41,949][model5_pretrain.py][INFO] Epoch:[0/1](22800/112548) loss:2.855 lr:0.0002488 epoch_Time:564.0min:
[2023-12-25 11:28:19,658][model5_pretrain.py][INFO] Epoch:[0/1](22900/112548) loss:2.971 lr:0.0002484 epoch_Time:563.0min:
[2023-12-25 11:28:57,365][model5_pretrain.py][INFO] Epoch:[0/1](23000/112548) loss:3.023 lr:0.0002480 epoch_Time:563.0min:
[2023-12-25 11:29:35,083][model5_pretrain.py][INFO] Epoch:[0/1](23100/112548) loss:3.554 lr:0.0002475 epoch_Time:562.0min:
[2023-12-25 11:30:12,803][model5_pretrain.py][INFO] Epoch:[0/1](23200/112548) loss:3.680 lr:0.0002471 epoch_Time:562.0min:
[2023-12-25 11:30:50,515][model5_pretrain.py][INFO] Epoch:[0/1](23300/112548) loss:3.211 lr:0.0002466 epoch_Time:561.0min:
[2023-12-25 11:31:28,248][model5_pretrain.py][INFO] Epoch:[0/1](23400/112548) loss:3.410 lr:0.0002462 epoch_Time:560.0min:
[2023-12-25 11:32:05,976][model5_pretrain.py][INFO] Epoch:[0/1](23500/112548) loss:2.820 lr:0.0002457 epoch_Time:560.0min:
[2023-12-25 11:32:43,695][model5_pretrain.py][INFO] Epoch:[0/1](23600/112548) loss:3.111 lr:0.0002453 epoch_Time:559.0min:
[2023-12-25 11:33:21,416][model5_pretrain.py][INFO] Epoch:[0/1](23700/112548) loss:3.012 lr:0.0002448 epoch_Time:558.0min:
[2023-12-25 11:33:59,129][model5_pretrain.py][INFO] Epoch:[0/1](23800/112548) loss:3.241 lr:0.0002444 epoch_Time:558.0min:
[2023-12-25 11:34:36,838][model5_pretrain.py][INFO] Epoch:[0/1](23900/112548) loss:3.144 lr:0.0002439 epoch_Time:557.0min:
[2023-12-25 11:35:14,552][model5_pretrain.py][INFO] Epoch:[0/1](24000/112548) loss:3.033 lr:0.0002435 epoch_Time:557.0min:
[2023-12-25 11:35:52,267][model5_pretrain.py][INFO] Epoch:[0/1](24100/112548) loss:2.337 lr:0.0002430 epoch_Time:556.0min:
[2023-12-25 11:36:29,988][model5_pretrain.py][INFO] Epoch:[0/1](24200/112548) loss:2.880 lr:0.0002425 epoch_Time:555.0min:
[2023-12-25 11:37:07,711][model5_pretrain.py][INFO] Epoch:[0/1](24300/112548) loss:3.213 lr:0.0002421 epoch_T[2023-12-25 11[2023-12-25 11:37:45,402][model5_pretrain.py][INFO] Epoch:[0/1](24400/112548) loss:2.918 lr:0.0002416 epoch_Time:554.0min:
[2023-12-25 11:38:23,117][model5_pretrain.py][INFO] Epoch:[0/1](24500/112548) loss:2.336 lr:0.0002412 epoch_Time:553.0min:
[2023-12-25 11:39:00,822][model5_pretrain.py][INFO] Epoch:[0/1](24600/112548) loss:3.005 lr:0.0002407 epoch_Time:553.0min:
[2023-12-25 11:39:38,559][model5_pretrain.py][INFO] Epoch:[0/1](24700/112548) loss:3.039 lr:0.0002402 epoch_Time:552.0min:
[2023-12-25 11:40:16,270][model5_pretrain.py][INFO] Epoch:[0/1](24800/112548) loss:3.129 lr:0.0002398 epoch_Time:552.0min:
[2023-12-25 11:40:54,009][model5_pretrain.py][INFO] Epoch:[0/1](24900/112548) loss:3.157 lr:0.0002393 epoch_Time:551.0min:
[2023-12-25 11:41:31,728][model5_pretrain.py][INFO] Epoch:[0/1](25000/112548) loss:2.980 lr:0.0002388 epoch_Time:550.0min:
[2023-12-25 11:42:09,442][model5_pretrain.py][INFO] Epoch:[0/1](25100/112548) loss:3.031 lr:0.0002384 epoch_Time:550.0min:
[2023-12-25 11:42:47,194][model5_pretrain.py][INFO] Epoch:[0/1](25200/112548) loss:3.267 lr:0.0002379 epoch_Time:549.0min:
[2023-12-25 11:43:24,903][model5_pretrain.py][INFO] Epoch:[0/1](25300/112548) loss:2.803 lr:0.0002374 epoch_Time:548.0min:
[2023-12-25 11:44:02,634][model5_pretrain.py][INFO] Epoch:[0/1](25400/112548) loss:3.184 lr:0.0002369 epoch_Time:548.0min:
[2023-12-25 11:44:40,345][model5_pretrain.py][INFO] Epoch:[0/1](25500/112548) loss:3.161 lr:0.0002365 epoch_Time:547.0min:
[2023-12-25 11:45:18,067][model5_pretrain.py][INFO] Epoch:[0/1](25600/112548) loss:3.385 lr:0.0002360 epoch_Time:546.0min:
[2023-12-25 11:45:55,780][model5_pretrain.py][INFO] Epoch:[0/1](25700/112548) loss:3.078 lr:0.0002355 epoch_T[2023-12-25 11[2023-12-25 11:46:33,496][model5_pretrain.py][INFO] Epoch:[0/1](25800/112548) loss:3.402 lr:0.0002350 epoch_Time:545.0min:
[2023-12-25 11:47:11,223][model5_pretrain.py][INFO] Epoch:[0/1](25900/112548) loss:3.119 lr:0.0002345 epoch_T[2023-12-25 11[2023-12-25 11:47:48,939][model5_pretrain.py][INFO] Epoch:[0/1](26000/112548) loss:3.348 lr:0.0002341 epoch_Time:544.0min:
[2023-12-25 11:48:26,662][model5_pretrain.py][INFO] Epoch:[0/1](26100/112548) loss:3.673 lr:0.0002336 epoch_Time:543.0min:
[2023-12-25 11:49:04,382][model5_pretrain.py][INFO] Epoch:[0/1](26200/112548) loss:3.139 lr:0.0002331 epoch_Time:543.0min:
[2023-12-25 11:49:42,099][model5_pretrain.py][INFO] Epoch:[0/1](26300/112548) loss:2.911 lr:0.0002326 epoch_Time:542.0min:
[2023-12-25 11:50:19,821][model5_pretrain.py][INFO] Epoch:[0/1](26400/112548) loss:3.642 lr:0.0002321 epoch_Time:541.0min:
[2023-12-25 11:50:57,539][model5_pretrain.py][INFO] Epoch:[0/1](26500/112548) loss:3.352 lr:0.0002316 epoch_Time:541.0min:
[2023-12-25 11:51:35,262][model5_pretrain.py][INFO] Epoch:[0/1](26600/112548) loss:2.976 lr:0.0002311 epoch_Time:540.0min:
[2023-12-25 11:52:12,993][model5_pretrain.py][INFO] Epoch:[0/1](26700/112548) loss:2.667 lr:0.0002306 epoch_T[2023-12-25 11[2023-12-25 11:52:50,708][model5_pretrain.py][INFO] Epoch:[0/1](26800/112548) loss:3.035 lr:0.0002301 epoch_Time:539.0min:
[2023-12-25 11:53:28,427][model5_pretrain.py][INFO] Epoch:[0/1](26900/112548) loss:3.493 lr:0.0002297 epoch_Time:538.0min:
[2023-12-25 11:54:06,137][model5_pretrain.py][INFO] Epoch:[0/1](27000/112548) loss:3.231 lr:0.0002292 epoch_Time:538.0min:
[2023-12-25 11:54:43,857][model5_pretrain.py][INFO] Epoch:[0/1](27100/112548) loss:2.575 lr:0.0002287 epoch_Time:537.0min:
[2023-12-25 11:55:21,575][model5_pretrain.py][INFO] Epoch:[0/1](27200/112548) loss:3.602 lr:0.0002282 epoch_Time:536.0min:
[2023-12-25 11:55:59,295][model5_pretrain.py][INFO] Epoch:[0/1](27300/112548) loss:3.131 lr:0.0002277 epoch_Time:536.0min:
[2023-12-25 11:56:37,012][model5_pretrain.py][INFO] Epoch:[0/1](27400/112548) loss:3.427 lr:0.0002272 epoch_Time:535.0min:
[2023-12-25 11:57:14,727][model5_pretrain.py][INFO] Epoch:[0/1](27500/112548) loss:2.122 lr:0.0002267 epoch_Time:535.0min:
[2023-12-25 11:57:52,442][model5_pretrain.py][INFO] Epoch:[0/1](27600/112548) loss:3.326 lr:0.0002262 epoch_Time:534.0min:
[2023-12-25 11:58:30,174][model5_pretrain.py][INFO] Epoch:[0/1](27700/112548) loss:2.650 lr:0.0002257 epoch_Time:533.0min:
[2023-12-25 11:59:07,920][model5_pretrain.py][INFO] Epoch:[0/1](27800/112548) loss:2.818 lr:0.0002252 epoch_Time:533.0min:
[2023-12-25 11:59:45,640][model5_pretrain.py][INFO] Epoch:[0/1](27900/112548) loss:2.778 lr:0.0002247 epoch_Time:532.0min:
[2023-12-25 12:00:23,361][model5_pretrain.py][INFO] Epoch:[0/1](28000/112548) loss:3.011 lr:0.0002241 epoch_Time:531.0min:
[2023-12-25 12:01:01,083][model5_pretrain.py][INFO] Epoch:[0/1](28100/112548) loss:2.706 lr:0.0002236 epoch_Time:531.0min:
[2023-12-25 12:01:38,802][model5_pretrain.py][INFO] Epoch:[0/1](28200/112548) loss:3.398 lr:0.0002231 epoch_Time:530.0min:
[2023-12-25 12:02:16,529][model5_pretrain.py][INFO] Epoch:[0/1](28300/112548) loss:1.723 lr:0.0002226 epoch_Time:530.0min:
[2023-12-25 12:02:54,255][model5_pretrain.py][INFO] Epoch:[0/1](28400/112548) loss:2.876 lr:0.0002221 epoch_Time:529.0min:
[2023-12-25 12:03:31,972][model5_pretrain.py][INFO] Epoch:[0/1](28500/112548) loss:3.256 lr:0.0002216 epoch_Time:528.0min:
[2023-12-25 12:04:09,701][model5_pretrain.py][INFO] Epoch:[0/1](28600/112548) loss:3.867 lr:0.0002211 epoch_Time:528.0min:
[2023-12-25 12:04:47,426][model5_pretrain.py][INFO] Epoch:[0/1](28700/112548) loss:3.273 lr:0.0002206 epoch_Time:527.0min:
[2023-12-25 12:05:25,143][model5_pretrain.py][INFO] Epoch:[0/1](28800/112548) loss:3.335 lr:0.0002201 epoch_Time:526.0min:
[2023-12-25 12:06:02,782][model5_pretrain.py][INFO] Epoch:[0/1](28900/112548) loss:2.581 lr:0.0002195 epoch_Time:526.0min:
[2023-12-25 12:06:40,486][model5_pretrain.py][INFO] Epoch:[0/1](29000/112548) loss:2.996 lr:0.0002190 epoch_Time:525.0min:
[2023-12-25 12:07:18,198][model5_pretrain.py][INFO] Epoch:[0/1](29100/112548) loss:2.786 lr:0.0002185 epoch_Time:524.0min:
[2023-12-25 12:07:55,900][model5_pretrain.py][INFO] Epoch:[0/1](29200/112548) loss:2.703 lr:0.0002180 epoch_Time:524.0min:
[2023-12-25 12:08:33,591][model5_pretrain.py][INFO] Epoch:[0/1](29300/112548) loss:3.559 lr:0.0002175 epoch_Time:523.0min:
[2023-12-25 12:09:11,289][model5_pretrain.py][INFO] Epoch:[0/1](29400/112548) loss:2.770 lr:0.0002169 epoch_Time:523.0min:
[2023-12-25 12:09:48,998][model5_pretrain.py][INFO] Epoch:[0/1](29500/112548) loss:3.495 lr:0.0002164 epoch_Time:522.0min:
[2023-12-25 12:10:26,707][model5_pretrain.py][INFO] Epoch:[0/1](29600/112548) loss:2.962 lr:0.0002159 epoch_Time:521.0min:
[2023-12-25 12:11:04,415][model5_pretrain.py][INFO] Epoch:[0/1](29700/112548) loss:2.870 lr:0.0002154 epoch_Time:521.0min:
[2023-12-25 12:11:42,111][model5_pretrain.py][INFO] Epoch:[0/1](29800/112548) loss:2.854 lr:0.0002149 epoch_Time:520.0min:
[2023-12-25 12:12:19,827][model5_pretrain.py][INFO] Epoch:[0/1](29900/112548) loss:3.283 lr:0.0002143 epoch_Time:519.0min:
[2023-12-25 12:12:57,522][model5_pretrain.py][INFO] Epoch:[0/1](30000/112548) loss:3.222 lr:0.0002138 epoch_Time:519.0min:
[2023-12-25 12:13:35,221][model5_pretrain.py][INFO] Epoch:[0/1](30100/112548) loss:2.811 lr:0.0002133 epoch_Time:518.0min:
[2023-12-25 12:14:12,926][model5_pretrain.py][INFO] Epoch:[0/1](30200/112548) loss:2.889 lr:0.0002127 epoch_Time:518.0min:
[2023-12-25 12:14:50,630][model5_pretrain.py][INFO] Epoch:[0/1](30300/112548) loss:2.803 lr:0.0002122 epoch_Time:517.0min:
[2023-12-25 12:15:28,339][model5_pretrain.py][INFO] Epoch:[0/1](30400/112548) loss:2.977 lr:0.0002117 epoch_Time:516.0min:
[2023-12-25 12:16:06,042][model5_pretrain.py][INFO] Epoch:[0/1](30500/112548) loss:3.568 lr:0.0002112 epoch_Time:516.0min:
[2023-12-25 12:16:43,751][model5_pretrain.py][INFO] Epoch:[0/1](30600/112548) loss:2.722 lr:0.0002106 epoch_Time:515.0min:
[2023-12-25 12:17:21,455][model5_pretrain.py][INFO] Epoch:[0/1](30700/112548) loss:2.802 lr:0.0002101 epoch_Time:514.0min:
[2023-12-25 12:17:59,160][model5_pretrain.py][INFO] Epoch:[0/1](30800/112548) loss:3.024 lr:0.0002096 epoch_Time:514.0min:
[2023-12-25 12:18:36,864][model5_pretrain.py][INFO] Epoch:[0/1](30900/112548) loss:3.619 lr:0.0002090 epoch_Time:513.0min:
[2023-12-25 12:19:14,537][model5_pretrain.py][INFO] Epoch:[0/1](31000/112548) loss:2.991 lr:0.0002085 epoch_Time:513.0min:
[2023-12-25 12:19:52,258][model5_pretrain.py][INFO] Epoch:[0/1](31100/112548) loss:2.824 lr:0.0002079 epoch_Time:512.0min:
[2023-12-25 12:20:29,965][model5_pretrain.py][INFO] Epoch:[0/1](31200/112548) loss:3.084 lr:0.0002074 epoch_Time:511.0min:
[2023-12-25 12:21:07,664][model5_pretrain.py][INFO] Epoch:[0/1](31300/112548) loss:3.540 lr:0.0002069 epoch_Time:511.0min:
[2023-12-25 12:21:45,364][model5_pretrain.py][INFO] Epoch:[0/1](31400/112548) loss:2.671 lr:0.0002063 epoch_Time:510.0min:
[2023-12-25 12:22:23,085][model5_pretrain.py][INFO] Epoch:[0/1](31500/112548) loss:3.330 lr:0.0002058 epoch_Time:509.0min:
[2023-12-25 12:23:00,782][model5_pretrain.py][INFO] Epoch:[0/1](31600/112548) loss:3.528 lr:0.0002053 epoch_Time:509.0min:
[2023-12-25 12:23:38,489][model5_pretrain.py][INFO] Epoch:[0/1](31700/112548) loss:2.596 lr:0.0002047 epoch_Time:508.0min:
[2023-12-25 12:24:16,191][model5_pretrain.py][INFO] Epoch:[0/1](31800/112548) loss:2.773 lr:0.0002042 epoch_Time:508.0min:
[2023-12-25 12:24:53,899][model5_pretrain.py][INFO] Epoch:[0/1](31900/112548) loss:2.954 lr:0.0002036 epoch_Time:507.0min:
[2023-12-25 12:25:31,599][model5_pretrain.py][INFO] Epoch:[0/1](32000/112548) loss:3.154 lr:0.0002031 epoch_Time:506.0min:
[2023-12-25 12:26:09,307][model5_pretrain.py][INFO] Epoch:[0/1](32100/112548) loss:3.514 lr:0.0002025 epoch_Time:506.0min:
[2023-12-25 12:26:47,015][model5_pretrain.py][INFO] Epoch:[0/1](32200/112548) loss:2.915 lr:0.0002020 epoch_Time:505.0min:
[2023-12-25 12:27:24,714][model5_pretrain.py][INFO] Epoch:[0/1](32300/112548) loss:3.036 lr:0.0002014 epoch_Time:504.0min:
[2023-12-25 12:28:02,410][model5_pretrain.py][INFO] Epoch:[0/1](32400/112548) loss:2.719 lr:0.0002009 epoch_Time:504.0min:
[2023-12-25 12:28:40,111][model5_pretrain.py][INFO] Epoch:[0/1](32500/112548) loss:3.186 lr:0.0002004 epoch_Time:503.0min:
[2023-12-25 12:29:17,831][model5_pretrain.py][INFO] Epoch:[0/1](32600/112548) loss:3.325 lr:0.0001998 epoch_Time:502.0min:
[2023-12-25 12:29:55,541][model5_pretrain.py][INFO] Epoch:[0/1](32700/112548) loss:2.790 lr:0.0001993 epoch_Time:502.0min:
[2023-12-25 12:30:33,239][model5_pretrain.py][INFO] Epoch:[0/1](32800/112548) loss:2.798 lr:0.0001987 epoch_Time:501.0min:
[2023-12-25 12:31:10,947][model5_pretrain.py][INFO] Epoch:[0/1](32900/112548) loss:3.357 lr:0.0001982 epoch_Time:501.0min:
[2023-12-25 12:31:48,653][model5_pretrain.py][INFO] Epoch:[0/1](33000/112548) loss:3.310 lr:0.0001976 epoch_Time:500.0min:
[2023-12-25 12:32:26,355][model5_pretrain.py][INFO] Epoch:[0/1](33100/112548) loss:3.532 lr:0.0001971 epoch_Time:499.0min:
[2023-12-25 12:33:04,061][model5_pretrain.py][INFO] Epoch:[0/1](33200/112548) loss:2.392 lr:0.0001965 epoch_Time:499.0min:
[2023-12-25 12:33:41,767][model5_pretrain.py][INFO] Epoch:[0/1](33300/112548) loss:2.862 lr:0.0001960 epoch_Time:498.0min:
[2023-12-25 12:34:19,476][model5_pretrain.py][INFO] Epoch:[0/1](33400/112548) loss:2.981 lr:0.0001954 epoch_Time:497.0min:
[2023-12-25 12:34:57,185][model5_pretrain.py][INFO] Epoch:[0/1](33500/112548) loss:3.069 lr:0.0001948 epoch_Time:497.0min:
[2023-12-25 12:35:34,888][model5_pretrain.py][INFO] Epoch:[0/1](33600/112548) loss:3.050 lr:0.0001943 epoch_Time:496.0min:
[2023-12-25 12:36:12,587][model5_pretrain.py][INFO] Epoch:[0/1](33700/112548) loss:3.385 lr:0.0001937 epoch_Time:496.0min:
[2023-12-25 12:36:50,283][model5_pretrain.py][INFO] Epoch:[0/1](33800/112548) loss:2.422 lr:0.0001932 epoch_T[2023-12-25 12[2023-12-25 12:37:27,991][model5_pretrain.py][INFO] Epoch:[0/1](33900/112548) loss:3.464 lr:0.0001926 epoch_Time:494.0min:
[2023-12-25 12:38:05,696][model5_pretrain.py][INFO] Epoch:[0/1](34000/112548) loss:2.226 lr:0.0001921 epoch_Time:494.0min:
[2023-12-25 12:38:43,400][model5_pretrain.py][INFO] Epoch:[0/1](34100/112548) loss:2.950 lr:0.0001915 epoch_Time:493.0min:
[2023-12-25 12:39:21,102][model5_pretrain.py][INFO] Epoch:[0/1](34200/112548) loss:2.428 lr:0.0001909 epoch_Time:492.0min:
[2023-12-25 12:39:58,815][model5_pretrain.py][INFO] Epoch:[0/1](34300/112548) loss:3.063 lr:0.0001904 epoch_Time:492.0min:
[2023-12-25 12:40:36,524][model5_pretrain.py][INFO] Epoch:[0/1](34400/112548) loss:3.442 lr:0.0001898 epoch_Time:491.0min:
[2023-12-25 12:41:14,232][model5_pretrain.py][INFO] Epoch:[0/1](34500/112548) loss:2.619 lr:0.0001893 epoch_Time:491.0min:
[2023-12-25 12:41:51,939][model5_pretrain.py][INFO] Epoch:[0/1](34600/112548) loss:3.206 lr:0.0001887 epoch_Time:490.0min:
[2023-12-25 12:42:29,644][model5_pretrain.py][INFO] Epoch:[0/1](34700/112548) loss:2.198 lr:0.0001881 epoch_Time:489.0min:
[2023-12-25 12:43:07,350][model5_pretrain.py][INFO] Epoch:[0/1](34800/112548) loss:2.594 lr:0.0001876 epoch_Time:489.0min:
[2023-12-25 12:43:45,051][model5_pretrain.py][INFO] Epoch:[0/1](34900/112548) loss:2.862 lr:0.0001870 epoch_Time:488.0min:
[2023-12-25 12:44:22,763][model5_pretrain.py][INFO] Epoch:[0/1](35000/112548) loss:2.789 lr:0.0001865 epoch_Time:487.0min:
[2023-12-25 12:45:00,469][model5_pretrain.py][INFO] Epoch:[0/1](35100/112548) loss:3.019 lr:0.0001859 epoch_Time:487.0min:
[2023-12-25 12:45:38,182][model5_pretrain.py][INFO] Epoch:[0/1](35200/112548) loss:3.088 lr:0.0001853 epoch_Time:486.0min:
[2023-12-25 12:46:15,895][model5_pretrain.py][INFO] Epoch:[0/1](35300/112548) loss:3.450 lr:0.0001848 epoch_Time:486.0min:
[2023-12-25 12:46:53,617][model5_pretrain.py][INFO] Epoch:[0/1](35400/112548) loss:2.885 lr:0.0001842 epoch_Time:485.0min:
[2023-12-25 12:47:31,323][model5_pretrain.py][INFO] Epoch:[0/1](35500/112548) loss:2.972 lr:0.0001836 epoch_Time:484.0min:
[2023-12-25 12:48:09,034][model5_pretrain.py][INFO] Epoch:[0/1](35600/112548) loss:2.367 lr:0.0001831 epoch_Time:484.0min:
[2023-12-25 12:48:46,746][model5_pretrain.py][INFO] Epoch:[0/1](35700/112548) loss:3.418 lr:0.0001825 epoch_Time:483.0min:
[2023-12-25 12:49:24,461][model5_pretrain.py][INFO] Epoch:[0/1](35800/112548) loss:2.978 lr:0.0001819 epoch_Time:482.0min:
[2023-12-25 12:50:02,171][model5_pretrain.py][INFO] Epoch:[0/1](35900/112548) loss:3.277 lr:0.0001814 epoch_Time:482.0min:
[2023-12-25 12:50:39,882][model5_pretrain.py][INFO] Epoch:[0/1](36000/112548) loss:2.850 lr:0.0001808 epoch_Time:481.0min:
[2023-12-25 12:51:17,564][model5_pretrain.py][INFO] Epoch:[0/1](36100/112548) loss:3.378 lr:0.0001802 epoch_Time:480.0min:
[2023-12-25 12:51:55,280][model5_pretrain.py][INFO] Epoch:[0/1](36200/112548) loss:3.067 lr:0.0001797 epoch_Time:480.0min:
[2023-12-25 12:52:32,991][model5_pretrain.py][INFO] Epoch:[0/1](36300/112548) loss:2.221 lr:0.0001791 epoch_Time:479.0min:
[2023-12-25 12:53:10,693][model5_pretrain.py][INFO] Epoch:[0/1](36400/112548) loss:2.938 lr:0.0001785 epoch_Time:479.0min:
[2023-12-25 12:53:48,404][model5_pretrain.py][INFO] Epoch:[0/1](36500/112548) loss:3.075 lr:0.0001780 epoch_Time:478.0min:
[2023-12-25 12:54:26,113][model5_pretrain.py][INFO] Epoch:[0/1](36600/112548) loss:3.456 lr:0.0001774 epoch_Time:477.0min:
[2023-12-25 12:55:03,826][model5_pretrain.py][INFO] Epoch:[0/1](36700/112548) loss:2.674 lr:0.0001768 epoch_Time:477.0min:
[2023-12-25 12:55:41,527][model5_pretrain.py][INFO] Epoch:[0/1](36800/112548) loss:2.914 lr:0.0001763 epoch_Time:476.0min:
[2023-12-25 12:56:19,241][model5_pretrain.py][INFO] Epoch:[0/1](36900/112548) loss:2.843 lr:0.0001757 epoch_Time:475.0min:
[2023-12-25 12:56:56,948][model5_pretrain.py][INFO] Epoch:[0/1](37000/112548) loss:2.460 lr:0.0001751 epoch_Time:475.0min:
[2023-12-25 12:57:34,649][model5_pretrain.py][INFO] Epoch:[0/1](37100/112548) loss:3.254 lr:0.0001745 epoch_Time:474.0min:
[2023-12-25 12:58:12,346][model5_pretrain.py][INFO] Epoch:[0/1](37200/112548) loss:2.653 lr:0.0001740 epoch_Time:474.0min:
[2023-12-25 12:58:50,063][model5_pretrain.py][INFO] Epoch:[0/1](37300/112548) loss:3.494 lr:0.0001734 epoch_Time:473.0min:
[2023-12-25 12:59:27,775][model5_pretrain.py][INFO] Epoch:[0/1](37400/112548) loss:2.600 lr:0.0001728 epoch_Time:472.0min:
[2023-12-25 13:00:05,476][model5_pretrain.py][INFO] Epoch:[0/1](37500/112548) loss:3.001 lr:0.0001723 epoch_Time:472.0min:
[2023-12-25 13:00:43,178][model5_pretrain.py][INFO] Epoch:[0/1](37600/112548) loss:2.807 lr:0.0001717 epoch_Time:471.0min:
[2023-12-25 13:01:20,889][model5_pretrain.py][INFO] Epoch:[0/1](37700/112548) loss:3.367 lr:0.0001711 epoch_T[2023-12-25 13[2023-12-25 13:01:58,594][model5_pretrain.py][INFO] Epoch:[0/1](37800/112548) loss:3.056 lr:0.0001705 epoch_T[2023-12-25 13[2023-12-25 13:02:36,297][model5_pretrain.py][INFO] Epoch:[0/1](37900/112548) loss:3.112 lr:0.0001700 epoch_Time:469.0min:
[2023-12-25 13:03:14,005][model5_pretrain.py][INFO] Epoch:[0/1](38000/112548) loss:3.372 lr:0.0001694 epoch_Time:469.0min:
[2023-12-25 13:03:51,704][model5_pretrain.py][INFO] Epoch:[0/1](38100/112548) loss:3.354 lr:0.0001688 epoch_Time:468.0min:
[2023-12-25 13:04:29,414][model5_pretrain.py][INFO] Epoch:[0/1](38200/112548) loss:2.325 lr:0.0001682 epoch_Time:467.0min:
[2023-12-25 13:05:07,123][model5_pretrain.py][INFO] Epoch:[0/1](38300/112548) loss:2.851 lr:0.0001677 epoch_Time:467.0min:
[2023-12-25 13:05:44,824][model5_pretrain.py][INFO] Epoch:[0/1](38400/112548) loss:3.157 lr:0.0001671 epoch_Time:466.0min:
[2023-12-25 13:06:22,535][model5_pretrain.py][INFO] Epoch:[0/1](38500/112548) loss:2.893 lr:0.0001665 epoch_Time:465.0min:
[2023-12-25 13:07:00,248][model5_pretrain.py][INFO] Epoch:[0/1](38600/112548) loss:2.955 lr:0.0001659 epoch_Time:465.0min:
[2023-12-25 13:07:37,954][model5_pretrain.py][INFO] Epoch:[0/1](38700/112548) loss:2.854 lr:0.0001654 epoch_Time:464.0min:
[2023-12-25 13:08:15,659][model5_pretrain.py][INFO] Epoch:[0/1](38800/112548) loss:3.035 lr:0.0001648 epoch_Time:464.0min:
[2023-12-25 13:08:53,362][model5_pretrain.py][INFO] Epoch:[0/1](38900/112548) loss:2.809 lr:0.0001642 epoch_Time:463.0min:
[2023-12-25 13:09:31,058][model5_pretrain.py][INFO] Epoch:[0/1](39000/112548) loss:3.936 lr:0.0001636 epoch_Time:462.0min:
[2023-12-25 13:10:08,769][model5_pretrain.py][INFO] Epoch:[0/1](39100/112548) loss:2.861 lr:0.0001631 epoch_Time:462.0min:
[2023-12-25 13:10:46,439][model5_pretrain.py][INFO] Epoch:[0/1](39200/112548) loss:2.982 lr:0.0001625 epoch_Time:461.0min:
[2023-12-25 13:11:24,143][model5_pretrain.py][INFO] Epoch:[0/1](39300/112548) loss:2.731 lr:0.0001619 epoch_Time:460.0min:
[2023-12-25 13:12:01,834][model5_pretrain.py][INFO] Epoch:[0/1](39400/112548) loss:3.391 lr:0.0001613 epoch_Time:460.0min:
[2023-12-25 13:12:39,551][model5_pretrain.py][INFO] Epoch:[0/1](39500/112548) loss:2.543 lr:0.0001608 epoch_T[2023-12-25 13[2023-12-25 13:13:17,264][model5_pretrain.py][INFO] Epoch:[0/1](39600/112548) loss:3.293 lr:0.0001602 epoch_T[2023-12-25 13[2023-12-25 13:13:54,970][model5_pretrain.py][INFO] Epoch:[0/1](39700/112548) loss:2.697 lr:0.0001596 epoch_Time:458.0min:
[2023-12-25 13:14:32,677][model5_pretrain.py][INFO] Epoch:[0/1](39800/112548) loss:3.057 lr:0.0001590 epoch_Time:457.0min:
[2023-12-25 13:15:10,394][model5_pretrain.py][INFO] Epoch:[0/1](39900/112548) loss:3.220 lr:0.0001585 epoch_Time:457.0min:
[2023-12-25 13:15:48,244][model5_pretrain.py][INFO] Epoch:[0/1](40000/112548) loss:3.493 lr:0.0001579 epoch_Time:456.0min:
[2023-12-25 13:16:31,617][model5_pretrain.py][INFO] Epoch:[0/1](40100/112548) loss:3.258 lr:0.0001573 epoch_Time:455.0min:
[2023-12-25 13:17:09,328][model5_pretrain.py][INFO] Epoch:[0/1](40200/112548) loss:2.862 lr:0.0001567 epoch_Time:455.0min:
[2023-12-25 13:17:47,046][model5_pretrain.py][INFO] Epoch:[0/1](40300/112548) loss:3.068 lr:0.0001562 epoch_Time:454.0min:
[2023-12-25 13:18:24,777][model5_pretrain.py][INFO] Epoch:[0/1](40400/112548) loss:2.872 lr:0.0001556 epoch_Time:453.0min:
[2023-12-25 13:19:02,501][model5_pretrain.py][INFO] Epoch:[0/1](40500/112548) loss:3.170 lr:0.0001550 epoch_Time:453.0min:
[2023-12-25 13:19:40,227][model5_pretrain.py][INFO] Epoch:[0/1](40600/112548) loss:2.784 lr:0.0001544 epoch_Time:452.0min:
[2023-12-25 13:20:17,955][model5_pretrain.py][INFO] Epoch:[0/1](40700/112548) loss:3.049 lr:0.0001538 epoch_Time:451.0min:
[2023-12-25 13:20:55,690][model5_pretrain.py][INFO] Epoch:[0/1](40800/112548) loss:3.486 lr:0.0001533 epoch_Time:451.0min:
[2023-12-25 13:21:33,408][model5_pretrain.py][INFO] Epoch:[0/1](40900/112548) loss:2.408 lr:0.0001527 epoch_Time:450.0min:
[2023-12-25 13:22:11,135][model5_pretrain.py][INFO] Epoch:[0/1](41000/112548) loss:2.083 lr:0.0001521 epoch_Time:450.0min:
[2023-12-25 13:22:48,861][model5_pretrain.py][INFO] Epoch:[0/1](41100/112548) loss:2.426 lr:0.0001515 epoch_Time:449.0min:
[2023-12-25 13:23:26,595][model5_pretrain.py][INFO] Epoch:[0/1](41200/112548) loss:2.441 lr:0.0001510 epoch_Time:448.0min:
[2023-12-25 13:24:04,327][model5_pretrain.py][INFO] Epoch:[0/1](41300/112548) loss:2.785 lr:0.0001504 epoch_Time:448.0min:
[2023-12-25 13:24:42,060][model5_pretrain.py][INFO] Epoch:[0/1](41400/112548) loss:2.762 lr:0.0001498 epoch_Time:447.0min:
[2023-12-25 13:25:19,796][model5_pretrain.py][INFO] Epoch:[0/1](41500/112548) loss:3.067 lr:0.0001492 epoch_Time:446.0min:
[2023-12-25 13:25:57,528][model5_pretrain.py][INFO] Epoch:[0/1](41600/112548) loss:2.790 lr:0.0001487 epoch_Time:446.0min:
[2023-12-25 13:26:35,260][model5_pretrain.py][INFO] Epoch:[0/1](41700/112548) loss:2.845 lr:0.0001481 epoch_Time:445.0min:
[2023-12-25 13:27:12,995][model5_pretrain.py][INFO] Epoch:[0/1](41800/112548) loss:2.910 lr:0.0001475 epoch_Time:445.0min:
[2023-12-25 13:27:50,736][model5_pretrain.py][INFO] Epoch:[0/1](41900/112548) loss:2.929 lr:0.0001469 epoch_Time:444.0min:
[2023-12-25 13:28:28,441][model5_pretrain.py][INFO] Epoch:[0/1](42000/112548) loss:2.754 lr:0.0001464 epoch_Time:443.0min:
[2023-12-25 13:29:06,173][model5_pretrain.py][INFO] Epoch:[0/1](42100/112548) loss:2.476 lr:0.0001458 epoch_Time:443.0min:
[2023-12-25 13:29:43,896][model5_pretrain.py][INFO] Epoch:[0/1](42200/112548) loss:3.113 lr:0.0001452 epoch_Time:442.0min:
[2023-12-25 13:30:21,616][model5_pretrain.py][INFO] Epoch:[0/1](42300/112548) loss:3.029 lr:0.0001446 epoch_Time:441.0min:
[2023-12-25 13:30:59,345][model5_pretrain.py][INFO] Epoch:[0/1](42400/112548) loss:3.457 lr:0.0001441 epoch_Time:441.0min:
[2023-12-25 13:31:37,062][model5_pretrain.py][INFO] Epoch:[0/1](42500/112548) loss:2.626 lr:0.0001435 epoch_Time:440.0min:
[2023-12-25 13:32:14,788][model5_pretrain.py][INFO] Epoch:[0/1](42600/112548) loss:2.976 lr:0.0001429 epoch_Time:440.0min:
[2023-12-25 13:32:52,518][model5_pretrain.py][INFO] Epoch:[0/1](42700/112548) loss:3.268 lr:0.0001423 epoch_Time:439.0min:
[2023-12-25 13:33:30,246][model5_pretrain.py][INFO] Epoch:[0/1](42800/112548) loss:3.108 lr:0.0001418 epoch_Time:438.0min:
[2023-12-25 13:34:07,974][model5_pretrain.py][INFO] Epoch:[0/1](42900/112548) loss:3.129 lr:0.0001412 epoch_Time:438.0min:
[2023-12-25 13:34:45,699][model5_pretrain.py][INFO] Epoch:[0/1](43000/112548) loss:2.657 lr:0.0001406 epoch_Time:437.0min:
[2023-12-25 13:35:23,421][model5_pretrain.py][INFO] Epoch:[0/1](43100/112548) loss:2.602 lr:0.0001400 epoch_Time:436.0min:
[2023-12-25 13:36:01,151][model5_pretrain.py][INFO] Epoch:[0/1](43200/112548) loss:3.697 lr:0.0001395 epoch_Time:436.0min:
[2023-12-25 13:36:38,882][model5_pretrain.py][INFO] Epoch:[0/1](43300/112548) loss:3.116 lr:0.0001389 epoch_Time:435.0min:
[2023-12-25 13:37:16,605][model5_pretrain.py][INFO] Epoch:[0/1](43400/112548) loss:2.676 lr:0.0001383 epoch_Time:435.0min:
[2023-12-25 13:37:54,327][model5_pretrain.py][INFO] Epoch:[0/1](43500/112548) loss:3.346 lr:0.0001377 epoch_Time:434.0min:
[2023-12-25 13:38:32,050][model5_pretrain.py][INFO] Epoch:[0/1](43600/112548) loss:2.882 lr:0.0001372 epoch_Time:433.0min:
[2023-12-25 13:39:09,773][model5_pretrain.py][INFO] Epoch:[0/1](43700/112548) loss:3.211 lr:0.0001366 epoch_Time:433.0min:
[2023-12-25 13:39:47,500][model5_pretrain.py][INFO] Epoch:[0/1](43800/112548) loss:2.498 lr:0.0001360 epoch_Time:432.0min:
[2023-12-25 13:40:25,352][model5_pretrain.py][INFO] Epoch:[0/1](43900/112548) loss:3.014 lr:0.0001355 epoch_Time:431.0min:
[2023-12-25 13:41:03,079][model5_pretrain.py][INFO] Epoch:[0/1](44000/112548) loss:2.613 lr:0.0001349 epoch_Time:431.0min:
[2023-12-25 13:41:40,801][model5_pretrain.py][INFO] Epoch:[0/1](44100/112548) loss:3.138 lr:0.0001343 epoch_Time:430.0min:
[2023-12-25 13:42:18,536][model5_pretrain.py][INFO] Epoch:[0/1](44200/112548) loss:3.472 lr:0.0001337 epoch_Time:429.0min:
[2023-12-25 13:42:56,261][model5_pretrain.py][INFO] Epoch:[0/1](44300/112548) loss:2.087 lr:0.0001332 epoch_Time:429.0min:
[2023-12-25 13:43:33,985][model5_pretrain.py][INFO] Epoch:[0/1](44400/112548) loss:2.861 lr:0.0001326 epoch_Time:428.0min:
[2023-12-25 13:44:11,706][model5_pretrain.py][INFO] Epoch:[0/1](44500/112548) loss:2.690 lr:0.0001320 epoch_Time:428.0min:
[2023-12-25 13:44:49,428][model5_pretrain.py][INFO] Epoch:[0/1](44600/112548) loss:2.902 lr:0.0001315 epoch_Time:427.0min:
[2023-12-25 13:45:27,268][model5_pretrain.py][INFO] Epoch:[0/1](44700/112548) loss:2.991 lr:0.0001309 epoch_Time:426.0min:
[2023-12-25 13:46:04,980][model5_pretrain.py][INFO] Epoch:[0/1](44800/112548) loss:3.422 lr:0.0001303 epoch_Time:426.0min:
[2023-12-25 13:46:42,674][model5_pretrain.py][INFO] Epoch:[0/1](44900/112548) loss:3.146 lr:0.0001298 epoch_Time:425.0min:
[2023-12-25 13:47:20,350][model5_pretrain.py][INFO] Epoch:[0/1](45000/112548) loss:3.287 lr:0.0001292 epoch_Time:424.0min:
[2023-12-25 13:47:58,053][model5_pretrain.py][INFO] Epoch:[0/1](45100/112548) loss:3.018 lr:0.0001286 epoch_Time:424.0min:
[2023-12-25 13:48:35,752][model5_pretrain.py][INFO] Epoch:[0/1](45200/112548) loss:2.848 lr:0.0001281 epoch_Time:423.0min:
[2023-12-25 13:49:13,453][model5_pretrain.py][INFO] Epoch:[0/1](45300/112548) loss:3.569 lr:0.0001275 epoch_Time:423.0min:
[2023-12-25 13:49:51,147][model5_pretrain.py][INFO] Epoch:[0/1](45400/112548) loss:3.429 lr:0.0001269 epoch_Time:422.0min:
[2023-12-25 13:50:28,845][model5_pretrain.py][INFO] Epoch:[0/1](45500/112548) loss:2.714 lr:0.0001264 epoch_Time:421.0min:
[2023-12-25 13:51:06,553][model5_pretrain.py][INFO] Epoch:[0/1](45600/112548) loss:2.865 lr:0.0001258 epoch_Time:421.0min:
[2023-12-25 13:51:44,250][model5_pretrain.py][INFO] Epoch:[0/1](45700/112548) loss:2.880 lr:0.0001252 epoch_Time:420.0min:
[2023-12-25 13:52:21,965][model5_pretrain.py][INFO] Epoch:[0/1](45800/112548) loss:2.781 lr:0.0001247 epoch_Time:419.0min:
[2023-12-25 13:52:59,681][model5_pretrain.py][INFO] Epoch:[0/1](45900/112548) loss:3.398 lr:0.0001241 epoch_Time:419.0min:
[2023-12-25 13:53:37,387][model5_pretrain.py][INFO] Epoch:[0/1](46000/112548) loss:3.046 lr:0.0001235 epoch_Time:418.0min:
[2023-12-25 13:54:15,096][model5_pretrain.py][INFO] Epoch:[0/1](46100/112548) loss:2.622 lr:0.0001230 epoch_Time:418.0min:
[2023-12-25 13:54:52,804][model5_pretrain.py][INFO] Epoch:[0/1](46200/112548) loss:2.842 lr:0.0001224 epoch_Time:417.0min:
[2023-12-25 13:55:30,512][model5_pretrain.py][INFO] Epoch:[0/1](46300/112548) loss:2.986 lr:0.0001219 epoch_Time:416.0min:
[2023-12-25 13:56:08,216][model5_pretrain.py][INFO] Epoch:[0/1](46400/112548) loss:2.824 lr:0.0001213 epoch_Time:416.0min:
[2023-12-25 13:56:45,925][model5_pretrain.py][INFO] Epoch:[0/1](46500/112548) loss:2.927 lr:0.0001207 epoch_Time:415.0min:
[2023-12-25 13:57:23,645][model5_pretrain.py][INFO] Epoch:[0/1](46600/112548) loss:2.519 lr:0.0001202 epoch_Time:414.0min:
[2023-12-25 13:58:01,347][model5_pretrain.py][INFO] Epoch:[0/1](46700/112548) loss:2.762 lr:0.0001196 epoch_Time:414.0min:
[2023-12-25 13:58:39,047][model5_pretrain.py][INFO] Epoch:[0/1](46800/112548) loss:2.291 lr:0.0001191 epoch_Time:413.0min:
[2023-12-25 13:59:16,757][model5_pretrain.py][INFO] Epoch:[0/1](46900/112548) loss:2.891 lr:0.0001185 epoch_Time:413.0min:
[2023-12-25 13:59:54,467][model5_pretrain.py][INFO] Epoch:[0/1](47000/112548) loss:3.455 lr:0.0001179 epoch_Time:412.0min:
[2023-12-25 14:00:32,177][model5_pretrain.py][INFO] Epoch:[0/1](47100/112548) loss:3.031 lr:0.0001174 epoch_Time:411.0min:
[2023-12-25 14:01:09,888][model5_pretrain.py][INFO] Epoch:[0/1](47200/112548) loss:2.823 lr:0.0001168 epoch_Time:411.0min:
[2023-12-25 14:01:47,595][model5_pretrain.py][INFO] Epoch:[0/1](47300/112548) loss:2.642 lr:0.0001163 epoch_Time:410.0min:
[2023-12-25 14:02:25,310][model5_pretrain.py][INFO] Epoch:[0/1](47400/112548) loss:3.232 lr:0.0001157 epoch_Time:409.0min:
[2023-12-25 14:03:02,986][model5_pretrain.py][INFO] Epoch:[0/1](47500/112548) loss:3.071 lr:0.0001152 epoch_Time:409.0min:
[2023-12-25 14:03:40,691][model5_pretrain.py][INFO] Epoch:[0/1](47600/112548) loss:3.055 lr:0.0001146 epoch_Time:408.0min:
[2023-12-25 14:04:18,402][model5_pretrain.py][INFO] Epoch:[0/1](47700/112548) loss:3.099 lr:0.0001140 epoch_Time:407.0min:
[2023-12-25 14:04:56,106][model5_pretrain.py][INFO] Epoch:[0/1](47800/112548) loss:3.130 lr:0.0001135 epoch_Time:407.0min:
[2023-12-25 14:05:33,808][model5_pretrain.py][INFO] Epoch:[0/1](47900/112548) loss:2.200 lr:0.0001129 epoch_Time:406.0min:
[2023-12-25 14:06:11,511][model5_pretrain.py][INFO] Epoch:[0/1](48000/112548) loss:2.807 lr:0.0001124 epoch_Time:406.0min:
[2023-12-25 14:06:49,227][model5_pretrain.py][INFO] Epoch:[0/1](48100/112548) loss:3.447 lr:0.0001118 epoch_Time:405.0min:
[2023-12-25 14:07:26,934][model5_pretrain.py][INFO] Epoch:[0/1](48200/112548) loss:2.802 lr:0.0001113 epoch_Time:404.0min:
[2023-12-25 14:08:04,630][model5_pretrain.py][INFO] Epoch:[0/1](48300/112548) loss:2.309 lr:0.0001107 epoch_Time:404.0min:
[2023-12-25 14:08:42,323][model5_pretrain.py][INFO] Epoch:[0/1](48400/112548) loss:2.389 lr:0.0001102 epoch_Time:403.0min:
[2023-12-25 14:09:20,157][model5_pretrain.py][INFO] Epoch:[0/1](48500/112548) loss:2.738 lr:0.0001096 epoch_Time:402.0min:
[2023-12-25 14:09:57,867][model5_pretrain.py][INFO] Epoch:[0/1](48600/112548) loss:3.119 lr:0.0001091 epoch_Time:402.0min:
[2023-12-25 14:10:35,577][model5_pretrain.py][INFO] Epoch:[0/1](48700/112548) loss:3.418 lr:0.0001086 epoch_Time:401.0min:
[2023-12-25 14:11:13,287][model5_pretrain.py][INFO] Epoch:[0/1](48800/112548) loss:3.130 lr:0.0001080 epoch_Time:401.0min:
[2023-12-25 14:11:51,005][model5_pretrain.py][INFO] Epoch:[0/1](48900/112548) loss:2.662 lr:0.0001075 epoch_Time:400.0min:
[2023-12-25 14:12:28,724][model5_pretrain.py][INFO] Epoch:[0/1](49000/112548) loss:3.285 lr:0.0001069 epoch_Time:399.0min:
[2023-12-25 14:13:06,442][model5_pretrain.py][INFO] Epoch:[0/1](49100/112548) loss:2.913 lr:0.0001064 epoch_Time:399.0min:
[2023-12-25 14:13:44,163][model5_pretrain.py][INFO] Epoch:[0/1](49200/112548) loss:3.211 lr:0.0001058 epoch_Time:398.0min:
[2023-12-25 14:14:21,887][model5_pretrain.py][INFO] Epoch:[0/1](49300/112548) loss:2.894 lr:0.0001053 epoch_Time:397.0min:
[2023-12-25 14:14:59,609][model5_pretrain.py][INFO] Epoch:[0/1](49400/112548) loss:3.907 lr:0.0001047 epoch_Time:397.0min:
[2023-12-25 14:15:37,323][model5_pretrain.py][INFO] Epoch:[0/1](49500/112548) loss:3.003 lr:0.0001042 epoch_Time:396.0min:
[2023-12-25 14:16:15,063][model5_pretrain.py][INFO] Epoch:[0/1](49600/112548) loss:3.077 lr:0.0001037 epoch_Time:396.0min:
[2023-12-25 14:16:52,799][model5_pretrain.py][INFO] Epoch:[0/1](49700/112548) loss:3.257 lr:0.0001031 epoch_Time:395.0min:
[2023-12-25 14:17:30,833][model5_pretrain.py][INFO] Epoch:[0/1](49800/112548) loss:2.724 lr:0.0001026 epoch_Time:394.0min:
[2023-12-25 14:18:08,553][model5_pretrain.py][INFO] Epoch:[0/1](49900/112548) loss:2.624 lr:0.0001021 epoch_Time:394.0min:
[2023-12-25 14:18:46,283][model5_pretrain.py][INFO] Epoch:[0/1](50000/112548) loss:2.953 lr:0.0001015 epoch_Time:393.0min:
[2023-12-25 14:19:23,970][model5_pretrain.py][INFO] Epoch:[0/1](50100/112548) loss:2.821 lr:0.0001010 epoch_Time:392.0min:
[2023-12-25 14:20:01,682][model5_pretrain.py][INFO] Epoch:[0/1](50200/112548) loss:2.853 lr:0.0001004 epoch_Time:392.0min:
[2023-12-25 14:20:39,401][model5_pretrain.py][INFO] Epoch:[0/1](50300/112548) loss:3.043 lr:0.0000999 epoch_Time:391.0min:
[2023-12-25 14:21:17,109][model5_pretrain.py][INFO] Epoch:[0/1](50400/112548) loss:2.633 lr:0.0000994 epoch_Time:391.0min:
[2023-12-25 14:21:54,825][model5_pretrain.py][INFO] Epoch:[0/1](50500/112548) loss:2.602 lr:0.0000988 epoch_Time:390.0min:
[2023-12-25 14:22:32,547][model5_pretrain.py][INFO] Epoch:[0/1](50600/112548) loss:2.818 lr:0.0000983 epoch_Time:389.0min:
[2023-12-25 14:23:10,269][model5_pretrain.py][INFO] Epoch:[0/1](50700/112548) loss:2.948 lr:0.0000978 epoch_Time:389.0min:
[2023-12-25 14:23:47,983][model5_pretrain.py][INFO] Epoch:[0/1](50800/112548) loss:2.878 lr:0.0000973 epoch_Time:388.0min:
[2023-12-25 14:24:25,740][model5_pretrain.py][INFO] Epoch:[0/1](50900/112548) loss:2.886 lr:0.0000967 epoch_Time:387.0min:
[2023-12-25 14:25:03,519][model5_pretrain.py][INFO] Epoch:[0/1](51000/112548) loss:2.246 lr:0.0000962 epoch_Time:387.0min:
[2023-12-25 14:25:41,302][model5_pretrain.py][INFO] Epoch:[0/1](51100/112548) loss:2.786 lr:0.0000957 epoch_Time:386.0min:
[2023-12-25 14:26:19,062][model5_pretrain.py][INFO] Epoch:[0/1](51200/112548) loss:2.876 lr:0.0000951 epoch_Time:385.0min:
[2023-12-25 14:26:56,791][model5_pretrain.py][INFO] Epoch:[0/1](51300/112548) loss:2.518 lr:0.0000946 epoch_Time:385.0min:
[2023-12-25 14:27:34,510][model5_pretrain.py][INFO] Epoch:[0/1](51400/112548) loss:2.348 lr:0.0000941 epoch_Time:384.0min:
[2023-12-25 14:28:12,220][model5_pretrain.py][INFO] Epoch:[0/1](51500/112548) loss:2.135 lr:0.0000936 epoch_Time:384.0min:
[2023-12-25 14:28:49,941][model5_pretrain.py][INFO] Epoch:[0/1](51600/112548) loss:3.129 lr:0.0000931 epoch_Time:383.0min:
[2023-12-25 14:29:27,659][model5_pretrain.py][INFO] Epoch:[0/1](51700/112548) loss:3.127 lr:0.0000925 epoch_Time:382.0min:
[2023-12-25 14:30:05,377][model5_pretrain.py][INFO] Epoch:[0/1](51800/112548) loss:2.728 lr:0.0000920 epoch_Time:382.0min:
[2023-12-25 14:30:43,092][model5_pretrain.py][INFO] Epoch:[0/1](51900/112548) loss:3.359 lr:0.0000915 epoch_Time:381.0min:
[2023-12-25 14:31:20,813][model5_pretrain.py][INFO] Epoch:[0/1](52000/112548) loss:2.811 lr:0.0000910 epoch_Time:380.0min:
[2023-12-25 14:31:58,529][model5_pretrain.py][INFO] Epoch:[0/1](52100/112548) loss:3.138 lr:0.0000905 epoch_Time:380.0min:
[2023-12-25 14:32:36,266][model5_pretrain.py][INFO] Epoch:[0/1](52200/112548) loss:3.407 lr:0.0000899 epoch_Time:379.0min:
[2023-12-25 14:33:13,992][model5_pretrain.py][INFO] Epoch:[0/1](52300/112548) loss:2.860 lr:0.0000894 epoch_Time:379.0min:
[2023-12-25 14:33:51,726][model5_pretrain.py][INFO] Epoch:[0/1](52400/112548) loss:2.791 lr:0.0000889 epoch_Time:378.0min:
[2023-12-25 14:34:29,460][model5_pretrain.py][INFO] Epoch:[0/1](52500/112548) loss:2.956 lr:0.0000884 epoch_Time:377.0min:
[2023-12-25 14:35:07,197][model5_pretrain.py][INFO] Epoch:[0/1](52600/112548) loss:2.605 lr:0.0000879 epoch_Time:377.0min:
[2023-12-25 14:35:44,914][model5_pretrain.py][INFO] Epoch:[0/1](52700/112548) loss:2.992 lr:0.0000874 epoch_Time:376.0min:
[2023-12-25 14:36:22,641][model5_pretrain.py][INFO] Epoch:[0/1](52800/112548) loss:2.696 lr:0.0000869 epoch_Time:375.0min:
[2023-12-25 14:37:00,366][model5_pretrain.py][INFO] Epoch:[0/1](52900/112548) loss:2.152 lr:0.0000864 epoch_Time:375.0min:
[2023-12-25 14:37:38,094][model5_pretrain.py][INFO] Epoch:[0/1](53000/112548) loss:2.572 lr:0.0000859 epoch_Time:374.0min:
[2023-12-25 14:38:15,816][model5_pretrain.py][INFO] Epoch:[0/1](53100/112548) loss:3.137 lr:0.0000853 epoch_Time:374.0min:
[2023-12-25 14:38:53,541][model5_pretrain.py][INFO] Epoch:[0/1](53200/112548) loss:2.723 lr:0.0000848 epoch_Time:373.0min:
[2023-12-25 14:39:31,265][model5_pretrain.py][INFO] Epoch:[0/1](53300/112548) loss:3.271 lr:0.0000843 epoch_Time:372.0min:
[2023-12-25 14:40:08,998][model5_pretrain.py][INFO] Epoch:[0/1](53400/112548) loss:3.117 lr:0.0000838 epoch_Time:372.0min:
[2023-12-25 14:40:46,716][model5_pretrain.py][INFO] Epoch:[0/1](53500/112548) loss:3.014 lr:0.0000833 epoch_Time:371.0min:
[2023-12-25 14:41:24,436][model5_pretrain.py][INFO] Epoch:[0/1](53600/112548) loss:2.317 lr:0.0000828 epoch_Time:370.0min:
[2023-12-25 14:42:02,153][model5_pretrain.py][INFO] Epoch:[0/1](53700/112548) loss:2.847 lr:0.0000823 epoch_Time:370.0min:
[2023-12-25 14:42:39,876][model5_pretrain.py][INFO] Epoch:[0/1](53800/112548) loss:3.223 lr:0.0000818 epoch_Time:369.0min:
[2023-12-25 14:43:17,592][model5_pretrain.py][INFO] Epoch:[0/1](53900/112548) loss:2.864 lr:0.0000813 epoch_Time:368.0min:
[2023-12-25 14:43:55,309][model5_pretrain.py][INFO] Epoch:[0/1](54000/112548) loss:3.128 lr:0.0000808 epoch_Time:368.0min:
[2023-12-25 14:44:33,023][model5_pretrain.py][INFO] Epoch:[0/1](54100/112548) loss:3.311 lr:0.0000803 epoch_Time:367.0min:
[2023-12-25 14:45:10,717][model5_pretrain.py][INFO] Epoch:[0/1](54200/112548) loss:3.263 lr:0.0000799 epoch_Time:367.0min:
[2023-12-25 14:45:48,443][model5_pretrain.py][INFO] Epoch:[0/1](54300/112548) loss:3.380 lr:0.0000794 epoch_Time:366.0min:
[2023-12-25 14:46:26,145][model5_pretrain.py][INFO] Epoch:[0/1](54400/112548) loss:2.379 lr:0.0000789 epoch_Time:365.0min:
[2023-12-25 14:47:03,873][model5_pretrain.py][INFO] Epoch:[0/1](54500/112548) loss:2.947 lr:0.0000784 epoch_Time:365.0min:
[2023-12-25 14:47:41,573][model5_pretrain.py][INFO] Epoch:[0/1](54600/112548) loss:2.763 lr:0.0000779 epoch_Time:364.0min:
[2023-12-25 14:48:19,285][model5_pretrain.py][INFO] Epoch:[0/1](54700/112548) loss:2.839 lr:0.0000774 epoch_Time:363.0min:
[2023-12-25 14:48:56,977][model5_pretrain.py][INFO] Epoch:[0/1](54800/112548) loss:2.806 lr:0.0000769 epoch_Time:363.0min:
[2023-12-25 14:49:34,673][model5_pretrain.py][INFO] Epoch:[0/1](54900/112548) loss:2.713 lr:0.0000764 epoch_Time:362.0min:
[2023-12-25 14:50:12,377][model5_pretrain.py][INFO] Epoch:[0/1](55000/112548) loss:3.309 lr:0.0000759 epoch_Time:362.0min:
[2023-12-25 14:50:50,075][model5_pretrain.py][INFO] Epoch:[0/1](55100/112548) loss:2.872 lr:0.0000755 epoch_Time:361.0min:
[2023-12-25 14:51:27,778][model5_pretrain.py][INFO] Epoch:[0/1](55200/112548) loss:2.795 lr:0.0000750 epoch_Time:360.0min:
[2023-12-25 14:52:05,487][model5_pretrain.py][INFO] Epoch:[0/1](55300/112548) loss:3.161 lr:0.0000745 epoch_Time:360.0min:
[2023-12-25 14:52:43,183][model5_pretrain.py][INFO] Epoch:[0/1](55400/112548) loss:2.577 lr:0.0000740 epoch_Time:359.0min:
[2023-12-25 14:53:20,886][model5_pretrain.py][INFO] Epoch:[0/1](55500/112548) loss:2.847 lr:0.0000735 epoch_Time:358.0min:
[2023-12-25 14:53:58,589][model5_pretrain.py][INFO] Epoch:[0/1](55600/112548) loss:2.952 lr:0.0000731 epoch_Time:358.0min:
[2023-12-25 14:54:36,303][model5_pretrain.py][INFO] Epoch:[0/1](55700/112548) loss:3.178 lr:0.0000726 epoch_Time:357.0min:
[2023-12-25 14:55:14,004][model5_pretrain.py][INFO] Epoch:[0/1](55800/112548) loss:2.548 lr:0.0000721 epoch_Time:357.0min:
[2023-12-25 14:55:51,704][model5_pretrain.py][INFO] Epoch:[0/1](55900/112548) loss:2.379 lr:0.0000716 epoch_Time:356.0min:
[2023-12-25 14:56:29,405][model5_pretrain.py][INFO] Epoch:[0/1](56000/112548) loss:2.424 lr:0.0000712 epoch_Time:355.0min:
[2023-12-25 14:57:07,113][model5_pretrain.py][INFO] Epoch:[0/1](56100/112548) loss:3.031 lr:0.0000707 epoch_Time:355.0min:
[2023-12-25 14:57:44,810][model5_pretrain.py][INFO] Epoch:[0/1](56200/112548) loss:2.564 lr:0.0000702 epoch_Time:354.0min:
[2023-12-25 14:58:22,512][model5_pretrain.py][INFO] Epoch:[0/1](56300/112548) loss:2.635 lr:0.0000698 epoch_Time:353.0min:
[2023-12-25 14:59:00,218][model5_pretrain.py][INFO] Epoch:[0/1](56400/112548) loss:2.524 lr:0.0000693 epoch_Time:353.0min:
[2023-12-25 14:59:37,917][model5_pretrain.py][INFO] Epoch:[0/1](56500/112548) loss:2.378 lr:0.0000688 epoch_Time:352.0min:
[2023-12-25 15:00:15,624][model5_pretrain.py][INFO] Epoch:[0/1](56600/112548) loss:2.752 lr:0.0000684 epoch_Time:352.0min:
[2023-12-25 15:00:53,324][model5_pretrain.py][INFO] Epoch:[0/1](56700/112548) loss:2.468 lr:0.0000679 epoch_Time:351.0min:
[2023-12-25 15:01:31,024][model5_pretrain.py][INFO] Epoch:[0/1](56800/112548) loss:2.967 lr:0.0000675 epoch_Time:350.0min:
[2023-12-25 15:02:08,723][model5_pretrain.py][INFO] Epoch:[0/1](56900/112548) loss:3.368 lr:0.0000670 epoch_Time:350.0min:
[2023-12-25 15:02:46,424][model5_pretrain.py][INFO] Epoch:[0/1](57000/112548) loss:2.798 lr:0.0000665 epoch_Time:349.0min:
[2023-12-25 15:03:24,122][model5_pretrain.py][INFO] Epoch:[0/1](57100/112548) loss:2.786 lr:0.0000661 epoch_Time:348.0min:
[2023-12-25 15:04:01,824][model5_pretrain.py][INFO] Epoch:[0/1](57200/112548) loss:2.561 lr:0.0000656 epoch_Time:348.0min:
[2023-12-25 15:04:39,521][model5_pretrain.py][INFO] Epoch:[0/1](57300/112548) loss:2.844 lr:0.0000652 epoch_Time:347.0min:
[2023-12-25 15:05:17,224][model5_pretrain.py][INFO] Epoch:[0/1](57400/112548) loss:3.223 lr:0.0000647 epoch_Time:347.0min:
[2023-12-25 15:05:54,928][model5_pretrain.py][INFO] Epoch:[0/1](57500/112548) loss:2.828 lr:0.0000643 epoch_Time:346.0min:
[2023-12-25 15:06:32,629][model5_pretrain.py][INFO] Epoch:[0/1](57600/112548) loss:3.005 lr:0.0000638 epoch_Time:345.0min:
[2023-12-25 15:07:10,298][model5_pretrain.py][INFO] Epoch:[0/1](57700/112548) loss:2.982 lr:0.0000634 epoch_Time:345.0min:
[2023-12-25 15:07:48,003][model5_pretrain.py][INFO] Epoch:[0/1](57800/112548) loss:2.888 lr:0.0000629 epoch_Time:344.0min:
[2023-12-25 15:08:25,714][model5_pretrain.py][INFO] Epoch:[0/1](57900/112548) loss:2.526 lr:0.0000625 epoch_Time:343.0min:
[2023-12-25 15:09:03,419][model5_pretrain.py][INFO] Epoch:[0/1](58000/112548) loss:2.923 lr:0.0000620 epoch_Time:343.0min:
[2023-12-25 15:09:41,127][model5_pretrain.py][INFO] Epoch:[0/1](58100/112548) loss:3.005 lr:0.0000616 epoch_Time:342.0min:
[2023-12-25 15:10:18,843][model5_pretrain.py][INFO] Epoch:[0/1](58200/112548) loss:3.264 lr:0.0000612 epoch_Time:341.0min:
[2023-12-25 15:10:56,561][model5_pretrain.py][INFO] Epoch:[0/1](58300/112548) loss:3.303 lr:0.0000607 epoch_Time:341.0min:
[2023-12-25 15:11:34,278][model5_pretrain.py][INFO] Epoch:[0/1](58400/112548) loss:2.929 lr:0.0000603 epoch_Time:340.0min:
[2023-12-25 15:12:11,987][model5_pretrain.py][INFO] Epoch:[0/1](58500/112548) loss:2.646 lr:0.0000598 epoch_Time:340.0min:
[2023-12-25 15:12:49,690][model5_pretrain.py][INFO] Epoch:[0/1](58600/112548) loss:2.297 lr:0.0000594 epoch_Time:339.0min:
[2023-12-25 15:13:27,389][model5_pretrain.py][INFO] Epoch:[0/1](58700/112548) loss:2.713 lr:0.0000590 epoch_Time:338.0min:
[2023-12-25 15:14:05,091][model5_pretrain.py][INFO] Epoch:[0/1](58800/112548) loss:3.042 lr:0.0000585 epoch_Time:338.0min:
[2023-12-25 15:14:42,791][model5_pretrain.py][INFO] Epoch:[0/1](58900/112548) loss:2.454 lr:0.0000581 epoch_Time:337.0min:
[2023-12-25 15:15:20,491][model5_pretrain.py][INFO] Epoch:[0/1](59000/112548) loss:3.039 lr:0.0000577 epoch_Time:336.0min:
[2023-12-25 15:15:58,193][model5_pretrain.py][INFO] Epoch:[0/1](59100/112548) loss:2.923 lr:0.0000573 epoch_Time:336.0min:
[2023-12-25 15:16:35,892][model5_pretrain.py][INFO] Epoch:[0/1](59200/112548) loss:3.224 lr:0.0000568 epoch_Time:335.0min:
[2023-12-25 15:17:13,589][model5_pretrain.py][INFO] Epoch:[0/1](59300/112548) loss:3.004 lr:0.0000564 epoch_Time:335.0min:
[2023-12-25 15:17:51,291][model5_pretrain.py][INFO] Epoch:[0/1](59400/112548) loss:2.815 lr:0.0000560 epoch_Time:334.0min:
[2023-12-25 15:18:28,987][model5_pretrain.py][INFO] Epoch:[0/1](59500/112548) loss:2.211 lr:0.0000556 epoch_Time:333.0min:
[2023-12-25 15:19:06,695][model5_pretrain.py][INFO] Epoch:[0/1](59600/112548) loss:2.717 lr:0.0000552 epoch_Time:333.0min:
[2023-12-25 15:19:44,400][model5_pretrain.py][INFO] Epoch:[0/1](59700/112548) loss:2.817 lr:0.0000547 epoch_Time:332.0min:
[2023-12-25 15:20:22,104][model5_pretrain.py][INFO] Epoch:[0/1](59800/112548) loss:3.016 lr:0.0000543 epoch_Time:331.0min:
[2023-12-25 15:20:59,815][model5_pretrain.py][INFO] Epoch:[0/1](59900/112548) loss:2.908 lr:0.0000539 epoch_Time:331.0min:
[2023-12-25 15:21:37,514][model5_pretrain.py][INFO] Epoch:[0/1](60000/112548) loss:2.619 lr:0.0000535 epoch_Time:330.0min:
[2023-12-25 15:22:20,538][model5_pretrain.py][INFO] Epoch:[0/1](60100/112548) loss:2.754 lr:0.0000531 epoch_Time:329.0min:
[2023-12-25 15:22:58,258][model5_pretrain.py][INFO] Epoch:[0/1](60200/112548) loss:2.225 lr:0.0000527 epoch_Time:329.0min:
[2023-12-25 15:23:36,008][model5_pretrain.py][INFO] Epoch:[0/1](60300/112548) loss:2.773 lr:0.0000523 epoch_Time:328.0min:
[2023-12-25 15:24:13,750][model5_pretrain.py][INFO] Epoch:[0/1](60400/112548) loss:3.305 lr:0.0000519 epoch_Time:328.0min:
[2023-12-25 15:24:51,569][model5_pretrain.py][INFO] Epoch:[0/1](60500/112548) loss:2.781 lr:0.0000515 epoch_Time:327.0min:
[2023-12-25 15:25:29,395][model5_pretrain.py][INFO] Epoch:[0/1](60600/112548) loss:3.394 lr:0.0000511 epoch_Time:326.0min:
[2023-12-25 15:26:07,120][model5_pretrain.py][INFO] Epoch:[0/1](60700/112548) loss:2.399 lr:0.0000507 epoch_Time:326.0min:
[2023-12-25 15:26:44,854][model5_pretrain.py][INFO] Epoch:[0/1](60800/112548) loss:3.000 lr:0.0000503 epoch_Time:325.0min:
[2023-12-25 15:27:22,885][model5_pretrain.py][INFO] Epoch:[0/1](60900/112548) loss:3.115 lr:0.0000499 epoch_Time:324.0min:
[2023-12-25 15:28:00,604][model5_pretrain.py][INFO] Epoch:[0/1](61000/112548) loss:3.069 lr:0.0000495 epoch_Time:324.0min:
[2023-12-25 15:28:38,341][model5_pretrain.py][INFO] Epoch:[0/1](61100/112548) loss:2.556 lr:0.0000491 epoch_Time:323.0min:
[2023-12-25 15:29:16,066][model5_pretrain.py][INFO] Epoch:[0/1](61200/112548) loss:2.688 lr:0.0000487 epoch_Time:323.0min:
[2023-12-25 15:29:53,761][model5_pretrain.py][INFO] Epoch:[0/1](61300/112548) loss:2.566 lr:0.0000483 epoch_Time:322.0min:
[2023-12-25 15:30:31,472][model5_pretrain.py][INFO] Epoch:[0/1](61400/112548) loss:2.197 lr:0.0000479 epoch_Time:321.0min:
[2023-12-25 15:31:09,190][model5_pretrain.py][INFO] Epoch:[0/1](61500/112548) loss:2.722 lr:0.0000475 epoch_Time:321.0min:
[2023-12-25 15:31:46,899][model5_pretrain.py][INFO] Epoch:[0/1](61600/112548) loss:2.626 lr:0.0000471 epoch_Time:320.0min:
[2023-12-25 15:32:24,617][model5_pretrain.py][INFO] Epoch:[0/1](61700/112548) loss:2.604 lr:0.0000467 epoch_Time:319.0min:
[2023-12-25 15:33:02,335][model5_pretrain.py][INFO] Epoch:[0/1](61800/112548) loss:2.303 lr:0.0000463 epoch_Time:319.0min:
[2023-12-25 15:33:40,048][model5_pretrain.py][INFO] Epoch:[0/1](61900/112548) loss:2.736 lr:0.0000460 epoch_Time:318.0min:
[2023-12-25 15:34:17,763][model5_pretrain.py][INFO] Epoch:[0/1](62000/112548) loss:2.661 lr:0.0000456 epoch_Time:317.0min:
[2023-12-25 15:34:55,491][model5_pretrain.py][INFO] Epoch:[0/1](62100/112548) loss:3.178 lr:0.0000452 epoch_Time:317.0min:
[2023-12-25 15:35:33,203][model5_pretrain.py][INFO] Epoch:[0/1](62200/112548) loss:2.693 lr:0.0000448 epoch_Time:316.0min:
[2023-12-25 15:36:10,954][model5_pretrain.py][INFO] Epoch:[0/1](62300/112548) loss:2.568 lr:0.0000445 epoch_Time:316.0min:
[2023-12-25 15:36:48,686][model5_pretrain.py][INFO] Epoch:[0/1](62400/112548) loss:2.320 lr:0.0000441 epoch_Time:315.0min:
[2023-12-25 15:37:26,404][model5_pretrain.py][INFO] Epoch:[0/1](62500/112548) loss:2.488 lr:0.0000437 epoch_Time:314.0min:
[2023-12-25 15:38:04,129][model5_pretrain.py][INFO] Epoch:[0/1](62600/112548) loss:2.565 lr:0.0000433 epoch_Time:314.0min:
[2023-12-25 15:38:41,847][model5_pretrain.py][INFO] Epoch:[0/1](62700/112548) loss:2.800 lr:0.0000430 epoch_Time:313.0min:
[2023-12-25 15:39:19,568][model5_pretrain.py][INFO] Epoch:[0/1](62800/112548) loss:2.635 lr:0.0000426 epoch_Time:312.0min:
[2023-12-25 15:39:57,293][model5_pretrain.py][INFO] Epoch:[0/1](62900/112548) loss:3.191 lr:0.0000423 epoch_Time:312.0min:
[2023-12-25 15:40:35,015][model5_pretrain.py][INFO] Epoch:[0/1](63000/112548) loss:2.825 lr:0.0000419 epoch_Time:311.0min:
[2023-12-25 15:41:12,732][model5_pretrain.py][INFO] Epoch:[0/1](63100/112548) loss:2.514 lr:0.0000415 epoch_Time:311.0min:
[2023-12-25 15:41:50,447][model5_pretrain.py][INFO] Epoch:[0/1](63200/112548) loss:3.158 lr:0.0000412 epoch_Time:310.0min:
[2023-12-25 15:42:28,158][model5_pretrain.py][INFO] Epoch:[0/1](63300/112548) loss:2.597 lr:0.0000408 epoch_Time:309.0min:
[2023-12-25 15:43:05,875][model5_pretrain.py][INFO] Epoch:[0/1](63400/112548) loss:2.944 lr:0.0000405 epoch_Time:309.0min:
[2023-12-25 15:43:43,589][model5_pretrain.py][INFO] Epoch:[0/1](63500/112548) loss:2.878 lr:0.0000401 epoch_Time:308.0min:
[2023-12-25 15:44:21,311][model5_pretrain.py][INFO] Epoch:[0/1](63600/112548) loss:2.757 lr:0.0000398 epoch_Time:307.0min:
[2023-12-25 15:44:59,036][model5_pretrain.py][INFO] Epoch:[0/1](63700/112548) loss:2.679 lr:0.0000394 epoch_Time:307.0min:
[2023-12-25 15:45:36,728][model5_pretrain.py][INFO] Epoch:[0/1](63800/112548) loss:2.796 lr:0.0000391 epoch_Time:306.0min:
[2023-12-25 15:46:14,456][model5_pretrain.py][INFO] Epoch:[0/1](63900/112548) loss:2.699 lr:0.0000387 epoch_Time:306.0min:
[2023-12-25 15:46:52,180][model5_pretrain.py][INFO] Epoch:[0/1](64000/112548) loss:2.403 lr:0.0000384 epoch_Time:305.0min:
[2023-12-25 15:47:29,898][model5_pretrain.py][INFO] Epoch:[0/1](64100/112548) loss:2.763 lr:0.0000380 epoch_Time:304.0min:
[2023-12-25 15:48:07,622][model5_pretrain.py][INFO] Epoch:[0/1](64200/112548) loss:2.439 lr:0.0000377 epoch_Time:304.0min:
[2023-12-25 15:48:45,349][model5_pretrain.py][INFO] Epoch:[0/1](64300/112548) loss:3.081 lr:0.0000374 epoch_Time:303.0min:
[2023-12-25 15:49:23,074][model5_pretrain.py][INFO] Epoch:[0/1](64400/112548) loss:2.090 lr:0.0000370 epoch_Time:302.0min:
[2023-12-25 15:50:00,798][model5_pretrain.py][INFO] Epoch:[0/1](64500/112548) loss:2.648 lr:0.0000367 epoch_Time:302.0min:
[2023-12-25 15:50:38,526][model5_pretrain.py][INFO] Epoch:[0/1](64600/112548) loss:2.868 lr:0.0000364 epoch_Time:301.0min:
[2023-12-25 15:51:16,250][model5_pretrain.py][INFO] Epoch:[0/1](64700/112548) loss:2.560 lr:0.0000360 epoch_Time:301.0min:
[2023-12-25 15:51:53,974][model5_pretrain.py][INFO] Epoch:[0/1](64800/112548) loss:2.754 lr:0.0000357 epoch_Time:300.0min:
[2023-12-25 15:52:31,695][model5_pretrain.py][INFO] Epoch:[0/1](64900/112548) loss:2.596 lr:0.0000354 epoch_Time:299.0min:
[2023-12-25 15:53:09,419][model5_pretrain.py][INFO] Epoch:[0/1](65000/112548) loss:2.904 lr:0.0000350 epoch_Time:299.0min:
[2023-12-25 15:53:47,152][model5_pretrain.py][INFO] Epoch:[0/1](65100/112548) loss:2.766 lr:0.0000347 epoch_Time:298.0min:
[2023-12-25 15:54:24,877][model5_pretrain.py][INFO] Epoch:[0/1](65200/112548) loss:3.384 lr:0.0000344 epoch_Time:297.0min:
[2023-12-25 15:55:02,601][model5_pretrain.py][INFO] Epoch:[0/1](65300/112548) loss:2.985 lr:0.0000341 epoch_Time:297.0min:
[2023-12-25 15:55:40,323][model5_pretrain.py][INFO] Epoch:[0/1](65400/112548) loss:2.411 lr:0.0000338 epoch_Time:296.0min:
[2023-12-25 15:56:18,057][model5_pretrain.py][INFO] Epoch:[0/1](65500/112548) loss:2.875 lr:0.0000334 epoch_Time:295.0min:
[2023-12-25 15:56:55,785][model5_pretrain.py][INFO] Epoch:[0/1](65600/112548) loss:3.268 lr:0.0000331 epoch_Time:295.0min:
[2023-12-25 15:57:33,516][model5_pretrain.py][INFO] Epoch:[0/1](65700/112548) loss:3.197 lr:0.0000328 epoch_Time:294.0min:
[2023-12-25 15:58:11,248][model5_pretrain.py][INFO] Epoch:[0/1](65800/112548) loss:2.373 lr:0.0000325 epoch_Time:294.0min:
[2023-12-25 15:58:48,974][model5_pretrain.py][INFO] Epoch:[0/1](65900/112548) loss:3.083 lr:0.0000322 epoch_Time:293.0min:
[2023-12-25 15:59:26,700][model5_pretrain.py][INFO] Epoch:[0/1](66000/112548) loss:2.805 lr:0.0000319 epoch_Time:292.0min:
[2023-12-25 16:00:04,435][model5_pretrain.py][INFO] Epoch:[0/1](66100/112548) loss:2.841 lr:0.0000316 epoch_Time:292.0min:
[2023-12-25 16:00:42,161][model5_pretrain.py][INFO] Epoch:[0/1](66200/112548) loss:2.932 lr:0.0000313 epoch_Time:291.0min:
[2023-12-25 16:01:19,888][model5_pretrain.py][INFO] Epoch:[0/1](66300/112548) loss:3.225 lr:0.0000310 epoch_Time:290.0min:
[2023-12-25 16:01:57,612][model5_pretrain.py][INFO] Epoch:[0/1](66400/112548) loss:2.960 lr:0.0000307 epoch_Time:290.0min:
[2023-12-25 16:02:35,342][model5_pretrain.py][INFO] Epoch:[0/1](66500/112548) loss:2.584 lr:0.0000304 epoch_Time:289.0min:
[2023-12-25 16:03:13,072][model5_pretrain.py][INFO] Epoch:[0/1](66600/112548) loss:2.988 lr:0.0000301 epoch_Time:289.0min:
[2023-12-25 16:03:50,790][model5_pretrain.py][INFO] Epoch:[0/1](66700/112548) loss:2.320 lr:0.0000298 epoch_Time:288.0min:
[2023-12-25 16:04:28,519][model5_pretrain.py][INFO] Epoch:[0/1](66800/112548) loss:2.647 lr:0.0000295 epoch_Time:287.0min:
[2023-12-25 16:05:06,247][model5_pretrain.py][INFO] Epoch:[0/1](66900/112548) loss:2.866 lr:0.0000292 epoch_Time:287.0min:
[2023-12-25 16:05:43,971][model5_pretrain.py][INFO] Epoch:[0/1](67000/112548) loss:2.950 lr:0.0000289 epoch_Time:286.0min:
[2023-12-25 16:06:21,696][model5_pretrain.py][INFO] Epoch:[0/1](67100/112548) loss:2.669 lr:0.0000287 epoch_Time:285.0min:
[2023-12-25 16:06:59,424][model5_pretrain.py][INFO] Epoch:[0/1](67200/112548) loss:2.562 lr:0.0000284 epoch_Time:285.0min:
[2023-12-25 16:07:37,151][model5_pretrain.py][INFO] Epoch:[0/1](67300/112548) loss:2.740 lr:0.0000281 epoch_Time:284.0min:
[2023-12-25 16:08:14,880][model5_pretrain.py][INFO] Epoch:[0/1](67400/112548) loss:2.866 lr:0.0000278 epoch_Time:284.0min:
[2023-12-25 16:08:52,612][model5_pretrain.py][INFO] Epoch:[0/1](67500/112548) loss:2.658 lr:0.0000275 epoch_Time:283.0min:
[2023-12-25 16:09:30,344][model5_pretrain.py][INFO] Epoch:[0/1](67600/112548) loss:2.492 lr:0.0000273 epoch_Time:282.0min:
[2023-12-25 16:10:08,075][model5_pretrain.py][INFO] Epoch:[0/1](67700/112548) loss:2.670 lr:0.0000270 epoch_Time:282.0min:
[2023-12-25 16:10:45,804][model5_pretrain.py][INFO] Epoch:[0/1](67800/112548) loss:2.812 lr:0.0000267 epoch_Time:281.0min:
[2023-12-25 16:11:23,537][model5_pretrain.py][INFO] Epoch:[0/1](67900/112548) loss:2.614 lr:0.0000265 epoch_Time:280.0min:
[2023-12-25 16:12:01,266][model5_pretrain.py][INFO] Epoch:[0/1](68000/112548) loss:2.642 lr:0.0000262 epoch_Time:280.0min:
[2023-12-25 16:12:38,996][model5_pretrain.py][INFO] Epoch:[0/1](68100/112548) loss:2.505 lr:0.0000259 epoch_Time:279.0min:
[2023-12-25 16:13:16,731][model5_pretrain.py][INFO] Epoch:[0/1](68200/112548) loss:2.525 lr:0.0000257 epoch_Time:279.0min:
[2023-12-25 16:13:54,466][model5_pretrain.py][INFO] Epoch:[0/1](68300/112548) loss:2.784 lr:0.0000254 epoch_Time:278.0min:
[2023-12-25 16:14:32,196][model5_pretrain.py][INFO] Epoch:[0/1](68400/112548) loss:1.901 lr:0.0000252 epoch_Time:277.0min:
[2023-12-25 16:15:09,928][model5_pretrain.py][INFO] Epoch:[0/1](68500/112548) loss:2.954 lr:0.0000249 epoch_Time:277.0min:
[2023-12-25 16:15:47,662][model5_pretrain.py][INFO] Epoch:[0/1](68600/112548) loss:3.190 lr:0.0000246 epoch_Time:276.0min:
[2023-12-25 16:16:25,392][model5_pretrain.py][INFO] Epoch:[0/1](68700/112548) loss:2.724 lr:0.0000244 epoch_Time:275.0min:
[2023-12-25 16:17:03,132][model5_pretrain.py][INFO] Epoch:[0/1](68800/112548) loss:2.984 lr:0.0000241 epoch_Time:275.0min:
[2023-12-25 16:17:40,858][model5_pretrain.py][INFO] Epoch:[0/1](68900/112548) loss:2.670 lr:0.0000239 epoch_Time:274.0min:
[2023-12-25 16:18:18,557][model5_pretrain.py][INFO] Epoch:[0/1](69000/112548) loss:2.719 lr:0.0000237 epoch_Time:273.0min:
[2023-12-25 16:18:56,281][model5_pretrain.py][INFO] Epoch:[0/1](69100/112548) loss:2.872 lr:0.0000234 epoch_Time:273.0min:
[2023-12-25 16:19:34,010][model5_pretrain.py][INFO] Epoch:[0/1](69200/112548) loss:2.405 lr:0.0000232 epoch_Time:272.0min:
[2023-12-25 16:20:11,731][model5_pretrain.py][INFO] Epoch:[0/1](69300/112548) loss:3.248 lr:0.0000229 epoch_Time:272.0min:
[2023-12-25 16:20:49,464][model5_pretrain.py][INFO] Epoch:[0/1](69400/112548) loss:2.290 lr:0.0000227 epoch_Time:271.0min:
[2023-12-25 16:21:27,193][model5_pretrain.py][INFO] Epoch:[0/1](69500/112548) loss:3.148 lr:0.0000225 epoch_Time:270.0min:
[2023-12-25 16:22:04,926][model5_pretrain.py][INFO] Epoch:[0/1](69600/112548) loss:2.837 lr:0.0000222 epoch_Time:270.0min:
[2023-12-25 16:22:42,622][model5_pretrain.py][INFO] Epoch:[0/1](69700/112548) loss:3.012 lr:0.0000220 epoch_Time:269.0min:
[2023-12-25 16:23:20,345][model5_pretrain.py][INFO] Epoch:[0/1](69800/112548) loss:2.510 lr:0.0000218 epoch_Time:268.0min:
[2023-12-25 16:23:58,067][model5_pretrain.py][INFO] Epoch:[0/1](69900/112548) loss:2.907 lr:0.0000215 epoch_Time:268.0min:
[2023-12-25 16:24:35,794][model5_pretrain.py][INFO] Epoch:[0/1](70000/112548) loss:2.683 lr:0.0000213 epoch_Time:267.0min:
[2023-12-25 16:25:13,522][model5_pretrain.py][INFO] Epoch:[0/1](70100/112548) loss:3.167 lr:0.0000211 epoch_Time:267.0min:
[2023-12-25 16:25:51,239][model5_pretrain.py][INFO] Epoch:[0/1](70200/112548) loss:2.661 lr:0.0000209 epoch_Time:266.0min:
[2023-12-25 16:26:28,960][model5_pretrain.py][INFO] Epoch:[0/1](70300/112548) loss:2.958 lr:0.0000207 epoch_Time:265.0min:
[2023-12-25 16:27:06,668][model5_pretrain.py][INFO] Epoch:[0/1](70400/112548) loss:2.748 lr:0.0000204 epoch_Time:265.0min:
[2023-12-25 16:27:44,369][model5_pretrain.py][INFO] Epoch:[0/1](70500/112548) loss:3.010 lr:0.0000202 epoch_Time:264.0min:
[2023-12-25 16:28:22,073][model5_pretrain.py][INFO] Epoch:[0/1](70600/112548) loss:2.543 lr:0.0000200 epoch_Time:263.0min:
[2023-12-25 16:28:59,778][model5_pretrain.py][INFO] Epoch:[0/1](70700/112548) loss:3.019 lr:0.0000198 epoch_Time:263.0min:
[2023-12-25 16:29:37,481][model5_pretrain.py][INFO] Epoch:[0/1](70800/112548) loss:2.755 lr:0.0000196 epoch_Time:262.0min:
[2023-12-25 16:30:15,187][model5_pretrain.py][INFO] Epoch:[0/1](70900/112548) loss:3.103 lr:0.0000194 epoch_Time:262.0min:
[2023-12-25 16:30:52,900][model5_pretrain.py][INFO] Epoch:[0/1](71000/112548) loss:2.763 lr:0.0000192 epoch_Time:261.0min:
[2023-12-25 16:31:30,606][model5_pretrain.py][INFO] Epoch:[0/1](71100/112548) loss:2.611 lr:0.0000190 epoch_Time:260.0min:
[2023-12-25 16:32:08,318][model5_pretrain.py][INFO] Epoch:[0/1](71200/112548) loss:2.801 lr:0.0000188 epoch_Time:260.0min:
[2023-12-25 16:32:46,024][model5_pretrain.py][INFO] Epoch:[0/1](71300/112548) loss:2.513 lr:0.0000186 epoch_Time:259.0min:
[2023-12-25 16:33:23,732][model5_pretrain.py][INFO] Epoch:[0/1](71400/112548) loss:2.558 lr:0.0000184 epoch_Time:258.0min:
[2023-12-25 16:34:01,447][model5_pretrain.py][INFO] Epoch:[0/1](71500/112548) loss:2.362 lr:0.0000182 epoch_Time:258.0min:
[2023-12-25 16:34:39,156][model5_pretrain.py][INFO] Epoch:[0/1](71600/112548) loss:2.344 lr:0.0000180 epoch_Time:257.0min:
[2023-12-25 16:35:16,869][model5_pretrain.py][INFO] Epoch:[0/1](71700/112548) loss:3.448 lr:0.0000178 epoch_Time:257.0min:
[2023-12-25 16:35:54,566][model5_pretrain.py][INFO] Epoch:[0/1](71800/112548) loss:3.127 lr:0.0000176 epoch_Time:256.0min:
[2023-12-25 16:36:32,565][model5_pretrain.py][INFO] Epoch:[0/1](71900/112548) loss:2.720 lr:0.0000175 epoch_Time:255.0min:
[2023-12-25 16:37:10,280][model5_pretrain.py][INFO] Epoch:[0/1](72000/112548) loss:2.870 lr:0.0000173 epoch_Time:255.0min:
[2023-12-25 16:37:47,990][model5_pretrain.py][INFO] Epoch:[0/1](72100/112548) loss:2.296 lr:0.0000171 epoch_Time:254.0min:
[2023-12-25 16:38:25,700][model5_pretrain.py][INFO] Epoch:[0/1](72200/112548) loss:2.500 lr:0.0000169 epoch_Time:253.0min:
[2023-12-25 16:39:03,410][model5_pretrain.py][INFO] Epoch:[0/1](72300/112548) loss:2.238 lr:0.0000167 epoch_Time:253.0min:
[2023-12-25 16:39:41,118][model5_pretrain.py][INFO] Epoch:[0/1](72400/112548) loss:2.080 lr:0.0000166 epoch_Time:252.0min:
[2023-12-25 16:40:18,826][model5_pretrain.py][INFO] Epoch:[0/1](72500/112548) loss:2.861 lr:0.0000164 epoch_Time:251.0min:
[2023-12-25 16:40:56,543][model5_pretrain.py][INFO] Epoch:[0/1](72600/112548) loss:2.592 lr:0.0000162 epoch_Time:251.0min:
[2023-12-25 16:41:34,264][model5_pretrain.py][INFO] Epoch:[0/1](72700/112548) loss:2.836 lr:0.0000161 epoch_Time:250.0min:
[2023-12-25 16:42:11,975][model5_pretrain.py][INFO] Epoch:[0/1](72800/112548) loss:2.755 lr:0.0000159 epoch_Time:250.0min:
[2023-12-25 16:42:49,688][model5_pretrain.py][INFO] Epoch:[0/1](72900/112548) loss:3.014 lr:0.0000157 epoch_Time:249.0min:
[2023-12-25 16:43:27,393][model5_pretrain.py][INFO] Epoch:[0/1](73000/112548) loss:2.894 lr:0.0000156 epoch_Time:248.0min:
[2023-12-25 16:44:05,103][model5_pretrain.py][INFO] Epoch:[0/1](73100/112548) loss:2.383 lr:0.0000154 epoch_Time:248.0min:
[2023-12-25 16:44:42,812][model5_pretrain.py][INFO] Epoch:[0/1](73200/112548) loss:3.022 lr:0.0000153 epoch_Time:247.0min:
[2023-12-25 16:45:20,527][model5_pretrain.py][INFO] Epoch:[0/1](73300/112548) loss:2.739 lr:0.0000151 epoch_Time:246.0min:
[2023-12-25 16:45:58,244][model5_pretrain.py][INFO] Epoch:[0/1](73400/112548) loss:2.118 lr:0.0000150 epoch_Time:246.0min:
[2023-12-25 16:46:35,949][model5_pretrain.py][INFO] Epoch:[0/1](73500/112548) loss:2.615 lr:0.0000148 epoch_Time:245.0min:
[2023-12-25 16:47:13,651][model5_pretrain.py][INFO] Epoch:[0/1](73600/112548) loss:2.925 lr:0.0000147 epoch_Time:245.0min:
[2023-12-25 16:47:51,332][model5_pretrain.py][INFO] Epoch:[0/1](73700/112548) loss:2.279 lr:0.0000145 epoch_Time:244.0min:
[2023-12-25 16:48:29,050][model5_pretrain.py][INFO] Epoch:[0/1](73800/112548) loss:2.795 lr:0.0000144 epoch_Time:243.0min:
[2023-12-25 16:49:06,770][model5_pretrain.py][INFO] Epoch:[0/1](73900/112548) loss:3.090 lr:0.0000142 epoch_Time:243.0min:
[2023-12-25 16:49:44,482][model5_pretrain.py][INFO] Epoch:[0/1](74000/112548) loss:2.645 lr:0.0000141 epoch_Time:242.0min:
[2023-12-25 16:50:22,192][model5_pretrain.py][INFO] Epoch:[0/1](74100/112548) loss:2.711 lr:0.0000140 epoch_Time:241.0min:
[2023-12-25 16:50:59,917][model5_pretrain.py][INFO] Epoch:[0/1](74200/112548) loss:3.051 lr:0.0000138 epoch_Time:241.0min:
[2023-12-25 16:51:37,638][model5_pretrain.py][INFO] Epoch:[0/1](74300/112548) loss:2.922 lr:0.0000137 epoch_Time:240.0min:
[2023-12-25 16:52:15,354][model5_pretrain.py][INFO] Epoch:[0/1](74400/112548) loss:2.734 lr:0.0000136 epoch_Time:240.0min:
[2023-12-25 16:52:53,072][model5_pretrain.py][INFO] Epoch:[0/1](74500/112548) loss:2.583 lr:0.0000135 epoch_Time:239.0min:
[2023-12-25 16:53:30,791][model5_pretrain.py][INFO] Epoch:[0/1](74600/112548) loss:2.817 lr:0.0000133 epoch_Time:238.0min:
[2023-12-25 16:54:08,500][model5_pretrain.py][INFO] Epoch:[0/1](74700/112548) loss:2.627 lr:0.0000132 epoch_Time:238.0min:
[2023-12-25 16:54:46,209][model5_pretrain.py][INFO] Epoch:[0/1](74800/112548) loss:2.856 lr:0.0000131 epoch_Time:237.0min:
[2023-12-25 16:55:23,920][model5_pretrain.py][INFO] Epoch:[0/1](74900/112548) loss:3.208 lr:0.0000130 epoch_Time:236.0min:
[2023-12-25 16:56:01,632][model5_pretrain.py][INFO] Epoch:[0/1](75000/112548) loss:2.075 lr:0.0000129 epoch_Time:236.0min:
[2023-12-25 16:56:39,306][model5_pretrain.py][INFO] Epoch:[0/1](75100/112548) loss:2.494 lr:0.0000127 epoch_Time:235.0min:
[2023-12-25 16:57:17,004][model5_pretrain.py][INFO] Epoch:[0/1](75200/112548) loss:2.750 lr:0.0000126 epoch_Time:235.0min:
[2023-12-25 16:57:54,712][model5_pretrain.py][INFO] Epoch:[0/1](75300/112548) loss:2.892 lr:0.0000125 epoch_Time:234.0min:
[2023-12-25 16:58:32,419][model5_pretrain.py][INFO] Epoch:[0/1](75400/112548) loss:3.177 lr:0.0000124 epoch_Time:233.0min:
[2023-12-25 16:59:10,121][model5_pretrain.py][INFO] Epoch:[0/1](75500/112548) loss:3.248 lr:0.0000123 epoch_Time:233.0min:
[2023-12-25 16:59:47,820][model5_pretrain.py][INFO] Epoch:[0/1](75600/112548) loss:3.127 lr:0.0000122 epoch_Time:232.0min:
[2023-12-25 17:00:25,537][model5_pretrain.py][INFO] Epoch:[0/1](75700/112548) loss:3.346 lr:0.0000121 epoch_Time:231.0min:
[2023-12-25 17:01:03,240][model5_pretrain.py][INFO] Epoch:[0/1](75800/112548) loss:2.984 lr:0.0000120 epoch_Time:231.0min:
[2023-12-25 17:01:40,947][model5_pretrain.py][INFO] Epoch:[0/1](75900/112548) loss:2.726 lr:0.0000119 epoch_Time:230.0min:
[2023-12-25 17:02:18,660][model5_pretrain.py][INFO] Epoch:[0/1](76000/112548) loss:2.837 lr:0.0000118 epoch_Time:229.0min:
[2023-12-25 17:02:56,367][model5_pretrain.py][INFO] Epoch:[0/1](76100/112548) loss:3.114 lr:0.0000117 epoch_Time:229.0min:
[2023-12-25 17:03:34,088][model5_pretrain.py][INFO] Epoch:[0/1](76200/112548) loss:2.304 lr:0.0000117 epoch_Time:228.0min:
[2023-12-25 17:04:11,799][model5_pretrain.py][INFO] Epoch:[0/1](76300/112548) loss:2.872 lr:0.0000116 epoch_Time:228.0min:
[2023-12-25 17:04:49,510][model5_pretrain.py][INFO] Epoch:[0/1](76400/112548) loss:3.141 lr:0.0000115 epoch_Time:227.0min:
[2023-12-25 17:05:27,209][model5_pretrain.py][INFO] Epoch:[0/1](76500/112548) loss:2.619 lr:0.0000114 epoch_Time:226.0min:
[2023-12-25 17:06:04,908][model5_pretrain.py][INFO] Epoch:[0/1](76600/112548) loss:2.825 lr:0.0000113 epoch_Time:226.0min:
[2023-12-25 17:06:42,615][model5_pretrain.py][INFO] Epoch:[0/1](76700/112548) loss:2.693 lr:0.0000112 epoch_Time:225.0min:
[2023-12-25 17:07:20,320][model5_pretrain.py][INFO] Epoch:[0/1](76800/112548) loss:2.408 lr:0.0000112 epoch_Time:224.0min:
[2023-12-25 17:07:58,034][model5_pretrain.py][INFO] Epoch:[0/1](76900/112548) loss:1.680 lr:0.0000111 epoch_Time:224.0min:
[2023-12-25 17:08:35,746][model5_pretrain.py][INFO] Epoch:[0/1](77000/112548) loss:2.815 lr:0.0000110 epoch_Time:223.0min:
[2023-12-25 17:09:13,466][model5_pretrain.py][INFO] Epoch:[0/1](77100/112548) loss:3.068 lr:0.0000110 epoch_Time:223.0min:
[2023-12-25 17:09:51,190][model5_pretrain.py][INFO] Epoch:[0/1](77200/112548) loss:2.841 lr:0.0000109 epoch_Time:222.0min:
[2023-12-25 17:10:28,904][model5_pretrain.py][INFO] Epoch:[0/1](77300/112548) loss:2.555 lr:0.0000108 epoch_Time:221.0min:
[2023-12-25 17:11:06,624][model5_pretrain.py][INFO] Epoch:[0/1](77400/112548) loss:2.319 lr:0.0000108 epoch_Time:221.0min:
[2023-12-25 17:11:44,333][model5_pretrain.py][INFO] Epoch:[0/1](77500/112548) loss:2.659 lr:0.0000107 epoch_Time:220.0min:
[2023-12-25 17:12:22,039][model5_pretrain.py][INFO] Epoch:[0/1](77600/112548) loss:2.695 lr:0.0000107 epoch_Time:219.0min:
[2023-12-25 17:12:59,744][model5_pretrain.py][INFO] Epoch:[0/1](77700/112548) loss:2.372 lr:0.0000106 epoch_Time:219.0min:
[2023-12-25 17:13:37,429][model5_pretrain.py][INFO] Epoch:[0/1](77800/112548) loss:2.869 lr:0.0000106 epoch_Time:218.0min:
[2023-12-25 17:14:15,126][model5_pretrain.py][INFO] Epoch:[0/1](77900/112548) loss:1.966 lr:0.0000105 epoch_Time:218.0min:
[2023-12-25 17:14:52,830][model5_pretrain.py][INFO] Epoch:[0/1](78000/112548) loss:2.535 lr:0.0000105 epoch_Time:217.0min:
[2023-12-25 17:15:30,536][model5_pretrain.py][INFO] Epoch:[0/1](78100/112548) loss:2.392 lr:0.0000104 epoch_Time:216.0min:
[2023-12-25 17:16:08,251][model5_pretrain.py][INFO] Epoch:[0/1](78200/112548) loss:2.895 lr:0.0000104 epoch_Time:216.0min:
[2023-12-25 17:16:45,959][model5_pretrain.py][INFO] Epoch:[0/1](78300/112548) loss:3.011 lr:0.0000103 epoch_Time:215.0min:
[2023-12-25 17:17:23,686][model5_pretrain.py][INFO] Epoch:[0/1](78400/112548) loss:2.146 lr:0.0000103 epoch_Time:214.0min:
[2023-12-25 17:18:01,403][model5_pretrain.py][INFO] Epoch:[0/1](78500/112548) loss:3.141 lr:0.0000103 epoch_Time:214.0min:
[2023-12-25 17:18:39,126][model5_pretrain.py][INFO] Epoch:[0/1](78600/112548) loss:2.725 lr:0.0000102 epoch_Time:213.0min:
[2023-12-25 17:19:16,838][model5_pretrain.py][INFO] Epoch:[0/1](78700/112548) loss:3.257 lr:0.0000102 epoch_Time:213.0min:
[2023-12-25 17:19:54,558][model5_pretrain.py][INFO] Epoch:[0/1](78800/112548) loss:2.573 lr:0.0000102 epoch_Time:212.0min:
[2023-12-25 17:20:32,277][model5_pretrain.py][INFO] Epoch:[0/1](78900/112548) loss:2.494 lr:0.0000101 epoch_Time:211.0min:
[2023-12-25 17:21:09,986][model5_pretrain.py][INFO] Epoch:[0/1](79000/112548) loss:2.534 lr:0.0000101 epoch_Time:211.0min:
[2023-12-25 17:21:47,696][model5_pretrain.py][INFO] Epoch:[0/1](79100/112548) loss:2.139 lr:0.0000101 epoch_Time:210.0min:
[2023-12-25 17:22:25,421][model5_pretrain.py][INFO] Epoch:[0/1](79200/112548) loss:2.699 lr:0.0000101 epoch_Time:209.0min:
[2023-12-25 17:23:03,127][model5_pretrain.py][INFO] Epoch:[0/1](79300/112548) loss:2.825 lr:0.0000101 epoch_Time:209.0min:
[2023-12-25 17:23:40,841][model5_pretrain.py][INFO] Epoch:[0/1](79400/112548) loss:2.533 lr:0.0000100 epoch_Time:208.0min:
[2023-12-25 17:24:18,562][model5_pretrain.py][INFO] Epoch:[0/1](79500/112548) loss:3.179 lr:0.0000100 epoch_Time:207.0min:
[2023-12-25 17:24:56,273][model5_pretrain.py][INFO] Epoch:[0/1](79600/112548) loss:2.930 lr:0.0000100 epoch_Time:207.0min:
[2023-12-25 17:25:33,982][model5_pretrain.py][INFO] Epoch:[0/1](79700/112548) loss:2.523 lr:0.0000100 epoch_Time:206.0min:
[2023-12-25 17:26:11,689][model5_pretrain.py][INFO] Epoch:[0/1](79800/112548) loss:2.864 lr:0.0000100 epoch_Time:206.0min:
[2023-12-25 17:26:49,397][model5_pretrain.py][INFO] Epoch:[0/1](79900/112548) loss:3.451 lr:0.0000100 epoch_Time:205.0min:
[2023-12-25 17:27:27,104][model5_pretrain.py][INFO] Epoch:[0/1](80000/112548) loss:2.541 lr:0.0000100 epoch_Time:204.0min:
[2023-12-25 17:28:11,775][model5_pretrain.py][INFO] Epoch:[0/1](80100/112548) loss:2.196 lr:0.0000100 epoch_Time:205.0min:
[2023-12-25 17:28:49,492][model5_pretrain.py][INFO] Epoch:[0/1](80200/112548) loss:2.333 lr:0.0000100 epoch_Time:204.0min:
[2023-12-25 17:29:27,218][model5_pretrain.py][INFO] Epoch:[0/1](80300/112548) loss:2.846 lr:0.0000100 epoch_Time:203.0min:
[2023-12-25 17:30:04,944][model5_pretrain.py][INFO] Epoch:[0/1](80400/112548) loss:2.446 lr:0.0000100 epoch_Time:203.0min:
[2023-12-25 17:30:42,652][model5_pretrain.py][INFO] Epoch:[0/1](80500/112548) loss:2.448 lr:0.0000100 epoch_Time:202.0min:
[2023-12-25 17:31:20,356][model5_pretrain.py][INFO] Epoch:[0/1](80600/112548) loss:2.688 lr:0.0000100 epoch_Time:201.0min:
[2023-12-25 17:31:58,044][model5_pretrain.py][INFO] Epoch:[0/1](80700/112548) loss:2.640 lr:0.0000100 epoch_Time:201.0min:
[2023-12-25 17:32:35,764][model5_pretrain.py][INFO] Epoch:[0/1](80800/112548) loss:2.499 lr:0.0000100 epoch_Time:200.0min:
[2023-12-25 17:33:13,479][model5_pretrain.py][INFO] Epoch:[0/1](80900/112548) loss:2.433 lr:0.0000100 epoch_Time:200.0min:
[2023-12-25 17:33:51,188][model5_pretrain.py][INFO] Epoch:[0/1](81000/112548) loss:2.964 lr:0.0000100 epoch_Time:199.0min:
[2023-12-25 17:34:28,895][model5_pretrain.py][INFO] Epoch:[0/1](81100/112548) loss:2.643 lr:0.0000100 epoch_Time:198.0min:
[2023-12-25 17:35:06,612][model5_pretrain.py][INFO] Epoch:[0/1](81200/112548) loss:2.874 lr:0.0000100 epoch_Time:198.0min:
[2023-12-25 17:35:44,327][model5_pretrain.py][INFO] Epoch:[0/1](81300/112548) loss:2.860 lr:0.0000100 epoch_Time:197.0min:
[2023-12-25 17:36:22,036][model5_pretrain.py][INFO] Epoch:[0/1](81400/112548) loss:2.936 lr:0.0000100 epoch_Time:196.0min:
[2023-12-25 17:36:59,736][model5_pretrain.py][INFO] Epoch:[0/1](81500/112548) loss:2.721 lr:0.0000100 epoch_Time:196.0min:
[2023-12-25 17:37:37,446][model5_pretrain.py][INFO] Epoch:[0/1](81600/112548) loss:2.693 lr:0.0000100 epoch_Time:195.0min:
[2023-12-25 17:38:15,174][model5_pretrain.py][INFO] Epoch:[0/1](81700/112548) loss:3.128 lr:0.0000100 epoch_Time:195.0min:
[2023-12-25 17:38:52,892][model5_pretrain.py][INFO] Epoch:[0/1](81800/112548) loss:2.775 lr:0.0000100 epoch_Time:194.0min:
[2023-12-25 17:39:30,606][model5_pretrain.py][INFO] Epoch:[0/1](81900/112548) loss:2.561 lr:0.0000100 epoch_Time:193.0min:
[2023-12-25 17:40:08,319][model5_pretrain.py][INFO] Epoch:[0/1](82000/112548) loss:2.662 lr:0.0000100 epoch_Time:193.0min:
[2023-12-25 17:40:46,027][model5_pretrain.py][INFO] Epoch:[0/1](82100/112548) loss:2.607 lr:0.0000100 epoch_Time:192.0min:
[2023-12-25 17:41:23,738][model5_pretrain.py][INFO] Epoch:[0/1](82200/112548) loss:3.006 lr:0.0000100 epoch_Time:191.0min:
[2023-12-25 17:42:01,480][model5_pretrain.py][INFO] Epoch:[0/1](82300/112548) loss:2.414 lr:0.0000100 epoch_Time:191.0min:
[2023-12-25 17:42:39,195][model5_pretrain.py][INFO] Epoch:[0/1](82400/112548) loss:2.775 lr:0.0000100 epoch_Time:190.0min:
[2023-12-25 17:43:16,908][model5_pretrain.py][INFO] Epoch:[0/1](82500/112548) loss:2.558 lr:0.0000100 epoch_Time:190.0min:
[2023-12-25 17:43:54,624][model5_pretrain.py][INFO] Epoch:[0/1](82600/112548) loss:2.880 lr:0.0000100 epoch_Time:189.0min:
[2023-12-25 17:44:32,338][model5_pretrain.py][INFO] Epoch:[0/1](82700/112548) loss:3.014 lr:0.0000100 epoch_Time:188.0min:
[2023-12-25 17:45:10,058][model5_pretrain.py][INFO] Epoch:[0/1](82800/112548) loss:2.753 lr:0.0000100 epoch_Time:188.0min:
[2023-12-25 17:45:47,775][model5_pretrain.py][INFO] Epoch:[0/1](82900/112548) loss:2.555 lr:0.0000100 epoch_Time:187.0min:
[2023-12-25 17:46:25,488][model5_pretrain.py][INFO] Epoch:[0/1](83000/112548) loss:2.753 lr:0.0000100 epoch_Time:186.0min:
[2023-12-25 17:47:03,198][model5_pretrain.py][INFO] Epoch:[0/1](83100/112548) loss:2.592 lr:0.0000100 epoch_Time:186.0min:
[2023-12-25 17:47:40,909][model5_pretrain.py][INFO] Epoch:[0/1](83200/112548) loss:3.173 lr:0.0000100 epoch_Time:185.0min:
[2023-12-25 17:48:18,620][model5_pretrain.py][INFO] Epoch:[0/1](83300/112548) loss:3.091 lr:0.0000100 epoch_Time:184.0min:
[2023-12-25 17:48:56,327][model5_pretrain.py][INFO] Epoch:[0/1](83400/112548) loss:2.930 lr:0.0000100 epoch_Time:184.0min:
[2023-12-25 17:49:34,036][model5_pretrain.py][INFO] Epoch:[0/1](83500/112548) loss:2.892 lr:0.0000100 epoch_Time:183.0min:
[2023-12-25 17:50:11,742][model5_pretrain.py][INFO] Epoch:[0/1](83600/112548) loss:2.147 lr:0.0000100 epoch_Time:183.0min:
[2023-12-25 17:50:49,447][model5_pretrain.py][INFO] Epoch:[0/1](83700/112548) loss:2.720 lr:0.0000100 epoch_Time:182.0min:
[2023-12-25 17:51:27,155][model5_pretrain.py][INFO] Epoch:[0/1](83800/112548) loss:3.207 lr:0.0000100 epoch_Time:181.0min:
[2023-12-25 17:52:04,871][model5_pretrain.py][INFO] Epoch:[0/1](83900/112548) loss:2.750 lr:0.0000100 epoch_Time:181.0min:
[2023-12-25 17:52:42,583][model5_pretrain.py][INFO] Epoch:[0/1](84000/112548) loss:2.552 lr:0.0000100 epoch_Time:180.0min:
[2023-12-25 17:53:20,300][model5_pretrain.py][INFO] Epoch:[0/1](84100/112548) loss:3.061 lr:0.0000100 epoch_Time:178.0min:
[2023-12-25 17:53:58,021][model5_pretrain.py][INFO] Epoch:[0/1](84200/112548) loss:2.523 lr:0.0000100 epoch_Time:178.0min:
[2023-12-25 17:54:35,737][model5_pretrain.py][INFO] Epoch:[0/1](84300/112548) loss:2.617 lr:0.0000100 epoch_Time:177.0min:
[2023-12-25 17:55:13,443][model5_pretrain.py][INFO] Epoch:[0/1](84400/112548) loss:3.078 lr:0.0000100 epoch_Time:177.0min:
[2023-12-25 17:55:51,163][model5_pretrain.py][INFO] Epoch:[0/1](84500/112548) loss:2.623 lr:0.0000100 epoch_Time:176.0min:
[2023-12-25 17:56:28,874][model5_pretrain.py][INFO] Epoch:[0/1](84600/112548) loss:2.449 lr:0.0000100 epoch_Time:175.0min:
[2023-12-25 17:57:06,592][model5_pretrain.py][INFO] Epoch:[0/1](84700/112548) loss:2.749 lr:0.0000100 epoch_Time:175.0min:
[2023-12-25 17:57:44,314][model5_pretrain.py][INFO] Epoch:[0/1](84800/112548) loss:3.015 lr:0.0000100 epoch_Time:174.0min:
[2023-12-25 17:58:22,001][model5_pretrain.py][INFO] Epoch:[0/1](84900/112548) loss:2.620 lr:0.0000100 epoch_Time:173.0min:
[2023-12-25 17:58:59,704][model5_pretrain.py][INFO] Epoch:[0/1](85000/112548) loss:2.586 lr:0.0000100 epoch_Time:173.0min:
[2023-12-25 17:59:37,417][model5_pretrain.py][INFO] Epoch:[0/1](85100/112548) loss:3.194 lr:0.0000100 epoch_Time:172.0min:
[2023-12-25 18:00:15,123][model5_pretrain.py][INFO] Epoch:[0/1](85200/112548) loss:2.792 lr:0.0000100 epoch_Time:172.0min:
[2023-12-25 18:00:52,827][model5_pretrain.py][INFO] Epoch:[0/1](85300/112548) loss:2.350 lr:0.0000100 epoch_Time:171.0min:
[2023-12-25 18:01:30,536][model5_pretrain.py][INFO] Epoch:[0/1](85400/112548) loss:2.921 lr:0.0000100 epoch_Time:170.0min:
[2023-12-25 18:02:08,249][model5_pretrain.py][INFO] Epoch:[0/1](85500/112548) loss:2.337 lr:0.0000100 epoch_Time:170.0min:
[2023-12-25 18:02:45,959][model5_pretrain.py][INFO] Epoch:[0/1](85600/112548) loss:2.435 lr:0.0000100 epoch_Time:169.0min:
[2023-12-25 18:03:23,680][model5_pretrain.py][INFO] Epoch:[0/1](85700/112548) loss:3.040 lr:0.0000100 epoch_Time:168.0min:
[2023-12-25 18:04:01,394][model5_pretrain.py][INFO] Epoch:[0/1](85800/112548) loss:2.588 lr:0.0000100 epoch_Time:168.0min:
[2023-12-25 18:04:39,096][model5_pretrain.py][INFO] Epoch:[0/1](85900/112548) loss:2.657 lr:0.0000100 epoch_Time:167.0min:
[2023-12-25 18:05:16,830][model5_pretrain.py][INFO] Epoch:[0/1](86000/112548) loss:2.500 lr:0.0000100 epoch_Time:167.0min:
[2023-12-25 18:05:54,551][model5_pretrain.py][INFO] Epoch:[0/1](86100/112548) loss:2.392 lr:0.0000100 epoch_Time:166.0min:
[2023-12-25 18:06:32,271][model5_pretrain.py][INFO] Epoch:[0/1](86200/112548) loss:1.924 lr:0.0000100 epoch_Time:165.0min:
[2023-12-25 18:07:10,016][model5_pretrain.py][INFO] Epoch:[0/1](86300/112548) loss:2.713 lr:0.0000100 epoch_Time:165.0min:
[2023-12-25 18:07:47,885][model5_pretrain.py][INFO] Epoch:[0/1](86400/112548) loss:2.580 lr:0.0000100 epoch_Time:164.0min:
[2023-12-25 18:08:25,558][model5_pretrain.py][INFO] Epoch:[0/1](86500/112548) loss:2.595 lr:0.0000100 epoch_Time:163.0min:
[2023-12-25 18:09:03,267][model5_pretrain.py][INFO] Epoch:[0/1](86600/112548) loss:2.809 lr:0.0000100 epoch_Time:163.0min:
[2023-12-25 18:09:40,979][model5_pretrain.py][INFO] Epoch:[0/1](86700/112548) loss:2.242 lr:0.0000100 epoch_Time:162.0min:
[2023-12-25 18:10:18,695][model5_pretrain.py][INFO] Epoch:[0/1](86800/112548) loss:2.455 lr:0.0000100 epoch_Time:161.0min:
[2023-12-25 18:10:56,403][model5_pretrain.py][INFO] Epoch:[0/1](86900/112548) loss:2.663 lr:0.0000100 epoch_Time:161.0min:
[2023-12-25 18:11:34,104][model5_pretrain.py][INFO] Epoch:[0/1](87000/112548) loss:2.771 lr:0.0000100 epoch_Time:160.0min:
[2023-12-25 18:12:11,808][model5_pretrain.py][INFO] Epoch:[0/1](87100/112548) loss:2.267 lr:0.0000100 epoch_Time:160.0min:
[2023-12-25 18:12:49,512][model5_pretrain.py][INFO] Epoch:[0/1](87200/112548) loss:3.444 lr:0.0000100 epoch_Time:159.0min:
[2023-12-25 18:13:27,226][model5_pretrain.py][INFO] Epoch:[0/1](87300/112548) loss:2.694 lr:0.0000100 epoch_Time:158.0min:
[2023-12-25 18:14:04,939][model5_pretrain.py][INFO] Epoch:[0/1](87400/112548) loss:2.975 lr:0.0000100 epoch_Time:158.0min:
[2023-12-25 18:14:42,643][model5_pretrain.py][INFO] Epoch:[0/1](87500/112548) loss:2.183 lr:0.0000100 epoch_Time:157.0min:
[2023-12-25 18:15:20,358][model5_pretrain.py][INFO] Epoch:[0/1](87600/112548) loss:2.996 lr:0.0000100 epoch_Time:156.0min:
[2023-12-25 18:15:58,068][model5_pretrain.py][INFO] Epoch:[0/1](87700/112548) loss:2.422 lr:0.0000100 epoch_Time:156.0min:
[2023-12-25 18:16:35,777][model5_pretrain.py][INFO] Epoch:[0/1](87800/112548) loss:3.287 lr:0.0000100 epoch_Time:155.0min:
[2023-12-25 18:17:13,491][model5_pretrain.py][INFO] Epoch:[0/1](87900/112548) loss:2.652 lr:0.0000100 epoch_Time:155.0min:
[2023-12-25 18:17:51,219][model5_pretrain.py][INFO] Epoch:[0/1](88000/112548) loss:2.703 lr:0.0000100 epoch_Time:154.0min:
[2023-12-25 18:18:28,935][model5_pretrain.py][INFO] Epoch:[0/1](88100/112548) loss:3.202 lr:0.0000100 epoch_Time:153.0min:
[2023-12-25 18:19:06,648][model5_pretrain.py][INFO] Epoch:[0/1](88200/112548) loss:2.065 lr:0.0000100 epoch_Time:153.0min:
[2023-12-25 18:19:44,362][model5_pretrain.py][INFO] Epoch:[0/1](88300/112548) loss:2.101 lr:0.0000100 epoch_Time:152.0min:
[2023-12-25 18:20:22,082][model5_pretrain.py][INFO] Epoch:[0/1](88400/112548) loss:2.846 lr:0.0000100 epoch_Time:151.0min:
[2023-12-25 18:20:59,798][model5_pretrain.py][INFO] Epoch:[0/1](88500/112548) loss:2.428 lr:0.0000100 epoch_Time:151.0min:
[2023-12-25 18:21:37,509][model5_pretrain.py][INFO] Epoch:[0/1](88600/112548) loss:2.646 lr:0.0000100 epoch_Time:150.0min:
[2023-12-25 18:22:15,225][model5_pretrain.py][INFO] Epoch:[0/1](88700/112548) loss:2.629 lr:0.0000100 epoch_Time:150.0min:
[2023-12-25 18:22:52,929][model5_pretrain.py][INFO] Epoch:[0/1](88800/112548) loss:2.740 lr:0.0000100 epoch_Time:149.0min:
[2023-12-25 18:23:30,643][model5_pretrain.py][INFO] Epoch:[0/1](88900/112548) loss:1.655 lr:0.0000100 epoch_Time:148.0min:
[2023-12-25 18:24:08,359][model5_pretrain.py][INFO] Epoch:[0/1](89000/112548) loss:2.636 lr:0.0000100 epoch_Time:148.0min:
[2023-12-25 18:24:46,080][model5_pretrain.py][INFO] Epoch:[0/1](89100/112548) loss:2.732 lr:0.0000100 epoch_Time:147.0min:
[2023-12-25 18:25:23,803][model5_pretrain.py][INFO] Epoch:[0/1](89200/112548) loss:2.902 lr:0.0000100 epoch_Time:146.0min:
[2023-12-25 18:26:01,522][model5_pretrain.py][INFO] Epoch:[0/1](89300/112548) loss:3.011 lr:0.0000100 epoch_Time:146.0min:
[2023-12-25 18:26:39,242][model5_pretrain.py][INFO] Epoch:[0/1](89400/112548) loss:2.528 lr:0.0000100 epoch_Time:145.0min:
[2023-12-25 18:27:16,955][model5_pretrain.py][INFO] Epoch:[0/1](89500/112548) loss:3.181 lr:0.0000100 epoch_Time:145.0min:
[2023-12-25 18:27:54,679][model5_pretrain.py][INFO] Epoch:[0/1](89600/112548) loss:2.272 lr:0.0000100 epoch_Time:144.0min:
[2023-12-25 18:28:32,385][model5_pretrain.py][INFO] Epoch:[0/1](89700/112548) loss:2.132 lr:0.0000100 epoch_Time:143.0min:
[2023-12-25 18:29:10,093][model5_pretrain.py][INFO] Epoch:[0/1](89800/112548) loss:2.847 lr:0.0000100 epoch_Time:143.0min:
[2023-12-25 18:29:47,807][model5_pretrain.py][INFO] Epoch:[0/1](89900/112548) loss:2.177 lr:0.0000100 epoch_Time:142.0min:
[2023-12-25 18:30:25,528][model5_pretrain.py][INFO] Epoch:[0/1](90000/112548) loss:2.819 lr:0.0000100 epoch_Time:141.0min:
[2023-12-25 18:31:03,239][model5_pretrain.py][INFO] Epoch:[0/1](90100/112548) loss:2.453 lr:0.0000100 epoch_Time:141.0min:
[2023-12-25 18:31:40,948][model5_pretrain.py][INFO] Epoch:[0/1](90200/112548) loss:2.609 lr:0.0000100 epoch_Time:140.0min:
[2023-12-25 18:32:18,672][model5_pretrain.py][INFO] Epoch:[0/1](90300/112548) loss:2.464 lr:0.0000100 epoch_Time:139.0min:
[2023-12-25 18:32:56,408][model5_pretrain.py][INFO] Epoch:[0/1](90400/112548) loss:2.728 lr:0.0000100 epoch_Time:139.0min:
[2023-12-25 18:33:34,117][model5_pretrain.py][INFO] Epoch:[0/1](90500/112548) loss:2.762 lr:0.0000100 epoch_Time:138.0min:
[2023-12-25 18:34:11,868][model5_pretrain.py][INFO] Epoch:[0/1](90600/112548) loss:2.484 lr:0.0000100 epoch_Time:138.0min:
[2023-12-25 18:34:49,543][model5_pretrain.py][INFO] Epoch:[0/1](90700/112548) loss:2.597 lr:0.0000100 epoch_Time:137.0min:
[2023-12-25 18:35:27,254][model5_pretrain.py][INFO] Epoch:[0/1](90800/112548) loss:2.347 lr:0.0000100 epoch_Time:136.0min:
[2023-12-25 18:36:04,965][model5_pretrain.py][INFO] Epoch:[0/1](90900/112548) loss:2.628 lr:0.0000100 epoch_Time:136.0min:
[2023-12-25 18:36:42,673][model5_pretrain.py][INFO] Epoch:[0/1](91000/112548) loss:2.438 lr:0.0000100 epoch_Time:135.0min:
[2023-12-25 18:37:20,395][model5_pretrain.py][INFO] Epoch:[0/1](91100/112548) loss:2.495 lr:0.0000100 epoch_Time:134.0min:
[2023-12-25 18:37:58,104][model5_pretrain.py][INFO] Epoch:[0/1](91200/112548) loss:2.761 lr:0.0000100 epoch_Time:134.0min:
[2023-12-25 18:38:35,819][model5_pretrain.py][INFO] Epoch:[0/1](91300/112548) loss:2.318 lr:0.0000100 epoch_Time:133.0min:
[2023-12-25 18:39:13,528][model5_pretrain.py][INFO] Epoch:[0/1](91400/112548) loss:1.812 lr:0.0000100 epoch_Time:133.0min:
[2023-12-25 18:39:51,241][model5_pretrain.py][INFO] Epoch:[0/1](91500/112548) loss:2.919 lr:0.0000100 epoch_Time:132.0min:
[2023-12-25 18:40:28,946][model5_pretrain.py][INFO] Epoch:[0/1](91600/112548) loss:2.426 lr:0.0000100 epoch_Time:131.0min:
[2023-12-25 18:41:06,660][model5_pretrain.py][INFO] Epoch:[0/1](91700/112548) loss:2.901 lr:0.0000100 epoch_Time:131.0min:
[2023-12-25 18:41:44,361][model5_pretrain.py][INFO] Epoch:[0/1](91800/112548) loss:2.171 lr:0.0000100 epoch_Time:130.0min:
[2023-12-25 18:42:22,079][model5_pretrain.py][INFO] Epoch:[0/1](91900/112548) loss:2.685 lr:0.0000100 epoch_Time:129.0min:
[2023-12-25 18:42:59,833][model5_pretrain.py][INFO] Epoch:[0/1](92000/112548) loss:2.157 lr:0.0000100 epoch_Time:129.0min:
[2023-12-25 18:43:37,554][model5_pretrain.py][INFO] Epoch:[0/1](92100/112548) loss:2.608 lr:0.0000100 epoch_Time:128.0min:
[2023-12-25 18:44:15,268][model5_pretrain.py][INFO] Epoch:[0/1](92200/112548) loss:2.474 lr:0.0000100 epoch_Time:128.0min:
[2023-12-25 18:44:52,990][model5_pretrain.py][INFO] Epoch:[0/1](92300/112548) loss:2.744 lr:0.0000100 epoch_Time:127.0min:
[2023-12-25 18:45:30,712][model5_pretrain.py][INFO] Epoch:[0/1](92400/112548) loss:2.621 lr:0.0000100 epoch_Time:126.0min:
[2023-12-25 18:46:08,428][model5_pretrain.py][INFO] Epoch:[0/1](92500/112548) loss:2.277 lr:0.0000100 epoch_Time:126.0min:
[2023-12-25 18:46:46,163][model5_pretrain.py][INFO] Epoch:[0/1](92600/112548) loss:2.422 lr:0.0000100 epoch_Time:125.0min:
[2023-12-25 18:47:23,882][model5_pretrain.py][INFO] Epoch:[0/1](92700/112548) loss:2.168 lr:0.0000100 epoch_Time:124.0min:
[2023-12-25 18:48:01,595][model5_pretrain.py][INFO] Epoch:[0/1](92800/112548) loss:2.820 lr:0.0000100 epoch_Time:124.0min:
[2023-12-25 18:48:39,321][model5_pretrain.py][INFO] Epoch:[0/1](92900/112548) loss:2.379 lr:0.0000100 epoch_Time:123.0min:
[2023-12-25 18:49:17,056][model5_pretrain.py][INFO] Epoch:[0/1](93000/112548) loss:3.245 lr:0.0000100 epoch_Time:123.0min:
[2023-12-25 18:49:54,777][model5_pretrain.py][INFO] Epoch:[0/1](93100/112548) loss:2.568 lr:0.0000100 epoch_Time:122.0min:
[2023-12-25 18:50:32,456][model5_pretrain.py][INFO] Epoch:[0/1](93200/112548) loss:2.636 lr:0.0000100 epoch_Time:121.0min:
[2023-12-25 18:51:10,176][model5_pretrain.py][INFO] Epoch:[0/1](93300/112548) loss:3.349 lr:0.0000100 epoch_Time:121.0min:
[2023-12-25 18:51:47,879][model5_pretrain.py][INFO] Epoch:[0/1](93400/112548) loss:2.404 lr:0.0000100 epoch_Time:120.0min:
[2023-12-25 18:52:25,593][model5_pretrain.py][INFO] Epoch:[0/1](93500/112548) loss:2.434 lr:0.0000100 epoch_Time:119.0min:
[2023-12-25 18:53:03,301][model5_pretrain.py][INFO] Epoch:[0/1](93600/112548) loss:2.387 lr:0.0000100 epoch_Time:119.0min:
[2023-12-25 18:53:41,017][model5_pretrain.py][INFO] Epoch:[0/1](93700/112548) loss:2.599 lr:0.0000100 epoch_Time:118.0min:
[2023-12-25 18:54:18,728][model5_pretrain.py][INFO] Epoch:[0/1](93800/112548) loss:2.922 lr:0.0000100 epoch_Time:117.0min:
[2023-12-25 18:54:56,429][model5_pretrain.py][INFO] Epoch:[0/1](93900/112548) loss:2.327 lr:0.0000100 epoch_Time:117.0min:
[2023-12-25 18:55:34,122][model5_pretrain.py][INFO] Epoch:[0/1](94000/112548) loss:2.134 lr:0.0000100 epoch_Time:116.0min:
[2023-12-25 18:56:11,832][model5_pretrain.py][INFO] Epoch:[0/1](94100/112548) loss:2.612 lr:0.0000100 epoch_Time:116.0min:
[2023-12-25 18:56:49,541][model5_pretrain.py][INFO] Epoch:[0/1](94200/112548) loss:2.724 lr:0.0000100 epoch_Time:115.0min:
[2023-12-25 18:57:27,242][model5_pretrain.py][INFO] Epoch:[0/1](94300/112548) loss:2.018 lr:0.0000100 epoch_Time:114.0min:
[2023-12-25 18:58:04,957][model5_pretrain.py][INFO] Epoch:[0/1](94400/112548) loss:2.727 lr:0.0000100 epoch_Time:114.0min:
[2023-12-25 18:58:42,659][model5_pretrain.py][INFO] Epoch:[0/1](94500/112548) loss:3.076 lr:0.0000100 epoch_Time:113.0min:
[2023-12-25 18:59:20,366][model5_pretrain.py][INFO] Epoch:[0/1](94600/112548) loss:2.576 lr:0.0000100 epoch_Time:112.0min:
[2023-12-25 18:59:58,065][model5_pretrain.py][INFO] Epoch:[0/1](94700/112548) loss:2.413 lr:0.0000100 epoch_Time:112.0min:
[2023-12-25 19:00:35,777][model5_pretrain.py][INFO] Epoch:[0/1](94800/112548) loss:2.816 lr:0.0000100 epoch_Time:111.0min:
[2023-12-25 19:01:13,483][model5_pretrain.py][INFO] Epoch:[0/1](94900/112548) loss:2.534 lr:0.0000100 epoch_Time:111.0min:
[2023-12-25 19:01:51,193][model5_pretrain.py][INFO] Epoch:[0/1](95000/112548) loss:2.639 lr:0.0000100 epoch_Time:110.0min:
[2023-12-25 19:02:28,906][model5_pretrain.py][INFO] Epoch:[0/1](95100/112548) loss:3.025 lr:0.0000100 epoch_Time:109.0min:
[2023-12-25 19:03:06,591][model5_pretrain.py][INFO] Epoch:[0/1](95200/112548) loss:2.549 lr:0.0000100 epoch_Time:109.0min:
[2023-12-25 19:03:44,302][model5_pretrain.py][INFO] Epoch:[0/1](95300/112548) loss:2.578 lr:0.0000100 epoch_Time:108.0min:
[2023-12-25 19:04:22,012][model5_pretrain.py][INFO] Epoch:[0/1](95400/112548) loss:2.923 lr:0.0000100 epoch_Time:107.0min:
[2023-12-25 19:04:59,737][model5_pretrain.py][INFO] Epoch:[0/1](95500/112548) loss:3.032 lr:0.0000100 epoch_Time:107.0min:
[2023-12-25 19:05:37,448][model5_pretrain.py][INFO] Epoch:[0/1](95600/112548) loss:2.590 lr:0.0000100 epoch_Time:106.0min:
[2023-12-25 19:06:15,157][model5_pretrain.py][INFO] Epoch:[0/1](95700/112548) loss:2.625 lr:0.0000100 epoch_Time:106.0min:
[2023-12-25 19:06:52,868][model5_pretrain.py][INFO] Epoch:[0/1](95800/112548) loss:2.848 lr:0.0000100 epoch_Time:105.0min:
[2023-12-25 19:07:30,592][model5_pretrain.py][INFO] Epoch:[0/1](95900/112548) loss:2.675 lr:0.0000100 epoch_Time:104.0min:
[2023-12-25 19:08:08,305][model5_pretrain.py][INFO] Epoch:[0/1](96000/112548) loss:3.057 lr:0.0000100 epoch_Time:104.0min:
[2023-12-25 19:08:46,010][model5_pretrain.py][INFO] Epoch:[0/1](96100/112548) loss:2.941 lr:0.0000100 epoch_Time:103.0min:
[2023-12-25 19:09:23,721][model5_pretrain.py][INFO] Epoch:[0/1](96200/112548) loss:2.513 lr:0.0000100 epoch_Time:102.0min:
[2023-12-25 19:10:01,451][model5_pretrain.py][INFO] Epoch:[0/1](96300/112548) loss:2.999 lr:0.0000100 epoch_Time:102.0min:
[2023-12-25 19:10:39,165][model5_pretrain.py][INFO] Epoch:[0/1](96400/112548) loss:2.643 lr:0.0000100 epoch_Time:101.0min:
[2023-12-25 19:11:16,875][model5_pretrain.py][INFO] Epoch:[0/1](96500/112548) loss:3.063 lr:0.0000100 epoch_Time:101.0min:
[2023-12-25 19:11:54,578][model5_pretrain.py][INFO] Epoch:[0/1](96600/112548) loss:2.539 lr:0.0000100 epoch_Time:100.0min:
[2023-12-25 19:12:32,290][model5_pretrain.py][INFO] Epoch:[0/1](96700/112548) loss:2.714 lr:0.0000100 epoch_Time:99.0min:
[2023-12-25 19:13:09,998][model5_pretrain.py][INFO] Epoch:[0/1](96800/112548) loss:3.096 lr:0.0000100 epoch_Time:99.0min:
[2023-12-25 19:13:47,696][model5_pretrain.py][INFO] Epoch:[0/1](96900/112548) loss:2.160 lr:0.0000100 epoch_Time:98.0min:
[2023-12-25 19:14:25,415][model5_pretrain.py][INFO] Epoch:[0/1](97000/112548) loss:2.668 lr:0.0000100 epoch_Time:97.0min:
[2023-12-25 19:15:03,120][model5_pretrain.py][INFO] Epoch:[0/1](97100/112548) loss:2.920 lr:0.0000100 epoch_Time:97.0min:
[2023-12-25 19:15:40,830][model5_pretrain.py][INFO] Epoch:[0/1](97200/112548) loss:3.030 lr:0.0000100 epoch_Time:96.0min:
[2023-12-25 19:16:18,544][model5_pretrain.py][INFO] Epoch:[0/1](97300/112548) loss:2.416 lr:0.0000100 epoch_Time:95.0min:
[2023-12-25 19:16:56,257][model5_pretrain.py][INFO] Epoch:[0/1](97400/112548) loss:3.095 lr:0.0000100 epoch_Time:95.0min:
[2023-12-25 19:17:33,936][model5_pretrain.py][INFO] Epoch:[0/1](97500/112548) loss:2.990 lr:0.0000100 epoch_Time:94.0min:
[2023-12-25 19:18:11,637][model5_pretrain.py][INFO] Epoch:[0/1](97600/112548) loss:2.449 lr:0.0000100 epoch_Time:94.0min:
[2023-12-25 19:18:49,343][model5_pretrain.py][INFO] Epoch:[0/1](97700/112548) loss:3.180 lr:0.0000100 epoch_Time:93.0min:
[2023-12-25 19:19:27,043][model5_pretrain.py][INFO] Epoch:[0/1](97800/112548) loss:2.433 lr:0.0000100 epoch_Time:92.0min:
[2023-12-25 19:20:04,759][model5_pretrain.py][INFO] Epoch:[0/1](97900/112548) loss:2.446 lr:0.0000100 epoch_Time:92.0min:
[2023-12-25 19:20:42,464][model5_pretrain.py][INFO] Epoch:[0/1](98000/112548) loss:3.001 lr:0.0000100 epoch_Time:91.0min:
[2023-12-25 19:21:20,166][model5_pretrain.py][INFO] Epoch:[0/1](98100/112548) loss:2.308 lr:0.0000100 epoch_Time:90.0min:
[2023-12-25 19:21:57,869][model5_pretrain.py][INFO] Epoch:[0/1](98200/112548) loss:2.553 lr:0.0000100 epoch_Time:90.0min:
[2023-12-25 19:22:35,577][model5_pretrain.py][INFO] Epoch:[0/1](98300/112548) loss:3.157 lr:0.0000100 epoch_Time:89.0min:
[2023-12-25 19:23:13,280][model5_pretrain.py][INFO] Epoch:[0/1](98400/112548) loss:2.828 lr:0.0000100 epoch_Time:89.0min:
[2023-12-25 19:23:50,954][model5_pretrain.py][INFO] Epoch:[0/1](98500/112548) loss:2.506 lr:0.0000100 epoch_Time:88.0min:
[2023-12-25 19:24:28,659][model5_pretrain.py][INFO] Epoch:[0/1](98600/112548) loss:2.832 lr:0.0000100 epoch_Time:87.0min:
[2023-12-25 19:25:06,362][model5_pretrain.py][INFO] Epoch:[0/1](98700/112548) loss:2.641 lr:0.0000100 epoch_Time:87.0min:
[2023-12-25 19:25:44,062][model5_pretrain.py][INFO] Epoch:[0/1](98800/112548) loss:2.508 lr:0.0000100 epoch_Time:86.0min:
[2023-12-25 19:26:21,769][model5_pretrain.py][INFO] Epoch:[0/1](98900/112548) loss:2.922 lr:0.0000100 epoch_Time:85.0min:
[2023-12-25 19:26:59,456][model5_pretrain.py][INFO] Epoch:[0/1](99000/112548) loss:2.739 lr:0.0000100 epoch_Time:85.0min:
[2023-12-25 19:27:37,158][model5_pretrain.py][INFO] Epoch:[0/1](99100/112548) loss:2.846 lr:0.0000100 epoch_Time:84.0min:
[2023-12-25 19:28:14,855][model5_pretrain.py][INFO] Epoch:[0/1](99200/112548) loss:3.071 lr:0.0000100 epoch_Time:84.0min:
[2023-12-25 19:28:52,540][model5_pretrain.py][INFO] Epoch:[0/1](99300/112548) loss:2.660 lr:0.0000100 epoch_Time:83.0min:
[2023-12-25 19:29:30,237][model5_pretrain.py][INFO] Epoch:[0/1](99400/112548) loss:2.631 lr:0.0000100 epoch_Time:82.0min:
[2023-12-25 19:30:07,946][model5_pretrain.py][INFO] Epoch:[0/1](99500/112548) loss:2.456 lr:0.0000100 epoch_Time:82.0min:
[2023-12-25 19:30:45,642][model5_pretrain.py][INFO] Epoch:[0/1](99600/112548) loss:3.165 lr:0.0000100 epoch_Time:81.0min:
[2023-12-25 19:31:23,339][model5_pretrain.py][INFO] Epoch:[0/1](99700/112548) loss:2.384 lr:0.0000100 epoch_Time:80.0min:
[2023-12-25 19:32:01,038][model5_pretrain.py][INFO] Epoch:[0/1](99800/112548) loss:3.062 lr:0.0000100 epoch_Time:80.0min:
[2023-12-25 19:32:38,742][model5_pretrain.py][INFO] Epoch:[0/1](99900/112548) loss:2.474 lr:0.0000100 epoch_Time:79.0min:
[2023-12-25 19:33:16,450][model5_pretrain.py][INFO] Epoch:[0/1](100000/112548) loss:3.114 lr:0.0000100 epoch_Time:79.0min:
[2023-12-25 19:33:59,749][model5_pretrain.py][INFO] Epoch:[0/1](100100/112548) loss:3.096 lr:0.0000100 epoch_Time:79.0min:
[2023-12-25 19:34:37,462][model5_pretrain.py][INFO] Epoch:[0/1](100200/112548) loss:2.194 lr:0.0000100 epoch_Time:78.0min:
[2023-12-25 19:35:15,184][model5_pretrain.py][INFO] Epoch:[0/1](100300/112548) loss:2.436 lr:0.0000100 epoch_Time:78.0min:
[2023-12-25 19:35:52,908][model5_pretrain.py][INFO] Epoch:[0/1](100400/112548) loss:2.462 lr:0.0000100 epoch_Time:77.0min:
[2023-12-25 19:36:30,634][model5_pretrain.py][INFO] Epoch:[0/1](100500/112548) loss:2.388 lr:0.0000100 epoch_Time:76.0min:
[2023-12-25 19:37:08,358][model5_pretrain.py][INFO] Epoch:[0/1](100600/112548) loss:2.533 lr:0.0000100 epoch_Time:76.0min:
[2023-12-25 19:37:46,086][model5_pretrain.py][INFO] Epoch:[0/1](100700/112548) loss:2.696 lr:0.0000100 epoch_Time:75.0min:
[2023-12-25 19:38:23,817][model5_pretrain.py][INFO] Epoch:[0/1](100800/112548) loss:3.879 lr:0.0000100 epoch_Time:74.0min:
[2023-12-25 19:39:01,547][model5_pretrain.py][INFO] Epoch:[0/1](100900/112548) loss:2.135 lr:0.0000100 epoch_Time:74.0min:
[2023-12-25 19:39:39,263][model5_pretrain.py][INFO] Epoch:[0/1](101000/112548) loss:2.525 lr:0.0000100 epoch_Time:73.0min:
[2023-12-25 19:40:16,992][model5_pretrain.py][INFO] Epoch:[0/1](101100/112548) loss:2.387 lr:0.0000100 epoch_Time:73.0min:
[2023-12-25 19:40:54,717][model5_pretrain.py][INFO] Epoch:[0/1](101200/112548) loss:2.355 lr:0.0000100 epoch_Time:72.0min:
[2023-12-25 19:41:32,440][model5_pretrain.py][INFO] Epoch:[0/1](101300/112548) loss:2.871 lr:0.0000100 epoch_Time:71.0min:
[2023-12-25 19:42:10,171][model5_pretrain.py][INFO] Epoch:[0/1](101400/112548) loss:3.112 lr:0.0000100 epoch_Time:71.0min:
[2023-12-25 19:42:47,894][model5_pretrain.py][INFO] Epoch:[0/1](101500/112548) loss:2.360 lr:0.0000100 epoch_Time:70.0min:
[2023-12-25 19:43:25,615][model5_pretrain.py][INFO] Epoch:[0/1](101600/112548) loss:2.313 lr:0.0000100 epoch_Time:69.0min:
[2023-12-25 19:44:03,335][model5_pretrain.py][INFO] Epoch:[0/1](101700/112548) loss:2.186 lr:0.0000100 epoch_Time:69.0min:
[2023-12-25 19:44:41,050][model5_pretrain.py][INFO] Epoch:[0/1](101800/112548) loss:2.620 lr:0.0000100 epoch_Time:68.0min:
[2023-12-25 19:45:18,780][model5_pretrain.py][INFO] Epoch:[0/1](101900/112548) loss:2.728 lr:0.0000100 epoch_Time:67.0min:
[2023-12-25 19:45:56,504][model5_pretrain.py][INFO] Epoch:[0/1](102000/112548) loss:2.979 lr:0.0000100 epoch_Time:67.0min:
[2023-12-25 19:46:34,233][model5_pretrain.py][INFO] Epoch:[0/1](102100/112548) loss:2.120 lr:0.0000100 epoch_Time:66.0min:
[2023-12-25 19:47:11,950][model5_pretrain.py][INFO] Epoch:[0/1](102200/112548) loss:2.719 lr:0.0000100 epoch_Time:65.0min:
[2023-12-25 19:47:49,666][model5_pretrain.py][INFO] Epoch:[0/1](102300/112548) loss:2.349 lr:0.0000100 epoch_Time:64.0min:
[2023-12-25 19:48:27,378][model5_pretrain.py][INFO] Epoch:[0/1](102400/112548) loss:3.185 lr:0.0000100 epoch_Time:63.0min:
[2023-12-25 19:49:05,111][model5_pretrain.py][INFO] Epoch:[0/1](102500/112548) loss:2.485 lr:0.0000100 epoch_Time:63.0min:
[2023-12-25 19:49:42,835][model5_pretrain.py][INFO] Epoch:[0/1](102600/112548) loss:2.754 lr:0.0000100 epoch_Time:62.0min:
[2023-12-25 19:50:20,568][model5_pretrain.py][INFO] Epoch:[0/1](102700/112548) loss:2.318 lr:0.0000100 epoch_Time:61.0min:
[2023-12-25 19:50:58,301][model5_pretrain.py][INFO] Epoch:[0/1](102800/112548) loss:2.573 lr:0.0000100 epoch_Time:61.0min:
[2023-12-25 19:51:36,035][model5_pretrain.py][INFO] Epoch:[0/1](102900/112548) loss:2.699 lr:0.0000100 epoch_Time:60.0min:
[2023-12-25 19:52:13,769][model5_pretrain.py][INFO] Epoch:[0/1](103000/112548) loss:2.632 lr:0.0000100 epoch_Time:60.0min:
[2023-12-25 19:52:51,492][model5_pretrain.py][INFO] Epoch:[0/1](103100/112548) loss:2.837 lr:0.0000100 epoch_Time:59.0min:
[2023-12-25 19:53:29,213][model5_pretrain.py][INFO] Epoch:[0/1](103200/112548) loss:2.390 lr:0.0000100 epoch_Time:58.0min:
[2023-12-25 19:54:06,939][model5_pretrain.py][INFO] Epoch:[0/1](103300/112548) loss:2.519 lr:0.0000100 epoch_Time:58.0min:
[2023-12-25 19:54:44,654][model5_pretrain.py][INFO] Epoch:[0/1](103400/112548) loss:2.889 lr:0.0000100 epoch_Time:57.0min:
[2023-12-25 19:55:22,372][model5_pretrain.py][INFO] Epoch:[0/1](103500/112548) loss:2.586 lr:0.0000100 epoch_Time:56.0min:
[2023-12-25 19:56:00,092][model5_pretrain.py][INFO] Epoch:[0/1](103600/112548) loss:2.464 lr:0.0000100 epoch_Time:56.0min:
[2023-12-25 19:56:37,810][model5_pretrain.py][INFO] Epoch:[0/1](103700/112548) loss:2.329 lr:0.0000100 epoch_Time:55.0min:
[2023-12-25 19:57:15,537][model5_pretrain.py][INFO] Epoch:[0/1](103800/112548) loss:2.700 lr:0.0000100 epoch_Time:55.0min:
[2023-12-25 19:57:53,267][model5_pretrain.py][INFO] Epoch:[0/1](103900/112548) loss:2.713 lr:0.0000100 epoch_Time:54.0min:
[2023-12-25 19:58:31,046][model5_pretrain.py][INFO] Epoch:[0/1](104000/112548) loss:2.238 lr:0.0000100 epoch_Time:53.0min:
[2023-12-25 19:59:08,842][model5_pretrain.py][INFO] Epoch:[0/1](104100/112548) loss:2.752 lr:0.0000100 epoch_Time:53.0min:
[2023-12-25 19:59:46,575][model5_pretrain.py][INFO] Epoch:[0/1](104200/112548) loss:2.676 lr:0.0000100 epoch_Time:52.0min:
[2023-12-25 20:00:24,305][model5_pretrain.py][INFO] Epoch:[0/1](104300/112548) loss:3.073 lr:0.0000100 epoch_Time:51.0min:
[2023-12-25 20:01:02,035][model5_pretrain.py][INFO] Epoch:[0/1](104400/112548) loss:2.831 lr:0.0000100 epoch_Time:51.0min:
[2023-12-25 20:01:39,761][model5_pretrain.py][INFO] Epoch:[0/1](104500/112548) loss:2.594 lr:0.0000100 epoch_Time:50.0min:
[2023-12-25 20:02:17,505][model5_pretrain.py][INFO] Epoch:[0/1](104600/112548) loss:3.038 lr:0.0000100 epoch_Time:49.0min:
[2023-12-25 20:02:55,237][model5_pretrain.py][INFO] Epoch:[0/1](104700/112548) loss:2.711 lr:0.0000100 epoch_Time:49.0min:
[2023-12-25 20:03:33,086][model5_pretrain.py][INFO] Epoch:[0/1](104800/112548) loss:2.583 lr:0.0000100 epoch_Time:48.0min:
[2023-12-25 20:04:10,860][model5_pretrain.py][INFO] Epoch:[0/1](104900/112548) loss:3.136 lr:0.0000100 epoch_Time:48.0min:
[2023-12-25 20:04:48,633][model5_pretrain.py][INFO] Epoch:[0/1](105000/112548) loss:2.287 lr:0.0000100 epoch_Time:47.0min:
[2023-12-25 20:05:26,428][model5_pretrain.py][INFO] Epoch:[0/1](105100/112548) loss:2.044 lr:0.0000100 epoch_Time:46.0min:
[2023-12-25 20:06:04,157][model5_pretrain.py][INFO] Epoch:[0/1](105200/112548) loss:2.919 lr:0.0000100 epoch_Time:46.0min:
[2023-12-25 20:06:41,877][model5_pretrain.py][INFO] Epoch:[0/1](105300/112548) loss:2.666 lr:0.0000100 epoch_Time:45.0min:
[2023-12-25 20:07:19,623][model5_pretrain.py][INFO] Epoch:[0/1](105400/112548) loss:2.599 lr:0.0000100 epoch_Time:44.0min:
[2023-12-25 20:07:57,351][model5_pretrain.py][INFO] Epoch:[0/1](105500/112548) loss:2.588 lr:0.0000100 epoch_Time:44.0min:
[2023-12-25 20:08:35,074][model5_pretrain.py][INFO] Epoch:[0/1](105600/112548) loss:2.512 lr:0.0000100 epoch_Time:43.0min:
[2023-12-25 20:09:12,801][model5_pretrain.py][INFO] Epoch:[0/1](105700/112548) loss:1.638 lr:0.0000100 epoch_Time:43.0min:
[2023-12-25 20:09:50,532][model5_pretrain.py][INFO] Epoch:[0/1](105800/112548) loss:3.073 lr:0.0000100 epoch_Time:42.0min:
[2023-12-25 20:10:28,231][model5_pretrain.py][INFO] Epoch:[0/1](105900/112548) loss:2.893 lr:0.0000100 epoch_Time:41.0min:
[2023-12-25 20:11:05,964][model5_pretrain.py][INFO] Epoch:[0/1](106000/112548) loss:3.040 lr:0.0000100 epoch_Time:41.0min:
[2023-12-25 20:11:43,694][model5_pretrain.py][INFO] Epoch:[0/1](106100/112548) loss:2.535 lr:0.0000100 epoch_Time:40.0min:
[2023-12-25 20:12:21,419][model5_pretrain.py][INFO] Epoch:[0/1](106200/112548) loss:2.649 lr:0.0000100 epoch_Time:39.0min:
[2023-12-25 20:12:59,140][model5_pretrain.py][INFO] Epoch:[0/1](106300/112548) loss:3.070 lr:0.0000100 epoch_Time:39.0min:
[2023-12-25 20:13:36,862][model5_pretrain.py][INFO] Epoch:[0/1](106400/112548) loss:2.227 lr:0.0000100 epoch_Time:38.0min:
[2023-12-25 20:14:14,591][model5_pretrain.py][INFO] Epoch:[0/1](106500/112548) loss:2.477 lr:0.0000100 epoch_Time:38.0min:
[2023-12-25 20:14:52,315][model5_pretrain.py][INFO] Epoch:[0/1](106600/112548) loss:3.027 lr:0.0000100 epoch_Time:37.0min:
[2023-12-25 20:15:30,047][model5_pretrain.py][INFO] Epoch:[0/1](106700/112548) loss:3.070 lr:0.0000100 epoch_Time:36.0min:
[2023-12-25 20:16:07,776][model5_pretrain.py][INFO] Epoch:[0/1](106800/112548) loss:2.762 lr:0.0000100 epoch_Time:36.0min:
[2023-12-25 20:16:45,512][model5_pretrain.py][INFO] Epoch:[0/1](106900/112548) loss:2.369 lr:0.0000100 epoch_Time:35.0min:
[2023-12-25 20:17:23,235][model5_pretrain.py][INFO] Epoch:[0/1](107000/112548) loss:2.342 lr:0.0000100 epoch_Time:34.0min:
[2023-12-25 20:18:00,963][model5_pretrain.py][INFO] Epoch:[0/1](107100/112548) loss:3.482 lr:0.0000100 epoch_Time:34.0min:
[2023-12-25 20:18:38,678][model5_pretrain.py][INFO] Epoch:[0/1](107200/112548) loss:2.774 lr:0.0000100 epoch_Time:33.0min:
[2023-12-25 20:19:16,407][model5_pretrain.py][INFO] Epoch:[0/1](107300/112548) loss:2.790 lr:0.0000100 epoch_Time:33.0min:
[2023-12-25 20:19:54,234][model5_pretrain.py][INFO] Epoch:[0/1](107400/112548) loss:2.808 lr:0.0000100 epoch_Time:32.0min:
[2023-12-25 20:20:31,955][model5_pretrain.py][INFO] Epoch:[0/1](107500/112548) loss:2.658 lr:0.0000100 epoch_Time:31.0min:
[2023-12-25 20:21:09,684][model5_pretrain.py][INFO] Epoch:[0/1](107600/112548) loss:2.795 lr:0.0000100 epoch_Time:31.0min:
[2023-12-25 20:21:47,406][model5_pretrain.py][INFO] Epoch:[0/1](107700/112548) loss:2.708 lr:0.0000100 epoch_Time:30.0min:
[2023-12-25 20:22:25,134][model5_pretrain.py][INFO] Epoch:[0/1](107800/112548) loss:2.714 lr:0.0000100 epoch_Time:29.0min:
[2023-12-25 20:23:02,860][model5_pretrain.py][INFO] Epoch:[0/1](107900/112548) loss:2.377 lr:0.0000100 epoch_Time:29.0min:
[2023-12-25 20:23:40,593][model5_pretrain.py][INFO] Epoch:[0/1](108000/112548) loss:2.932 lr:0.0000100 epoch_Time:28.0min:
[2023-12-25 20:24:18,324][model5_pretrain.py][INFO] Epoch:[0/1](108100/112548) loss:1.815 lr:0.0000100 epoch_Time:27.0min:
[2023-12-25 20:24:56,022][model5_pretrain.py][INFO] Epoch:[0/1](108200/112548) loss:2.560 lr:0.0000100 epoch_Time:27.0min:
[2023-12-25 20:25:33,710][model5_pretrain.py][INFO] Epoch:[0/1](108300/112548) loss:2.550 lr:0.0000100 epoch_Time:26.0min:
[2023-12-25 20:26:11,437][model5_pretrain.py][INFO] Epoch:[0/1](108400/112548) loss:2.663 lr:0.0000100 epoch_Time:26.0min:
[2023-12-25 20:26:49,157][model5_pretrain.py][INFO] Epoch:[0/1](108500/112548) loss:3.081 lr:0.0000100 epoch_Time:25.0min:
[2023-12-25 20:27:26,882][model5_pretrain.py][INFO] Epoch:[0/1](108600/112548) loss:2.882 lr:0.0000100 epoch_Time:24.0min:
[2023-12-25 20:28:04,599][model5_pretrain.py][INFO] Epoch:[0/1](108700/112548) loss:2.066 lr:0.0000100 epoch_Time:24.0min:
[2023-12-25 20:28:42,314][model5_pretrain.py][INFO] Epoch:[0/1](108800/112548) loss:2.932 lr:0.0000100 epoch_Time:23.0min:
[2023-12-25 20:29:20,041][model5_pretrain.py][INFO] Epoch:[0/1](108900/112548) loss:2.774 lr:0.0000100 epoch_Time:22.0min:
[2023-12-25 20:29:57,753][model5_pretrain.py][INFO] Epoch:[0/1](109000/112548) loss:2.913 lr:0.0000100 epoch_Time:22.0min:
[2023-12-25 20:30:35,478][model5_pretrain.py][INFO] Epoch:[0/1](109100/112548) loss:2.679 lr:0.0000100 epoch_Time:21.0min:
[2023-12-25 20:31:13,290][model5_pretrain.py][INFO] Epoch:[0/1](109200/112548) loss:2.989 lr:0.0000100 epoch_Time:21.0min:
[2023-12-25 20:31:51,016][model5_pretrain.py][INFO] Epoch:[0/1](109300/112548) loss:3.029 lr:0.0000100 epoch_Time:20.0min:
[2023-12-25 20:32:28,749][model5_pretrain.py][INFO] Epoch:[0/1](109400/112548) loss:2.729 lr:0.0000100 epoch_Time:19.0min:
[2023-12-25 20:33:06,478][model5_pretrain.py][INFO] Epoch:[0/1](109500/112548) loss:2.950 lr:0.0000100 epoch_Time:19.0min:
[2023-12-25 20:33:44,205][model5_pretrain.py][INFO] Epoch:[0/1](109600/112548) loss:2.090 lr:0.0000100 epoch_Time:18.0min:
[2023-12-25 20:34:21,920][model5_pretrain.py][INFO] Epoch:[0/1](109700/112548) loss:2.934 lr:0.0000100 epoch_Time:17.0min:
[2023-12-25 20:34:59,632][model5_pretrain.py][INFO] Epoch:[0/1](109800/112548) loss:2.240 lr:0.0000100 epoch_Time:17.0min:
[2023-12-25 20:35:37,346][model5_pretrain.py][INFO] Epoch:[0/1](109900/112548) loss:2.413 lr:0.0000100 epoch_Time:16.0min:
[2023-12-25 20:36:15,070][model5_pretrain.py][INFO] Epoch:[0/1](110000/112548) loss:2.522 lr:0.0000100 epoch_Time:16.0min:
[2023-12-25 20:36:52,780][model5_pretrain.py][INFO] Epoch:[0/1](110100/112548) loss:2.433 lr:0.0000100 epoch_Time:15.0min:
[2023-12-25 20:37:30,501][model5_pretrain.py][INFO] Epoch:[0/1](110200/112548) loss:2.853 lr:0.0000100 epoch_Time:14.0min:
[2023-12-25 20:38:08,221][model5_pretrain.py][INFO] Epoch:[0/1](110300/112548) loss:2.486 lr:0.0000100 epoch_Time:14.0min:
[2023-12-25 20:38:45,945][model5_pretrain.py][INFO] Epoch:[0/1](110400/112548) loss:2.451 lr:0.0000100 epoch_Time:13.0min:
[2023-12-25 20:39:23,664][model5_pretrain.py][INFO] Epoch:[0/1](110500/112548) loss:3.050 lr:0.0000100 epoch_Time:12.0min:
[2023-12-25 20:40:01,382][model5_pretrain.py][INFO] Epoch:[0/1](110600/112548) loss:3.025 lr:0.0000100 epoch_Time:12.0min:
[2023-12-25 20:40:39,113][model5_pretrain.py][INFO] Epoch:[0/1](110700/112548) loss:2.725 lr:0.0000100 epoch_Time:11.0min:
[2023-12-25 20:41:16,840][model5_pretrain.py][INFO] Epoch:[0/1](110800/112548) loss:2.089 lr:0.0000100 epoch_Time:11.0min:
[2023-12-25 20:41:54,562][model5_pretrain.py][INFO] Epoch:[0/1](110900/112548) loss:2.659 lr:0.0000100 epoch_Time:10.0min:
[2023-12-25 20:42:32,290][model5_pretrain.py][INFO] Epoch:[0/1](111000/112548) loss:2.447 lr:0.0000100 epoch_Time:9.0min:
[2023-12-25 20:43:10,024][model5_pretrain.py][INFO] Epoch:[0/1](111100/112548) loss:2.109 lr:0.0000100 epoch_Time:9.0min:
[2023-12-25 20:43:47,751][model5_pretrain.py][INFO] Epoch:[0/1](111200/112548) loss:2.149 lr:0.0000100 epoch_Time:8.0min:
[2023-12-25 20:44:25,471][model5_pretrain.py][INFO] Epoch:[0/1](111300/112548) loss:2.510 lr:0.0000100 epoch_Time:7.0min:
[2023-12-25 20:45:03,191][model5_pretrain.py][INFO] Epoch:[0/1](111400/112548) loss:2.999 lr:0.0000100 epoch_Time:7.0min:
[2023-12-25 20:45:40,923][model5_pretrain.py][INFO] Epoch:[0/1](111500/112548) loss:2.513 lr:0.0000100 epoch_Time:6.0min:
[2023-12-25 20:46:18,658][model5_pretrain.py][INFO] Epoch:[0/1](111600/112548) loss:2.370 lr:0.0000100 epoch_Time:5.0min:
[2023-12-25 20:46:56,385][model5_pretrain.py][INFO] Epoch:[0/1](111700/112548) loss:2.737 lr:0.0000100 epoch_Time:5.0min:
[2023-12-25 20:47:34,108][model5_pretrain.py][INFO] Epoch:[0/1](111800/112548) loss:2.619 lr:0.0000100 epoch_Time:4.0min:
[2023-12-25 20:48:11,836][model5_pretrain.py][INFO] Epoch:[0/1](111900/112548) loss:3.082 lr:0.0000100 epoch_Time:4.0min:
[2023-12-25 20:48:49,566][model5_pretrain.py][INFO] Epoch:[0/1](112000/112548) loss:2.275 lr:0.0000100 epoch_Time:3.0min:
[2023-12-25 20:49:27,292][model5_pretrain.py][INFO] Epoch:[0/1](112100/112548) loss:2.723 lr:0.0000100 epoch_Time:2.0min:
[2023-12-25 20:50:05,025][model5_pretrain.py][INFO] Epoch:[0/1](112200/112548) loss:2.343 lr:0.0000100 epoch_Time:2.0min:
[2023-12-25 20:50:42,736][model5_pretrain.py][INFO] Epoch:[0/1](112300/112548) loss:2.203 lr:0.0000100 epoch_Time:1.0min:
[2023-12-25 20:51:20,466][model5_pretrain.py][INFO] Epoch:[0/1](112400/112548) loss:2.854 lr:0.0000100 epoch_Time:0.0min:
[2023-12-25 20:51:58,163][model5_pretrain.py][INFO] Epoch:[0/1](112500/112548) loss:3.296 lr:0.0000100 epoch_Time:0.0min:
112100/112548) loss:2.652 lr:0.0000100 epoch_Time:2.0min:
[2023-12-25 20:50:05,025][model5_pretrain.py][INFO] Epoch:[0/1](112200/112548) loss:2.942 lr:0.0000100 epoch_Time:2.0min:
[2023-12-25 20:50:42,735][model5_pretrain.py][INFO] Epoch:[0/1](112300/112548) loss:2.539 lr:0.0000100 epoch_Time:1.0min:
[2023-12-25 20:51:20,465][model5_pretrain.py][INFO] Epoch:[0/1](112400/112548) loss:2.428 lr:0.0000100 epoch_Time:0.0min:
[2023-12-25 20:51:58,163][model5_pretrain.py][INFO] Epoch:[0/1](112500/112548) loss:2.316 lr:0.0000100 epoch_Time:0.0min: