|
Training 1/1 epoch (loss 2.7825): 0%| | 0/1250 [00:08<?, ?it/s]
Training 1/1 epoch (loss 2.7825): 0%| | 1/1250 [00:08<3:02:12, 8.75s/it]
Training 1/1 epoch (loss 2.8381): 0%| | 1/1250 [00:10<3:02:12, 8.75s/it]
Training 1/1 epoch (loss 2.8381): 0%| | 2/1250 [00:10<1:40:15, 4.82s/it]
Training 1/1 epoch (loss 2.7174): 0%| | 2/1250 [00:12<1:40:15, 4.82s/it]
Training 1/1 epoch (loss 2.7174): 0%| | 3/1250 [00:12<1:07:40, 3.26s/it]
Training 1/1 epoch (loss 3.0754): 0%| | 3/1250 [00:13<1:07:40, 3.26s/it]
Training 1/1 epoch (loss 3.0754): 0%| | 4/1250 [00:13<49:48, 2.40s/it]
Training 1/1 epoch (loss 2.8864): 0%| | 4/1250 [00:15<49:48, 2.40s/it]
Training 1/1 epoch (loss 2.8864): 0%| | 5/1250 [00:15<45:42, 2.20s/it]
Training 1/1 epoch (loss 3.0688): 0%| | 5/1250 [00:16<45:42, 2.20s/it]
Training 1/1 epoch (loss 3.0688): 0%| | 6/1250 [00:16<39:33, 1.91s/it]
Training 1/1 epoch (loss 2.8450): 0%| | 6/1250 [00:17<39:33, 1.91s/it]
Training 1/1 epoch (loss 2.8450): 1%| | 7/1250 [00:17<35:06, 1.69s/it]
Training 1/1 epoch (loss 2.7007): 1%| | 7/1250 [00:19<35:06, 1.69s/it]
Training 1/1 epoch (loss 2.7007): 1%| | 8/1250 [00:19<35:49, 1.73s/it]
Training 1/1 epoch (loss 2.8259): 1%| | 8/1250 [00:21<35:49, 1.73s/it]
Training 1/1 epoch (loss 2.8259): 1%| | 9/1250 [00:21<34:16, 1.66s/it]
Training 1/1 epoch (loss 2.6164): 1%| | 9/1250 [00:22<34:16, 1.66s/it]
Training 1/1 epoch (loss 2.6164): 1%| | 10/1250 [00:22<34:54, 1.69s/it]
Training 1/1 epoch (loss 2.8855): 1%| | 10/1250 [00:24<34:54, 1.69s/it]
Training 1/1 epoch (loss 2.8855): 1%| | 11/1250 [00:24<35:24, 1.71s/it]
Training 1/1 epoch (loss 2.8187): 1%| | 11/1250 [00:25<35:24, 1.71s/it]
Training 1/1 epoch (loss 2.8187): 1%| | 12/1250 [00:25<28:11, 1.37s/it]
Training 1/1 epoch (loss 2.7314): 1%| | 12/1250 [00:26<28:11, 1.37s/it]
Training 1/1 epoch (loss 2.7314): 1%| | 13/1250 [00:26<29:29, 1.43s/it]
Training 1/1 epoch (loss 2.5498): 1%| | 13/1250 [00:28<29:29, 1.43s/it]
Training 1/1 epoch (loss 2.5498): 1%| | 14/1250 [00:28<28:50, 1.40s/it]
Training 1/1 epoch (loss 2.8452): 1%| | 14/1250 [00:28<28:50, 1.40s/it]
Training 1/1 epoch (loss 2.8452): 1%| | 15/1250 [00:28<24:25, 1.19s/it]
Training 1/1 epoch (loss 2.7703): 1%| | 15/1250 [00:30<24:25, 1.19s/it]
Training 1/1 epoch (loss 2.7703): 1%|β | 16/1250 [00:30<28:18, 1.38s/it]
Training 1/1 epoch (loss 2.8560): 1%|β | 16/1250 [00:32<28:18, 1.38s/it]
Training 1/1 epoch (loss 2.8560): 1%|β | 17/1250 [00:32<33:30, 1.63s/it]
Training 1/1 epoch (loss 2.9152): 1%|β | 17/1250 [00:33<33:30, 1.63s/it]
Training 1/1 epoch (loss 2.9152): 1%|β | 18/1250 [00:33<26:31, 1.29s/it]
Training 1/1 epoch (loss 2.8806): 1%|β | 18/1250 [00:34<26:31, 1.29s/it]
Training 1/1 epoch (loss 2.8806): 2%|β | 19/1250 [00:34<27:08, 1.32s/it]
Training 1/1 epoch (loss 2.8216): 2%|β | 19/1250 [00:36<27:08, 1.32s/it]
Training 1/1 epoch (loss 2.8216): 2%|β | 20/1250 [00:36<30:16, 1.48s/it]
Training 1/1 epoch (loss 2.7419): 2%|β | 20/1250 [00:37<30:16, 1.48s/it]
Training 1/1 epoch (loss 2.7419): 2%|β | 21/1250 [00:37<25:50, 1.26s/it]
Training 1/1 epoch (loss 3.0364): 2%|β | 21/1250 [00:38<25:50, 1.26s/it]
Training 1/1 epoch (loss 3.0364): 2%|β | 22/1250 [00:38<28:26, 1.39s/it]
Training 1/1 epoch (loss 2.8515): 2%|β | 22/1250 [00:40<28:26, 1.39s/it]
Training 1/1 epoch (loss 2.8515): 2%|β | 23/1250 [00:40<31:26, 1.54s/it]
Training 1/1 epoch (loss 2.4850): 2%|β | 23/1250 [00:41<31:26, 1.54s/it]
Training 1/1 epoch (loss 2.4850): 2%|β | 24/1250 [00:41<25:39, 1.26s/it]
Training 1/1 epoch (loss 2.6985): 2%|β | 24/1250 [00:44<25:39, 1.26s/it]
Training 1/1 epoch (loss 2.6985): 2%|β | 25/1250 [00:44<34:06, 1.67s/it]
Training 1/1 epoch (loss 2.6274): 2%|β | 25/1250 [00:45<34:06, 1.67s/it]
Training 1/1 epoch (loss 2.6274): 2%|β | 26/1250 [00:45<30:46, 1.51s/it]
Training 1/1 epoch (loss 2.9189): 2%|β | 26/1250 [00:47<30:46, 1.51s/it]
Training 1/1 epoch (loss 2.9189): 2%|β | 27/1250 [00:47<33:18, 1.63s/it]
Training 1/1 epoch (loss 2.8557): 2%|β | 27/1250 [00:48<33:18, 1.63s/it]
Training 1/1 epoch (loss 2.8557): 2%|β | 28/1250 [00:48<33:34, 1.65s/it]
Training 1/1 epoch (loss 2.8313): 2%|β | 28/1250 [00:49<33:34, 1.65s/it]
Training 1/1 epoch (loss 2.8313): 2%|β | 29/1250 [00:49<28:21, 1.39s/it]
Training 1/1 epoch (loss 2.8147): 2%|β | 29/1250 [00:51<28:21, 1.39s/it]
Training 1/1 epoch (loss 2.8147): 2%|β | 30/1250 [00:51<33:01, 1.62s/it]
Training 1/1 epoch (loss 3.0220): 2%|β | 30/1250 [00:53<33:01, 1.62s/it]
Training 1/1 epoch (loss 3.0220): 2%|β | 31/1250 [00:53<32:25, 1.60s/it]
Training 1/1 epoch (loss 2.8544): 2%|β | 31/1250 [00:54<32:25, 1.60s/it]
Training 1/1 epoch (loss 2.8544): 3%|β | 32/1250 [00:54<29:28, 1.45s/it]
Training 1/1 epoch (loss 2.8856): 3%|β | 32/1250 [00:56<29:28, 1.45s/it]
Training 1/1 epoch (loss 2.8856): 3%|β | 33/1250 [00:56<30:15, 1.49s/it]
Training 1/1 epoch (loss 2.7000): 3%|β | 33/1250 [00:57<30:15, 1.49s/it]
Training 1/1 epoch (loss 2.7000): 3%|β | 34/1250 [00:57<28:04, 1.39s/it]
Training 1/1 epoch (loss 2.8119): 3%|β | 34/1250 [00:59<28:04, 1.39s/it]
Training 1/1 epoch (loss 2.8119): 3%|β | 35/1250 [00:59<31:11, 1.54s/it]
Training 1/1 epoch (loss 2.8285): 3%|β | 35/1250 [01:00<31:11, 1.54s/it]
Training 1/1 epoch (loss 2.8285): 3%|β | 36/1250 [01:00<33:13, 1.64s/it]
Training 1/1 epoch (loss 2.6431): 3%|β | 36/1250 [01:01<33:13, 1.64s/it]
Training 1/1 epoch (loss 2.6431): 3%|β | 37/1250 [01:01<26:21, 1.30s/it]
Training 1/1 epoch (loss 2.6267): 3%|β | 37/1250 [01:03<26:21, 1.30s/it]
Training 1/1 epoch (loss 2.6267): 3%|β | 38/1250 [01:03<28:55, 1.43s/it]
Training 1/1 epoch (loss 2.9829): 3%|β | 38/1250 [01:04<28:55, 1.43s/it]
Training 1/1 epoch (loss 2.9829): 3%|β | 39/1250 [01:04<29:39, 1.47s/it]
Training 1/1 epoch (loss 2.6993): 3%|β | 39/1250 [01:05<29:39, 1.47s/it]
Training 1/1 epoch (loss 2.6993): 3%|β | 40/1250 [01:05<25:09, 1.25s/it]
Training 1/1 epoch (loss 2.8046): 3%|β | 40/1250 [01:06<25:09, 1.25s/it]
Training 1/1 epoch (loss 2.8046): 3%|β | 41/1250 [01:06<26:33, 1.32s/it]
Training 1/1 epoch (loss 2.6315): 3%|β | 41/1250 [01:08<26:33, 1.32s/it]
Training 1/1 epoch (loss 2.6315): 3%|β | 42/1250 [01:08<26:03, 1.29s/it]
Training 1/1 epoch (loss 2.7297): 3%|β | 42/1250 [01:09<26:03, 1.29s/it]
Training 1/1 epoch (loss 2.7297): 3%|β | 43/1250 [01:09<23:34, 1.17s/it]
Training 1/1 epoch (loss 2.7907): 3%|β | 43/1250 [01:11<23:34, 1.17s/it]
Training 1/1 epoch (loss 2.7907): 4%|β | 44/1250 [01:11<28:25, 1.41s/it]
Training 1/1 epoch (loss 2.7904): 4%|β | 44/1250 [01:12<28:25, 1.41s/it]
Training 1/1 epoch (loss 2.7904): 4%|β | 45/1250 [01:12<26:45, 1.33s/it]
Training 1/1 epoch (loss 2.6014): 4%|β | 45/1250 [01:13<26:45, 1.33s/it]
Training 1/1 epoch (loss 2.6014): 4%|β | 46/1250 [01:13<23:39, 1.18s/it]
Training 1/1 epoch (loss 2.8016): 4%|β | 46/1250 [01:14<23:39, 1.18s/it]
Training 1/1 epoch (loss 2.8016): 4%|β | 47/1250 [01:14<25:54, 1.29s/it]
Training 1/1 epoch (loss 2.8421): 4%|β | 47/1250 [01:15<25:54, 1.29s/it]
Training 1/1 epoch (loss 2.8421): 4%|β | 48/1250 [01:15<25:43, 1.28s/it]
Training 1/1 epoch (loss 2.6627): 4%|β | 48/1250 [01:17<25:43, 1.28s/it]
Training 1/1 epoch (loss 2.6627): 4%|β | 49/1250 [01:17<28:03, 1.40s/it]
Training 1/1 epoch (loss 2.8325): 4%|β | 49/1250 [01:19<28:03, 1.40s/it]
Training 1/1 epoch (loss 2.8325): 4%|β | 50/1250 [01:19<31:27, 1.57s/it]
Training 1/1 epoch (loss 2.7099): 4%|β | 50/1250 [01:20<31:27, 1.57s/it]
Training 1/1 epoch (loss 2.7099): 4%|β | 51/1250 [01:20<26:14, 1.31s/it]
Training 1/1 epoch (loss 2.7463): 4%|β | 51/1250 [01:22<26:14, 1.31s/it]
Training 1/1 epoch (loss 2.7463): 4%|β | 52/1250 [01:22<32:35, 1.63s/it]
Training 1/1 epoch (loss 2.7927): 4%|β | 52/1250 [01:24<32:35, 1.63s/it]
Training 1/1 epoch (loss 2.7927): 4%|β | 53/1250 [01:24<34:11, 1.71s/it]
Training 1/1 epoch (loss 2.6439): 4%|β | 53/1250 [01:24<34:11, 1.71s/it]
Training 1/1 epoch (loss 2.6439): 4%|β | 54/1250 [01:24<26:40, 1.34s/it]
Training 1/1 epoch (loss 2.6755): 4%|β | 54/1250 [01:26<26:40, 1.34s/it]
Training 1/1 epoch (loss 2.6755): 4%|β | 55/1250 [01:26<29:49, 1.50s/it]
Training 1/1 epoch (loss 2.6845): 4%|β | 55/1250 [01:28<29:49, 1.50s/it]
Training 1/1 epoch (loss 2.6845): 4%|β | 56/1250 [01:28<30:07, 1.51s/it]
Training 1/1 epoch (loss 2.6321): 4%|β | 56/1250 [01:29<30:07, 1.51s/it]
Training 1/1 epoch (loss 2.6321): 5%|β | 57/1250 [01:29<25:21, 1.28s/it]
Training 1/1 epoch (loss 2.5127): 5%|β | 57/1250 [01:31<25:21, 1.28s/it]
Training 1/1 epoch (loss 2.5127): 5%|β | 58/1250 [01:31<29:53, 1.50s/it]
Training 1/1 epoch (loss 2.5811): 5%|β | 58/1250 [01:32<29:53, 1.50s/it]
Training 1/1 epoch (loss 2.5811): 5%|β | 59/1250 [01:32<28:19, 1.43s/it]
Training 1/1 epoch (loss 2.7577): 5%|β | 59/1250 [01:33<28:19, 1.43s/it]
Training 1/1 epoch (loss 2.7577): 5%|β | 60/1250 [01:33<24:35, 1.24s/it]
Training 1/1 epoch (loss 2.6248): 5%|β | 60/1250 [01:35<24:35, 1.24s/it]
Training 1/1 epoch (loss 2.6248): 5%|β | 61/1250 [01:35<28:52, 1.46s/it]
Training 1/1 epoch (loss 2.7175): 5%|β | 61/1250 [01:36<28:52, 1.46s/it]
Training 1/1 epoch (loss 2.7175): 5%|β | 62/1250 [01:36<26:39, 1.35s/it]
Training 1/1 epoch (loss 2.8555): 5%|β | 62/1250 [01:37<26:39, 1.35s/it]
Training 1/1 epoch (loss 2.8555): 5%|β | 63/1250 [01:37<28:56, 1.46s/it]
Training 1/1 epoch (loss 2.9321): 5%|β | 63/1250 [01:39<28:56, 1.46s/it]
Training 1/1 epoch (loss 2.9321): 5%|β | 64/1250 [01:39<29:43, 1.50s/it]
Training 1/1 epoch (loss 2.7765): 5%|β | 64/1250 [01:40<29:43, 1.50s/it]
Training 1/1 epoch (loss 2.7765): 5%|β | 65/1250 [01:40<25:17, 1.28s/it]
Training 1/1 epoch (loss 2.6755): 5%|β | 65/1250 [01:41<25:17, 1.28s/it]
Training 1/1 epoch (loss 2.6755): 5%|β | 66/1250 [01:41<24:48, 1.26s/it]
Training 1/1 epoch (loss 2.7054): 5%|β | 66/1250 [01:43<24:48, 1.26s/it]
Training 1/1 epoch (loss 2.7054): 5%|β | 67/1250 [01:43<27:53, 1.41s/it]
Training 1/1 epoch (loss 2.8186): 5%|β | 67/1250 [01:43<27:53, 1.41s/it]
Training 1/1 epoch (loss 2.8186): 5%|β | 68/1250 [01:43<22:54, 1.16s/it]
Training 1/1 epoch (loss 2.6538): 5%|β | 68/1250 [01:46<22:54, 1.16s/it]
Training 1/1 epoch (loss 2.6538): 6%|β | 69/1250 [01:46<29:12, 1.48s/it]
Training 1/1 epoch (loss 2.8031): 6%|β | 69/1250 [01:48<29:12, 1.48s/it]
Training 1/1 epoch (loss 2.8031): 6%|β | 70/1250 [01:48<34:58, 1.78s/it]
Training 1/1 epoch (loss 2.7030): 6%|β | 70/1250 [01:48<34:58, 1.78s/it]
Training 1/1 epoch (loss 2.7030): 6%|β | 71/1250 [01:48<26:52, 1.37s/it]
Training 1/1 epoch (loss 2.6311): 6%|β | 71/1250 [01:50<26:52, 1.37s/it]
Training 1/1 epoch (loss 2.6311): 6%|β | 72/1250 [01:50<29:24, 1.50s/it]
Training 1/1 epoch (loss 2.8330): 6%|β | 72/1250 [01:53<29:24, 1.50s/it]
Training 1/1 epoch (loss 2.8330): 6%|β | 73/1250 [01:53<34:46, 1.77s/it]
Training 1/1 epoch (loss 2.7869): 6%|β | 73/1250 [01:53<34:46, 1.77s/it]
Training 1/1 epoch (loss 2.7869): 6%|β | 74/1250 [01:53<27:15, 1.39s/it]
Training 1/1 epoch (loss 2.6565): 6%|β | 74/1250 [01:55<27:15, 1.39s/it]
Training 1/1 epoch (loss 2.6565): 6%|β | 75/1250 [01:55<28:36, 1.46s/it]
Training 1/1 epoch (loss 2.7496): 6%|β | 75/1250 [01:57<28:36, 1.46s/it]
Training 1/1 epoch (loss 2.7496): 6%|β | 76/1250 [01:57<30:20, 1.55s/it]
Training 1/1 epoch (loss 2.6737): 6%|β | 76/1250 [01:57<30:20, 1.55s/it]
Training 1/1 epoch (loss 2.6737): 6%|β | 77/1250 [01:57<25:48, 1.32s/it]
Training 1/1 epoch (loss 2.8733): 6%|β | 77/1250 [01:59<25:48, 1.32s/it]
Training 1/1 epoch (loss 2.8733): 6%|β | 78/1250 [01:59<26:05, 1.34s/it]
Training 1/1 epoch (loss 2.7228): 6%|β | 78/1250 [02:00<26:05, 1.34s/it]
Training 1/1 epoch (loss 2.7228): 6%|β | 79/1250 [02:00<25:30, 1.31s/it]
Training 1/1 epoch (loss 2.7072): 6%|β | 79/1250 [02:01<25:30, 1.31s/it]
Training 1/1 epoch (loss 2.7072): 6%|β | 80/1250 [02:01<23:31, 1.21s/it]
Training 1/1 epoch (loss 2.8044): 6%|β | 80/1250 [02:03<23:31, 1.21s/it]
Training 1/1 epoch (loss 2.8044): 6%|β | 81/1250 [02:03<28:30, 1.46s/it]
Training 1/1 epoch (loss 2.8267): 6%|β | 81/1250 [02:04<28:30, 1.46s/it]
Training 1/1 epoch (loss 2.8267): 7%|β | 82/1250 [02:04<27:58, 1.44s/it]
Training 1/1 epoch (loss 2.8648): 7%|β | 82/1250 [02:06<27:58, 1.44s/it]
Training 1/1 epoch (loss 2.8648): 7%|β | 83/1250 [02:06<28:44, 1.48s/it]
Training 1/1 epoch (loss 2.7378): 7%|β | 83/1250 [02:08<28:44, 1.48s/it]
Training 1/1 epoch (loss 2.7378): 7%|β | 84/1250 [02:08<29:34, 1.52s/it]
Training 1/1 epoch (loss 2.6156): 7%|β | 84/1250 [02:08<29:34, 1.52s/it]
Training 1/1 epoch (loss 2.6156): 7%|β | 85/1250 [02:08<23:36, 1.22s/it]
Training 1/1 epoch (loss 2.6394): 7%|β | 85/1250 [02:10<23:36, 1.22s/it]
Training 1/1 epoch (loss 2.6394): 7%|β | 86/1250 [02:10<26:24, 1.36s/it]
Training 1/1 epoch (loss 2.8572): 7%|β | 86/1250 [02:12<26:24, 1.36s/it]
Training 1/1 epoch (loss 2.8572): 7%|β | 87/1250 [02:12<29:37, 1.53s/it]
Training 1/1 epoch (loss 2.6215): 7%|β | 87/1250 [02:12<29:37, 1.53s/it]
Training 1/1 epoch (loss 2.6215): 7%|β | 88/1250 [02:12<24:22, 1.26s/it]
Training 1/1 epoch (loss 2.7461): 7%|β | 88/1250 [02:15<24:22, 1.26s/it]
Training 1/1 epoch (loss 2.7461): 7%|β | 89/1250 [02:15<29:46, 1.54s/it]
Training 1/1 epoch (loss 2.9440): 7%|β | 89/1250 [02:16<29:46, 1.54s/it]
Training 1/1 epoch (loss 2.9440): 7%|β | 90/1250 [02:16<28:16, 1.46s/it]
Training 1/1 epoch (loss 2.5917): 7%|β | 90/1250 [02:17<28:16, 1.46s/it]
Training 1/1 epoch (loss 2.5917): 7%|β | 91/1250 [02:17<27:16, 1.41s/it]
Training 1/1 epoch (loss 2.6900): 7%|β | 91/1250 [02:18<27:16, 1.41s/it]
Training 1/1 epoch (loss 2.6900): 7%|β | 92/1250 [02:18<24:42, 1.28s/it]
Training 1/1 epoch (loss 2.6506): 7%|β | 92/1250 [02:19<24:42, 1.28s/it]
Training 1/1 epoch (loss 2.6506): 7%|β | 93/1250 [02:19<25:03, 1.30s/it]
Training 1/1 epoch (loss 2.6501): 7%|β | 93/1250 [02:21<25:03, 1.30s/it]
Training 1/1 epoch (loss 2.6501): 8%|β | 94/1250 [02:21<23:47, 1.24s/it]
Training 1/1 epoch (loss 2.7155): 8%|β | 94/1250 [02:22<23:47, 1.24s/it]
Training 1/1 epoch (loss 2.7155): 8%|β | 95/1250 [02:22<26:28, 1.37s/it]
Training 1/1 epoch (loss 2.7344): 8%|β | 95/1250 [02:24<26:28, 1.37s/it]
Training 1/1 epoch (loss 2.7344): 8%|β | 96/1250 [02:24<26:04, 1.36s/it]
Training 1/1 epoch (loss 2.6499): 8%|β | 96/1250 [02:24<26:04, 1.36s/it]
Training 1/1 epoch (loss 2.6499): 8%|β | 97/1250 [02:24<22:45, 1.18s/it]
Training 1/1 epoch (loss 2.6636): 8%|β | 97/1250 [02:26<22:45, 1.18s/it]
Training 1/1 epoch (loss 2.6636): 8%|β | 98/1250 [02:26<26:24, 1.38s/it]
Training 1/1 epoch (loss 2.7437): 8%|β | 98/1250 [02:27<26:24, 1.38s/it]
Training 1/1 epoch (loss 2.7437): 8%|β | 99/1250 [02:27<24:16, 1.27s/it]
Training 1/1 epoch (loss 2.9754): 8%|β | 99/1250 [02:30<24:16, 1.27s/it]
Training 1/1 epoch (loss 2.9754): 8%|β | 100/1250 [02:30<30:42, 1.60s/it]
Training 1/1 epoch (loss 2.7140): 8%|β | 100/1250 [02:32<30:42, 1.60s/it]
Training 1/1 epoch (loss 2.7140): 8%|β | 101/1250 [02:32<33:56, 1.77s/it]
Training 1/1 epoch (loss 2.6409): 8%|β | 101/1250 [02:33<33:56, 1.77s/it]
Training 1/1 epoch (loss 2.6409): 8%|β | 102/1250 [02:33<29:50, 1.56s/it]
Training 1/1 epoch (loss 2.7096): 8%|β | 102/1250 [02:35<29:50, 1.56s/it]
Training 1/1 epoch (loss 2.7096): 8%|β | 103/1250 [02:35<31:08, 1.63s/it]
Training 1/1 epoch (loss 2.8445): 8%|β | 103/1250 [02:36<31:08, 1.63s/it]
Training 1/1 epoch (loss 2.8445): 8%|β | 104/1250 [02:36<30:03, 1.57s/it]
Training 1/1 epoch (loss 2.6796): 8%|β | 104/1250 [02:37<30:03, 1.57s/it]
Training 1/1 epoch (loss 2.6796): 8%|β | 105/1250 [02:37<29:26, 1.54s/it]
Training 1/1 epoch (loss 2.6712): 8%|β | 105/1250 [02:39<29:26, 1.54s/it]
Training 1/1 epoch (loss 2.6712): 8%|β | 106/1250 [02:39<31:14, 1.64s/it]
Training 1/1 epoch (loss 2.6820): 8%|β | 106/1250 [02:40<31:14, 1.64s/it]
Training 1/1 epoch (loss 2.6820): 9%|β | 107/1250 [02:40<27:09, 1.43s/it]
Training 1/1 epoch (loss 2.9033): 9%|β | 107/1250 [02:42<27:09, 1.43s/it]
Training 1/1 epoch (loss 2.9033): 9%|β | 108/1250 [02:42<29:42, 1.56s/it]
Training 1/1 epoch (loss 2.8078): 9%|β | 108/1250 [02:44<29:42, 1.56s/it]
Training 1/1 epoch (loss 2.8078): 9%|β | 109/1250 [02:44<30:46, 1.62s/it]
Training 1/1 epoch (loss 2.7253): 9%|β | 109/1250 [02:44<30:46, 1.62s/it]
Training 1/1 epoch (loss 2.7253): 9%|β | 110/1250 [02:44<24:49, 1.31s/it]
Training 1/1 epoch (loss 2.7917): 9%|β | 110/1250 [02:46<24:49, 1.31s/it]
Training 1/1 epoch (loss 2.7917): 9%|β | 111/1250 [02:46<25:07, 1.32s/it]
Training 1/1 epoch (loss 2.6320): 9%|β | 111/1250 [02:47<25:07, 1.32s/it]
Training 1/1 epoch (loss 2.6320): 9%|β | 112/1250 [02:47<25:34, 1.35s/it]
Training 1/1 epoch (loss 2.5869): 9%|β | 112/1250 [02:48<25:34, 1.35s/it]
Training 1/1 epoch (loss 2.5869): 9%|β | 113/1250 [02:48<20:54, 1.10s/it]
Training 1/1 epoch (loss 2.5663): 9%|β | 113/1250 [02:50<20:54, 1.10s/it]
Training 1/1 epoch (loss 2.5663): 9%|β | 114/1250 [02:50<25:35, 1.35s/it]
Training 1/1 epoch (loss 2.6244): 9%|β | 114/1250 [02:51<25:35, 1.35s/it]
Training 1/1 epoch (loss 2.6244): 9%|β | 115/1250 [02:51<26:31, 1.40s/it]
Training 1/1 epoch (loss 2.7133): 9%|β | 115/1250 [02:52<26:31, 1.40s/it]
Training 1/1 epoch (loss 2.7133): 9%|β | 116/1250 [02:52<22:50, 1.21s/it]
Training 1/1 epoch (loss 2.6198): 9%|β | 116/1250 [02:54<22:50, 1.21s/it]
Training 1/1 epoch (loss 2.6198): 9%|β | 117/1250 [02:54<29:43, 1.57s/it]
Training 1/1 epoch (loss 2.6369): 9%|β | 117/1250 [02:55<29:43, 1.57s/it]
Training 1/1 epoch (loss 2.6369): 9%|β | 118/1250 [02:55<25:20, 1.34s/it]
Training 1/1 epoch (loss 2.6812): 9%|β | 118/1250 [02:57<25:20, 1.34s/it]
Training 1/1 epoch (loss 2.6812): 10%|β | 119/1250 [02:57<26:00, 1.38s/it]
Training 1/1 epoch (loss 2.8294): 10%|β | 119/1250 [02:58<26:00, 1.38s/it]
Training 1/1 epoch (loss 2.8294): 10%|β | 120/1250 [02:58<26:28, 1.41s/it]
Training 1/1 epoch (loss 2.7543): 10%|β | 120/1250 [02:59<26:28, 1.41s/it]
Training 1/1 epoch (loss 2.7543): 10%|β | 121/1250 [02:59<21:25, 1.14s/it]
Training 1/1 epoch (loss 2.6154): 10%|β | 121/1250 [03:00<21:25, 1.14s/it]
Training 1/1 epoch (loss 2.6154): 10%|β | 122/1250 [03:00<24:15, 1.29s/it]
Training 1/1 epoch (loss 2.7420): 10%|β | 122/1250 [03:02<24:15, 1.29s/it]
Training 1/1 epoch (loss 2.7420): 10%|β | 123/1250 [03:02<25:36, 1.36s/it]
Training 1/1 epoch (loss 2.5794): 10%|β | 123/1250 [03:03<25:36, 1.36s/it]
Training 1/1 epoch (loss 2.5794): 10%|β | 124/1250 [03:03<22:33, 1.20s/it]
Training 1/1 epoch (loss 2.5107): 10%|β | 124/1250 [03:04<22:33, 1.20s/it]
Training 1/1 epoch (loss 2.5107): 10%|β | 125/1250 [03:04<24:36, 1.31s/it]
Training 1/1 epoch (loss 2.7556): 10%|β | 125/1250 [03:06<24:36, 1.31s/it]
Training 1/1 epoch (loss 2.7556): 10%|β | 126/1250 [03:06<24:31, 1.31s/it]
Training 1/1 epoch (loss 2.5796): 10%|β | 126/1250 [03:07<24:31, 1.31s/it]
Training 1/1 epoch (loss 2.5796): 10%|β | 127/1250 [03:07<24:27, 1.31s/it]
Training 1/1 epoch (loss 2.7560): 10%|β | 127/1250 [03:08<24:27, 1.31s/it]
Training 1/1 epoch (loss 2.7560): 10%|β | 128/1250 [03:08<25:49, 1.38s/it]
Training 1/1 epoch (loss 2.6411): 10%|β | 128/1250 [03:09<25:49, 1.38s/it]
Training 1/1 epoch (loss 2.6411): 10%|β | 129/1250 [03:09<23:33, 1.26s/it]
Training 1/1 epoch (loss 2.6505): 10%|β | 129/1250 [03:11<23:33, 1.26s/it]
Training 1/1 epoch (loss 2.6505): 10%|β | 130/1250 [03:11<26:13, 1.40s/it]
Training 1/1 epoch (loss 2.8241): 10%|β | 130/1250 [03:13<26:13, 1.40s/it]
Training 1/1 epoch (loss 2.8241): 10%|β | 131/1250 [03:13<28:08, 1.51s/it]
Training 1/1 epoch (loss 2.8201): 10%|β | 131/1250 [03:13<28:08, 1.51s/it]
Training 1/1 epoch (loss 2.8201): 11%|β | 132/1250 [03:13<22:31, 1.21s/it]
Training 1/1 epoch (loss 2.7562): 11%|β | 132/1250 [03:16<22:31, 1.21s/it]
Training 1/1 epoch (loss 2.7562): 11%|β | 133/1250 [03:16<28:00, 1.50s/it]
Training 1/1 epoch (loss 2.8455): 11%|β | 133/1250 [03:17<28:00, 1.50s/it]
Training 1/1 epoch (loss 2.8455): 11%|β | 134/1250 [03:17<29:59, 1.61s/it]
Training 1/1 epoch (loss 2.7396): 11%|β | 134/1250 [03:19<29:59, 1.61s/it]
Training 1/1 epoch (loss 2.7396): 11%|β | 135/1250 [03:19<27:29, 1.48s/it]
Training 1/1 epoch (loss 2.7006): 11%|β | 135/1250 [03:21<27:29, 1.48s/it]
Training 1/1 epoch (loss 2.7006): 11%|β | 136/1250 [03:21<34:36, 1.86s/it]
Training 1/1 epoch (loss 2.6153): 11%|β | 136/1250 [03:22<34:36, 1.86s/it]
Training 1/1 epoch (loss 2.6153): 11%|β | 137/1250 [03:22<30:29, 1.64s/it]
Training 1/1 epoch (loss 2.6174): 11%|β | 137/1250 [03:24<30:29, 1.64s/it]
Training 1/1 epoch (loss 2.6174): 11%|β | 138/1250 [03:24<31:21, 1.69s/it]
Training 1/1 epoch (loss 2.8204): 11%|β | 138/1250 [03:26<31:21, 1.69s/it]
Training 1/1 epoch (loss 2.8204): 11%|β | 139/1250 [03:26<31:19, 1.69s/it]
Training 1/1 epoch (loss 2.7859): 11%|β | 139/1250 [03:27<31:19, 1.69s/it]
Training 1/1 epoch (loss 2.7859): 11%|β | 140/1250 [03:27<26:16, 1.42s/it]
Training 1/1 epoch (loss 2.6638): 11%|β | 140/1250 [03:29<26:16, 1.42s/it]
Training 1/1 epoch (loss 2.6638): 11%|ββ | 141/1250 [03:29<29:07, 1.58s/it]
Training 1/1 epoch (loss 2.8033): 11%|ββ | 141/1250 [03:31<29:07, 1.58s/it]
Training 1/1 epoch (loss 2.8033): 11%|ββ | 142/1250 [03:31<31:09, 1.69s/it]
Training 1/1 epoch (loss 2.6877): 11%|ββ | 142/1250 [03:31<31:09, 1.69s/it]
Training 1/1 epoch (loss 2.6877): 11%|ββ | 143/1250 [03:31<25:12, 1.37s/it]
Training 1/1 epoch (loss 2.8617): 11%|ββ | 143/1250 [03:34<25:12, 1.37s/it]
Training 1/1 epoch (loss 2.8617): 12%|ββ | 144/1250 [03:34<31:21, 1.70s/it]
Training 1/1 epoch (loss 2.7457): 12%|ββ | 144/1250 [03:36<31:21, 1.70s/it]
Training 1/1 epoch (loss 2.7457): 12%|ββ | 145/1250 [03:36<32:40, 1.77s/it]
Training 1/1 epoch (loss 2.8137): 12%|ββ | 145/1250 [03:37<32:40, 1.77s/it]
Training 1/1 epoch (loss 2.8137): 12%|ββ | 146/1250 [03:37<29:34, 1.61s/it]
Training 1/1 epoch (loss 2.6377): 12%|ββ | 146/1250 [03:39<29:34, 1.61s/it]
Training 1/1 epoch (loss 2.6377): 12%|ββ | 147/1250 [03:39<34:35, 1.88s/it]
Training 1/1 epoch (loss 2.7898): 12%|ββ | 147/1250 [03:40<34:35, 1.88s/it]
Training 1/1 epoch (loss 2.7898): 12%|ββ | 148/1250 [03:40<29:39, 1.61s/it]
Training 1/1 epoch (loss 2.6782): 12%|ββ | 148/1250 [03:42<29:39, 1.61s/it]
Training 1/1 epoch (loss 2.6782): 12%|ββ | 149/1250 [03:42<29:27, 1.61s/it]
Training 1/1 epoch (loss 2.9538): 12%|ββ | 149/1250 [03:43<29:27, 1.61s/it]
Training 1/1 epoch (loss 2.9538): 12%|ββ | 150/1250 [03:43<28:47, 1.57s/it]
Training 1/1 epoch (loss 2.7011): 12%|ββ | 150/1250 [03:44<28:47, 1.57s/it]
Training 1/1 epoch (loss 2.7011): 12%|ββ | 151/1250 [03:44<24:02, 1.31s/it]
Training 1/1 epoch (loss 2.7690): 12%|ββ | 151/1250 [03:46<24:02, 1.31s/it]
Training 1/1 epoch (loss 2.7690): 12%|ββ | 152/1250 [03:46<28:30, 1.56s/it]
Training 1/1 epoch (loss 2.8976): 12%|ββ | 152/1250 [03:48<28:30, 1.56s/it]
Training 1/1 epoch (loss 2.8976): 12%|ββ | 153/1250 [03:48<30:50, 1.69s/it]
Training 1/1 epoch (loss 2.7954): 12%|ββ | 153/1250 [03:49<30:50, 1.69s/it]
Training 1/1 epoch (loss 2.7954): 12%|ββ | 154/1250 [03:49<23:55, 1.31s/it]
Training 1/1 epoch (loss 2.7561): 12%|ββ | 154/1250 [03:51<23:55, 1.31s/it]
Training 1/1 epoch (loss 2.7561): 12%|ββ | 155/1250 [03:51<29:48, 1.63s/it]
Training 1/1 epoch (loss 2.8465): 12%|ββ | 155/1250 [03:52<29:48, 1.63s/it]
Training 1/1 epoch (loss 2.8465): 12%|ββ | 156/1250 [03:52<27:01, 1.48s/it]
Training 1/1 epoch (loss 2.8486): 12%|ββ | 156/1250 [03:53<27:01, 1.48s/it]
Training 1/1 epoch (loss 2.8486): 13%|ββ | 157/1250 [03:53<22:02, 1.21s/it]
Training 1/1 epoch (loss 2.8475): 13%|ββ | 157/1250 [03:55<22:02, 1.21s/it]
Training 1/1 epoch (loss 2.8475): 13%|ββ | 158/1250 [03:55<28:50, 1.58s/it]
Training 1/1 epoch (loss 2.9731): 13%|ββ | 158/1250 [03:56<28:50, 1.58s/it]
Training 1/1 epoch (loss 2.9731): 13%|ββ | 159/1250 [03:56<26:32, 1.46s/it]
Training 1/1 epoch (loss 2.6849): 13%|ββ | 159/1250 [03:57<26:32, 1.46s/it]
Training 1/1 epoch (loss 2.6849): 13%|ββ | 160/1250 [03:57<21:58, 1.21s/it]
Training 1/1 epoch (loss 2.6897): 13%|ββ | 160/1250 [03:59<21:58, 1.21s/it]
Training 1/1 epoch (loss 2.6897): 13%|ββ | 161/1250 [03:59<25:07, 1.38s/it]
Training 1/1 epoch (loss 2.6093): 13%|ββ | 161/1250 [04:00<25:07, 1.38s/it]
Training 1/1 epoch (loss 2.6093): 13%|ββ | 162/1250 [04:00<24:06, 1.33s/it]
Training 1/1 epoch (loss 2.7855): 13%|ββ | 162/1250 [04:01<24:06, 1.33s/it]
Training 1/1 epoch (loss 2.7855): 13%|ββ | 163/1250 [04:01<20:21, 1.12s/it]
Training 1/1 epoch (loss 2.5838): 13%|ββ | 163/1250 [04:02<20:21, 1.12s/it]
Training 1/1 epoch (loss 2.5838): 13%|ββ | 164/1250 [04:02<23:13, 1.28s/it]
Training 1/1 epoch (loss 2.6879): 13%|ββ | 164/1250 [04:04<23:13, 1.28s/it]
Training 1/1 epoch (loss 2.6879): 13%|ββ | 165/1250 [04:04<23:14, 1.29s/it]
Training 1/1 epoch (loss 2.7240): 13%|ββ | 165/1250 [04:04<23:14, 1.29s/it]
Training 1/1 epoch (loss 2.7240): 13%|ββ | 166/1250 [04:04<19:14, 1.07s/it]
Training 1/1 epoch (loss 2.6637): 13%|ββ | 166/1250 [04:07<19:14, 1.07s/it]
Training 1/1 epoch (loss 2.6637): 13%|ββ | 167/1250 [04:07<25:45, 1.43s/it]
Training 1/1 epoch (loss 2.6975): 13%|ββ | 167/1250 [04:08<25:45, 1.43s/it]
Training 1/1 epoch (loss 2.6975): 13%|ββ | 168/1250 [04:08<27:20, 1.52s/it]
Training 1/1 epoch (loss 2.7213): 13%|ββ | 168/1250 [04:10<27:20, 1.52s/it]
Training 1/1 epoch (loss 2.7213): 14%|ββ | 169/1250 [04:10<26:17, 1.46s/it]
Training 1/1 epoch (loss 2.7753): 14%|ββ | 169/1250 [04:11<26:17, 1.46s/it]
Training 1/1 epoch (loss 2.7753): 14%|ββ | 170/1250 [04:11<26:35, 1.48s/it]
Training 1/1 epoch (loss 2.7366): 14%|ββ | 170/1250 [04:12<26:35, 1.48s/it]
Training 1/1 epoch (loss 2.7366): 14%|ββ | 171/1250 [04:12<24:24, 1.36s/it]
Training 1/1 epoch (loss 2.6314): 14%|ββ | 171/1250 [04:13<24:24, 1.36s/it]
Training 1/1 epoch (loss 2.6314): 14%|ββ | 172/1250 [04:13<22:53, 1.27s/it]
Training 1/1 epoch (loss 2.5646): 14%|ββ | 172/1250 [04:14<22:53, 1.27s/it]
Training 1/1 epoch (loss 2.5646): 14%|ββ | 173/1250 [04:14<21:26, 1.19s/it]
Training 1/1 epoch (loss 2.8368): 14%|ββ | 173/1250 [04:15<21:26, 1.19s/it]
Training 1/1 epoch (loss 2.8368): 14%|ββ | 174/1250 [04:15<21:22, 1.19s/it]
Training 1/1 epoch (loss 2.8711): 14%|ββ | 174/1250 [04:17<21:22, 1.19s/it]
Training 1/1 epoch (loss 2.8711): 14%|ββ | 175/1250 [04:17<25:12, 1.41s/it]
Training 1/1 epoch (loss 2.6904): 14%|ββ | 175/1250 [04:19<25:12, 1.41s/it]
Training 1/1 epoch (loss 2.6904): 14%|ββ | 176/1250 [04:19<27:52, 1.56s/it]
Training 1/1 epoch (loss 2.9227): 14%|ββ | 176/1250 [04:20<27:52, 1.56s/it]
Training 1/1 epoch (loss 2.9227): 14%|ββ | 177/1250 [04:20<24:13, 1.35s/it]
Training 1/1 epoch (loss 2.7351): 14%|ββ | 177/1250 [04:22<24:13, 1.35s/it]
Training 1/1 epoch (loss 2.7351): 14%|ββ | 178/1250 [04:22<24:44, 1.39s/it]
Training 1/1 epoch (loss 2.9244): 14%|ββ | 178/1250 [04:23<24:44, 1.39s/it]
Training 1/1 epoch (loss 2.9244): 14%|ββ | 179/1250 [04:23<24:08, 1.35s/it]
Training 1/1 epoch (loss 2.8791): 14%|ββ | 179/1250 [04:24<24:08, 1.35s/it]
Training 1/1 epoch (loss 2.8791): 14%|ββ | 180/1250 [04:24<21:53, 1.23s/it]
Training 1/1 epoch (loss 2.9022): 14%|ββ | 180/1250 [04:25<21:53, 1.23s/it]
Training 1/1 epoch (loss 2.9022): 14%|ββ | 181/1250 [04:25<23:06, 1.30s/it]
Training 1/1 epoch (loss 2.7654): 14%|ββ | 181/1250 [04:26<23:06, 1.30s/it]
Training 1/1 epoch (loss 2.7654): 15%|ββ | 182/1250 [04:26<20:45, 1.17s/it]
Training 1/1 epoch (loss 2.8061): 15%|ββ | 182/1250 [04:27<20:45, 1.17s/it]
Training 1/1 epoch (loss 2.8061): 15%|ββ | 183/1250 [04:27<20:08, 1.13s/it]
Training 1/1 epoch (loss 2.7868): 15%|ββ | 183/1250 [04:30<20:08, 1.13s/it]
Training 1/1 epoch (loss 2.7868): 15%|ββ | 184/1250 [04:30<27:43, 1.56s/it]
Training 1/1 epoch (loss 2.6616): 15%|ββ | 184/1250 [04:30<27:43, 1.56s/it]
Training 1/1 epoch (loss 2.6616): 15%|ββ | 185/1250 [04:30<23:14, 1.31s/it]
Training 1/1 epoch (loss 2.7505): 15%|ββ | 185/1250 [04:32<23:14, 1.31s/it]
Training 1/1 epoch (loss 2.7505): 15%|ββ | 186/1250 [04:32<23:10, 1.31s/it]
Training 1/1 epoch (loss 2.8355): 15%|ββ | 186/1250 [04:34<23:10, 1.31s/it]
Training 1/1 epoch (loss 2.8355): 15%|ββ | 187/1250 [04:34<28:47, 1.63s/it]
Training 1/1 epoch (loss 2.5569): 15%|ββ | 187/1250 [04:35<28:47, 1.63s/it]
Training 1/1 epoch (loss 2.5569): 15%|ββ | 188/1250 [04:35<22:30, 1.27s/it]
Training 1/1 epoch (loss 2.6328): 15%|ββ | 188/1250 [04:37<22:30, 1.27s/it]
Training 1/1 epoch (loss 2.6328): 15%|ββ | 189/1250 [04:37<27:47, 1.57s/it]
Training 1/1 epoch (loss 2.6099): 15%|ββ | 189/1250 [04:39<27:47, 1.57s/it]
Training 1/1 epoch (loss 2.6099): 15%|ββ | 190/1250 [04:39<29:05, 1.65s/it]
Training 1/1 epoch (loss 2.7061): 15%|ββ | 190/1250 [04:39<29:05, 1.65s/it]
Training 1/1 epoch (loss 2.7061): 15%|ββ | 191/1250 [04:39<23:46, 1.35s/it]
Training 1/1 epoch (loss 2.9433): 15%|ββ | 191/1250 [04:41<23:46, 1.35s/it]
Training 1/1 epoch (loss 2.9433): 15%|ββ | 192/1250 [04:41<27:30, 1.56s/it]
Training 1/1 epoch (loss 2.6951): 15%|ββ | 192/1250 [04:43<27:30, 1.56s/it]
Training 1/1 epoch (loss 2.6951): 15%|ββ | 193/1250 [04:43<26:43, 1.52s/it]
Training 1/1 epoch (loss 2.8614): 15%|ββ | 193/1250 [04:44<26:43, 1.52s/it]
Training 1/1 epoch (loss 2.8614): 16%|ββ | 194/1250 [04:44<23:29, 1.33s/it]
Training 1/1 epoch (loss 2.7291): 16%|ββ | 194/1250 [04:45<23:29, 1.33s/it]
Training 1/1 epoch (loss 2.7291): 16%|ββ | 195/1250 [04:45<22:24, 1.27s/it]
Training 1/1 epoch (loss 2.5920): 16%|ββ | 195/1250 [04:46<22:24, 1.27s/it]
Training 1/1 epoch (loss 2.5920): 16%|ββ | 196/1250 [04:46<19:50, 1.13s/it]
Training 1/1 epoch (loss 2.6773): 16%|ββ | 196/1250 [04:48<19:50, 1.13s/it]
Training 1/1 epoch (loss 2.6773): 16%|ββ | 197/1250 [04:48<24:33, 1.40s/it]
Training 1/1 epoch (loss 2.8307): 16%|ββ | 197/1250 [04:49<24:33, 1.40s/it]
Training 1/1 epoch (loss 2.8307): 16%|ββ | 198/1250 [04:49<26:43, 1.52s/it]
Training 1/1 epoch (loss 2.7678): 16%|ββ | 198/1250 [04:50<26:43, 1.52s/it]
Training 1/1 epoch (loss 2.7678): 16%|ββ | 199/1250 [04:50<21:18, 1.22s/it]
Training 1/1 epoch (loss 2.6885): 16%|ββ | 199/1250 [04:51<21:18, 1.22s/it]
Training 1/1 epoch (loss 2.6885): 16%|ββ | 200/1250 [04:51<21:37, 1.24s/it]
Training 1/1 epoch (loss 2.8067): 16%|ββ | 200/1250 [04:54<21:37, 1.24s/it]
Training 1/1 epoch (loss 2.8067): 16%|ββ | 201/1250 [04:54<27:49, 1.59s/it]
Training 1/1 epoch (loss 2.6999): 16%|ββ | 201/1250 [04:54<27:49, 1.59s/it]
Training 1/1 epoch (loss 2.6999): 16%|ββ | 202/1250 [04:54<23:03, 1.32s/it]
Training 1/1 epoch (loss 2.6414): 16%|ββ | 202/1250 [04:56<23:03, 1.32s/it]
Training 1/1 epoch (loss 2.6414): 16%|ββ | 203/1250 [04:56<27:09, 1.56s/it]
Training 1/1 epoch (loss 2.6257): 16%|ββ | 203/1250 [04:59<27:09, 1.56s/it]
Training 1/1 epoch (loss 2.6257): 16%|ββ | 204/1250 [04:59<31:35, 1.81s/it]
Training 1/1 epoch (loss 2.6314): 16%|ββ | 204/1250 [05:00<31:35, 1.81s/it]
Training 1/1 epoch (loss 2.6314): 16%|ββ | 205/1250 [05:00<26:29, 1.52s/it]
Training 1/1 epoch (loss 2.8827): 16%|ββ | 205/1250 [05:00<26:29, 1.52s/it]
Training 1/1 epoch (loss 2.8827): 16%|ββ | 206/1250 [05:00<22:09, 1.27s/it]
Training 1/1 epoch (loss 2.7362): 16%|ββ | 206/1250 [05:02<22:09, 1.27s/it]
Training 1/1 epoch (loss 2.7362): 17%|ββ | 207/1250 [05:02<24:16, 1.40s/it]
Training 1/1 epoch (loss 2.7467): 17%|ββ | 207/1250 [05:03<24:16, 1.40s/it]
Training 1/1 epoch (loss 2.7467): 17%|ββ | 208/1250 [05:03<21:46, 1.25s/it]
Training 1/1 epoch (loss 2.7294): 17%|ββ | 208/1250 [05:05<21:46, 1.25s/it]
Training 1/1 epoch (loss 2.7294): 17%|ββ | 209/1250 [05:05<27:48, 1.60s/it]
Training 1/1 epoch (loss 2.7301): 17%|ββ | 209/1250 [05:07<27:48, 1.60s/it]
Training 1/1 epoch (loss 2.7301): 17%|ββ | 210/1250 [05:07<25:39, 1.48s/it]
Training 1/1 epoch (loss 2.7560): 17%|ββ | 210/1250 [05:08<25:39, 1.48s/it]
Training 1/1 epoch (loss 2.7560): 17%|ββ | 211/1250 [05:08<25:38, 1.48s/it]
Training 1/1 epoch (loss 2.9067): 17%|ββ | 211/1250 [05:10<25:38, 1.48s/it]
Training 1/1 epoch (loss 2.9067): 17%|ββ | 212/1250 [05:10<25:38, 1.48s/it]
Training 1/1 epoch (loss 2.6924): 17%|ββ | 212/1250 [05:10<25:38, 1.48s/it]
Training 1/1 epoch (loss 2.6924): 17%|ββ | 213/1250 [05:10<22:28, 1.30s/it]
Training 1/1 epoch (loss 2.5839): 17%|ββ | 213/1250 [05:12<22:28, 1.30s/it]
Training 1/1 epoch (loss 2.5839): 17%|ββ | 214/1250 [05:12<24:00, 1.39s/it]
Training 1/1 epoch (loss 2.5754): 17%|ββ | 214/1250 [05:13<24:00, 1.39s/it]
Training 1/1 epoch (loss 2.5754): 17%|ββ | 215/1250 [05:13<23:37, 1.37s/it]
Training 1/1 epoch (loss 2.4751): 17%|ββ | 215/1250 [05:14<23:37, 1.37s/it]
Training 1/1 epoch (loss 2.4751): 17%|ββ | 216/1250 [05:14<21:40, 1.26s/it]
Training 1/1 epoch (loss 2.5835): 17%|ββ | 216/1250 [05:16<21:40, 1.26s/it]
Training 1/1 epoch (loss 2.5835): 17%|ββ | 217/1250 [05:16<23:55, 1.39s/it]
Training 1/1 epoch (loss 2.7904): 17%|ββ | 217/1250 [05:17<23:55, 1.39s/it]
Training 1/1 epoch (loss 2.7904): 17%|ββ | 218/1250 [05:17<22:37, 1.32s/it]
Training 1/1 epoch (loss 2.8251): 17%|ββ | 218/1250 [05:18<22:37, 1.32s/it]
Training 1/1 epoch (loss 2.8251): 18%|ββ | 219/1250 [05:18<20:02, 1.17s/it]
Training 1/1 epoch (loss 2.5427): 18%|ββ | 219/1250 [05:20<20:02, 1.17s/it]
Training 1/1 epoch (loss 2.5427): 18%|ββ | 220/1250 [05:20<26:36, 1.55s/it]
Training 1/1 epoch (loss 2.5790): 18%|ββ | 220/1250 [05:22<26:36, 1.55s/it]
Training 1/1 epoch (loss 2.5790): 18%|ββ | 221/1250 [05:22<25:14, 1.47s/it]
Training 1/1 epoch (loss 2.7313): 18%|ββ | 221/1250 [05:24<25:14, 1.47s/it]
Training 1/1 epoch (loss 2.7313): 18%|ββ | 222/1250 [05:24<29:13, 1.71s/it]
Training 1/1 epoch (loss 2.6235): 18%|ββ | 222/1250 [05:25<29:13, 1.71s/it]
Training 1/1 epoch (loss 2.6235): 18%|ββ | 223/1250 [05:25<27:44, 1.62s/it]
Training 1/1 epoch (loss 2.6535): 18%|ββ | 223/1250 [05:26<27:44, 1.62s/it]
Training 1/1 epoch (loss 2.6535): 18%|ββ | 224/1250 [05:26<23:59, 1.40s/it]
Training 1/1 epoch (loss 2.5313): 18%|ββ | 224/1250 [05:29<23:59, 1.40s/it]
Training 1/1 epoch (loss 2.5313): 18%|ββ | 225/1250 [05:29<29:17, 1.71s/it]
Training 1/1 epoch (loss 2.6844): 18%|ββ | 225/1250 [05:31<29:17, 1.71s/it]
Training 1/1 epoch (loss 2.6844): 18%|ββ | 226/1250 [05:31<31:16, 1.83s/it]
Training 1/1 epoch (loss 2.9097): 18%|ββ | 226/1250 [05:31<31:16, 1.83s/it]
Training 1/1 epoch (loss 2.9097): 18%|ββ | 227/1250 [05:31<24:24, 1.43s/it]
Training 1/1 epoch (loss 2.8007): 18%|ββ | 227/1250 [05:33<24:24, 1.43s/it]
Training 1/1 epoch (loss 2.8007): 18%|ββ | 228/1250 [05:33<26:58, 1.58s/it]
Training 1/1 epoch (loss 2.7010): 18%|ββ | 228/1250 [05:35<26:58, 1.58s/it]
Training 1/1 epoch (loss 2.7010): 18%|ββ | 229/1250 [05:35<29:49, 1.75s/it]
Training 1/1 epoch (loss 2.6363): 18%|ββ | 229/1250 [05:36<29:49, 1.75s/it]
Training 1/1 epoch (loss 2.6363): 18%|ββ | 230/1250 [05:36<24:58, 1.47s/it]
Training 1/1 epoch (loss 2.8609): 18%|ββ | 230/1250 [05:39<24:58, 1.47s/it]
Training 1/1 epoch (loss 2.8609): 18%|ββ | 231/1250 [05:39<29:41, 1.75s/it]
Training 1/1 epoch (loss 2.8554): 18%|ββ | 231/1250 [05:41<29:41, 1.75s/it]
Training 1/1 epoch (loss 2.8554): 19%|ββ | 232/1250 [05:41<31:42, 1.87s/it]
Training 1/1 epoch (loss 2.7418): 19%|ββ | 232/1250 [05:42<31:42, 1.87s/it]
Training 1/1 epoch (loss 2.7418): 19%|ββ | 233/1250 [05:42<27:00, 1.59s/it]
Training 1/1 epoch (loss 2.7477): 19%|ββ | 233/1250 [05:43<27:00, 1.59s/it]
Training 1/1 epoch (loss 2.7477): 19%|ββ | 234/1250 [05:43<25:52, 1.53s/it]
Training 1/1 epoch (loss 2.7497): 19%|ββ | 234/1250 [05:44<25:52, 1.53s/it]
Training 1/1 epoch (loss 2.7497): 19%|ββ | 235/1250 [05:44<24:07, 1.43s/it]
Training 1/1 epoch (loss 2.7348): 19%|ββ | 235/1250 [05:45<24:07, 1.43s/it]
Training 1/1 epoch (loss 2.7348): 19%|ββ | 236/1250 [05:45<21:53, 1.30s/it]
Training 1/1 epoch (loss 2.5937): 19%|ββ | 236/1250 [05:47<21:53, 1.30s/it]
Training 1/1 epoch (loss 2.5937): 19%|ββ | 237/1250 [05:47<24:06, 1.43s/it]
Training 1/1 epoch (loss 2.7252): 19%|ββ | 237/1250 [05:48<24:06, 1.43s/it]
Training 1/1 epoch (loss 2.7252): 19%|ββ | 238/1250 [05:48<21:05, 1.25s/it]
Training 1/1 epoch (loss 2.6181): 19%|ββ | 238/1250 [05:50<21:05, 1.25s/it]
Training 1/1 epoch (loss 2.6181): 19%|ββ | 239/1250 [05:50<24:09, 1.43s/it]
Training 1/1 epoch (loss 2.6648): 19%|ββ | 239/1250 [05:52<24:09, 1.43s/it]
Training 1/1 epoch (loss 2.6648): 19%|ββ | 240/1250 [05:52<28:52, 1.72s/it]
Training 1/1 epoch (loss 2.5835): 19%|ββ | 240/1250 [05:53<28:52, 1.72s/it]
Training 1/1 epoch (loss 2.5835): 19%|ββ | 241/1250 [05:53<22:45, 1.35s/it]
Training 1/1 epoch (loss 2.6874): 19%|ββ | 241/1250 [05:55<22:45, 1.35s/it]
Training 1/1 epoch (loss 2.6874): 19%|ββ | 242/1250 [05:55<26:16, 1.56s/it]
Training 1/1 epoch (loss 2.5630): 19%|ββ | 242/1250 [05:56<26:16, 1.56s/it]
Training 1/1 epoch (loss 2.5630): 19%|ββ | 243/1250 [05:56<25:06, 1.50s/it]
Training 1/1 epoch (loss 2.4923): 19%|ββ | 243/1250 [05:56<25:06, 1.50s/it]
Training 1/1 epoch (loss 2.4923): 20%|ββ | 244/1250 [05:56<19:34, 1.17s/it]
Training 1/1 epoch (loss 2.6533): 20%|ββ | 244/1250 [05:57<19:34, 1.17s/it]
Training 1/1 epoch (loss 2.6533): 20%|ββ | 245/1250 [05:57<18:47, 1.12s/it]
Training 1/1 epoch (loss 2.7168): 20%|ββ | 245/1250 [05:59<18:47, 1.12s/it]
Training 1/1 epoch (loss 2.7168): 20%|ββ | 246/1250 [05:59<20:32, 1.23s/it]
Training 1/1 epoch (loss 2.6522): 20%|ββ | 246/1250 [06:00<20:32, 1.23s/it]
Training 1/1 epoch (loss 2.6522): 20%|ββ | 247/1250 [06:00<18:59, 1.14s/it]
Training 1/1 epoch (loss 2.6392): 20%|ββ | 247/1250 [06:01<18:59, 1.14s/it]
Training 1/1 epoch (loss 2.6392): 20%|ββ | 248/1250 [06:01<20:26, 1.22s/it]
Training 1/1 epoch (loss 2.7989): 20%|ββ | 248/1250 [06:03<20:26, 1.22s/it]
Training 1/1 epoch (loss 2.7989): 20%|ββ | 249/1250 [06:03<22:17, 1.34s/it]
Training 1/1 epoch (loss 2.7527): 20%|ββ | 249/1250 [06:04<22:17, 1.34s/it]
Training 1/1 epoch (loss 2.7527): 20%|ββ | 250/1250 [06:04<20:00, 1.20s/it]
Training 1/1 epoch (loss 2.6352): 20%|ββ | 250/1250 [06:06<20:00, 1.20s/it]
Training 1/1 epoch (loss 2.6352): 20%|ββ | 251/1250 [06:06<23:42, 1.42s/it]
Training 1/1 epoch (loss 2.7573): 20%|ββ | 251/1250 [06:07<23:42, 1.42s/it]
Training 1/1 epoch (loss 2.7573): 20%|ββ | 252/1250 [06:07<22:39, 1.36s/it]
Training 1/1 epoch (loss 2.6558): 20%|ββ | 252/1250 [06:08<22:39, 1.36s/it]
Training 1/1 epoch (loss 2.6558): 20%|ββ | 253/1250 [06:08<20:21, 1.22s/it]
Training 1/1 epoch (loss 2.7867): 20%|ββ | 253/1250 [06:10<20:21, 1.22s/it]
Training 1/1 epoch (loss 2.7867): 20%|ββ | 254/1250 [06:10<24:12, 1.46s/it]
Training 1/1 epoch (loss 2.6694): 20%|ββ | 254/1250 [06:11<24:12, 1.46s/it]
Training 1/1 epoch (loss 2.6694): 20%|ββ | 255/1250 [06:11<22:58, 1.39s/it]
Training 1/1 epoch (loss 2.7821): 20%|ββ | 255/1250 [06:14<22:58, 1.39s/it]
Training 1/1 epoch (loss 2.7821): 20%|ββ | 256/1250 [06:14<29:05, 1.76s/it]
Training 1/1 epoch (loss 2.7503): 20%|ββ | 256/1250 [06:16<29:05, 1.76s/it]
Training 1/1 epoch (loss 2.7503): 21%|ββ | 257/1250 [06:16<32:13, 1.95s/it]
Training 1/1 epoch (loss 2.7028): 21%|ββ | 257/1250 [06:17<32:13, 1.95s/it]
Training 1/1 epoch (loss 2.7028): 21%|ββ | 258/1250 [06:17<25:58, 1.57s/it]
Training 1/1 epoch (loss 2.6863): 21%|ββ | 258/1250 [06:18<25:58, 1.57s/it]
Training 1/1 epoch (loss 2.6863): 21%|ββ | 259/1250 [06:18<22:02, 1.33s/it]
Training 1/1 epoch (loss 2.7357): 21%|ββ | 259/1250 [06:20<22:02, 1.33s/it]
Training 1/1 epoch (loss 2.7357): 21%|ββ | 260/1250 [06:20<27:28, 1.66s/it]
Training 1/1 epoch (loss 2.6192): 21%|ββ | 260/1250 [06:20<27:28, 1.66s/it]
Training 1/1 epoch (loss 2.6192): 21%|ββ | 261/1250 [06:20<21:03, 1.28s/it]
Training 1/1 epoch (loss 2.6726): 21%|ββ | 261/1250 [06:22<21:03, 1.28s/it]
Training 1/1 epoch (loss 2.6726): 21%|ββ | 262/1250 [06:22<23:46, 1.44s/it]
Training 1/1 epoch (loss 2.6662): 21%|ββ | 262/1250 [06:24<23:46, 1.44s/it]
Training 1/1 epoch (loss 2.6662): 21%|ββ | 263/1250 [06:24<25:13, 1.53s/it]
Training 1/1 epoch (loss 2.4099): 21%|ββ | 263/1250 [06:25<25:13, 1.53s/it]
Training 1/1 epoch (loss 2.4099): 21%|ββ | 264/1250 [06:25<20:59, 1.28s/it]
Training 1/1 epoch (loss 2.7352): 21%|ββ | 264/1250 [06:26<20:59, 1.28s/it]
Training 1/1 epoch (loss 2.7352): 21%|ββ | 265/1250 [06:26<20:11, 1.23s/it]
Training 1/1 epoch (loss 2.7150): 21%|ββ | 265/1250 [06:28<20:11, 1.23s/it]
Training 1/1 epoch (loss 2.7150): 21%|βββ | 266/1250 [06:28<25:47, 1.57s/it]
Training 1/1 epoch (loss 2.8195): 21%|βββ | 266/1250 [06:29<25:47, 1.57s/it]
Training 1/1 epoch (loss 2.8195): 21%|βββ | 267/1250 [06:29<20:05, 1.23s/it]
Training 1/1 epoch (loss 2.7368): 21%|βββ | 267/1250 [06:30<20:05, 1.23s/it]
Training 1/1 epoch (loss 2.7368): 21%|βββ | 268/1250 [06:30<21:50, 1.33s/it]
Training 1/1 epoch (loss 2.6982): 21%|βββ | 268/1250 [06:31<21:50, 1.33s/it]
Training 1/1 epoch (loss 2.6982): 22%|βββ | 269/1250 [06:31<20:21, 1.25s/it]
Training 1/1 epoch (loss 2.6419): 22%|βββ | 269/1250 [06:32<20:21, 1.25s/it]
Training 1/1 epoch (loss 2.6419): 22%|βββ | 270/1250 [06:32<16:21, 1.00s/it]
Training 1/1 epoch (loss 2.5254): 22%|βββ | 270/1250 [06:34<16:21, 1.00s/it]
Training 1/1 epoch (loss 2.5254): 22%|βββ | 271/1250 [06:34<23:24, 1.43s/it]
Training 1/1 epoch (loss 2.4608): 22%|βββ | 271/1250 [06:36<23:24, 1.43s/it]
Training 1/1 epoch (loss 2.4608): 22%|βββ | 272/1250 [06:36<25:28, 1.56s/it]
Training 1/1 epoch (loss 2.6420): 22%|βββ | 272/1250 [06:37<25:28, 1.56s/it]
Training 1/1 epoch (loss 2.6420): 22%|βββ | 273/1250 [06:37<21:09, 1.30s/it]
Training 1/1 epoch (loss 2.7341): 22%|βββ | 273/1250 [06:38<21:09, 1.30s/it]
Training 1/1 epoch (loss 2.7341): 22%|βββ | 274/1250 [06:38<23:45, 1.46s/it]
Training 1/1 epoch (loss 2.7532): 22%|βββ | 274/1250 [06:40<23:45, 1.46s/it]
Training 1/1 epoch (loss 2.7532): 22%|βββ | 275/1250 [06:40<25:19, 1.56s/it]
Training 1/1 epoch (loss 2.7701): 22%|βββ | 275/1250 [06:41<25:19, 1.56s/it]
Training 1/1 epoch (loss 2.7701): 22%|βββ | 276/1250 [06:41<20:01, 1.23s/it]
Training 1/1 epoch (loss 2.7120): 22%|βββ | 276/1250 [06:42<20:01, 1.23s/it]
Training 1/1 epoch (loss 2.7120): 22%|βββ | 277/1250 [06:42<22:13, 1.37s/it]
Training 1/1 epoch (loss 2.7926): 22%|βββ | 277/1250 [06:44<22:13, 1.37s/it]
Training 1/1 epoch (loss 2.7926): 22%|βββ | 278/1250 [06:44<23:00, 1.42s/it]
Training 1/1 epoch (loss 2.8662): 22%|βββ | 278/1250 [06:45<23:00, 1.42s/it]
Training 1/1 epoch (loss 2.8662): 22%|βββ | 279/1250 [06:45<22:16, 1.38s/it]
Training 1/1 epoch (loss 2.6147): 22%|βββ | 279/1250 [06:48<22:16, 1.38s/it]
Training 1/1 epoch (loss 2.6147): 22%|βββ | 280/1250 [06:48<27:36, 1.71s/it]
Training 1/1 epoch (loss 2.8070): 22%|βββ | 280/1250 [06:49<27:36, 1.71s/it]
Training 1/1 epoch (loss 2.8070): 22%|βββ | 281/1250 [06:49<23:39, 1.47s/it]
Training 1/1 epoch (loss 2.6629): 22%|βββ | 281/1250 [06:50<23:39, 1.47s/it]
Training 1/1 epoch (loss 2.6629): 23%|βββ | 282/1250 [06:50<24:19, 1.51s/it]
Training 1/1 epoch (loss 2.7138): 23%|βββ | 282/1250 [06:51<24:19, 1.51s/it]
Training 1/1 epoch (loss 2.7138): 23%|βββ | 283/1250 [06:51<22:51, 1.42s/it]
Training 1/1 epoch (loss 2.8214): 23%|βββ | 283/1250 [06:52<22:51, 1.42s/it]
Training 1/1 epoch (loss 2.8214): 23%|βββ | 284/1250 [06:52<19:28, 1.21s/it]
Training 1/1 epoch (loss 2.6936): 23%|βββ | 284/1250 [06:54<19:28, 1.21s/it]
Training 1/1 epoch (loss 2.6936): 23%|βββ | 285/1250 [06:54<21:06, 1.31s/it]
Training 1/1 epoch (loss 2.5472): 23%|βββ | 285/1250 [06:55<21:06, 1.31s/it]
Training 1/1 epoch (loss 2.5472): 23%|βββ | 286/1250 [06:55<23:33, 1.47s/it]
Training 1/1 epoch (loss 2.6606): 23%|βββ | 286/1250 [06:56<23:33, 1.47s/it]
Training 1/1 epoch (loss 2.6606): 23%|βββ | 287/1250 [06:56<19:05, 1.19s/it]
Training 1/1 epoch (loss 2.6650): 23%|βββ | 287/1250 [06:58<19:05, 1.19s/it]
Training 1/1 epoch (loss 2.6650): 23%|βββ | 288/1250 [06:58<21:00, 1.31s/it]
Training 1/1 epoch (loss 2.6208): 23%|βββ | 288/1250 [06:59<21:00, 1.31s/it]
Training 1/1 epoch (loss 2.6208): 23%|βββ | 289/1250 [06:59<21:06, 1.32s/it]
Training 1/1 epoch (loss 2.5869): 23%|βββ | 289/1250 [07:00<21:06, 1.32s/it]
Training 1/1 epoch (loss 2.5869): 23%|βββ | 290/1250 [07:00<19:08, 1.20s/it]
Training 1/1 epoch (loss 2.6175): 23%|βββ | 290/1250 [07:02<19:08, 1.20s/it]
Training 1/1 epoch (loss 2.6175): 23%|βββ | 291/1250 [07:02<24:33, 1.54s/it]
Training 1/1 epoch (loss 2.9057): 23%|βββ | 291/1250 [07:04<24:33, 1.54s/it]
Training 1/1 epoch (loss 2.9057): 23%|βββ | 292/1250 [07:04<23:52, 1.49s/it]
Training 1/1 epoch (loss 2.8089): 23%|βββ | 292/1250 [07:04<23:52, 1.49s/it]
Training 1/1 epoch (loss 2.8089): 23%|βββ | 293/1250 [07:04<20:38, 1.29s/it]
Training 1/1 epoch (loss 2.6792): 23%|βββ | 293/1250 [07:06<20:38, 1.29s/it]
Training 1/1 epoch (loss 2.6792): 24%|βββ | 294/1250 [07:06<23:27, 1.47s/it]
Training 1/1 epoch (loss 2.6451): 24%|βββ | 294/1250 [07:07<23:27, 1.47s/it]
Training 1/1 epoch (loss 2.6451): 24%|βββ | 295/1250 [07:07<21:24, 1.35s/it]
Training 1/1 epoch (loss 2.7266): 24%|βββ | 295/1250 [07:08<21:24, 1.35s/it]
Training 1/1 epoch (loss 2.7266): 24%|βββ | 296/1250 [07:08<19:33, 1.23s/it]
Training 1/1 epoch (loss 2.6532): 24%|βββ | 296/1250 [07:10<19:33, 1.23s/it]
Training 1/1 epoch (loss 2.6532): 24%|βββ | 297/1250 [07:10<20:37, 1.30s/it]
Training 1/1 epoch (loss 2.7383): 24%|βββ | 297/1250 [07:11<20:37, 1.30s/it]
Training 1/1 epoch (loss 2.7383): 24%|βββ | 298/1250 [07:11<18:24, 1.16s/it]
Training 1/1 epoch (loss 2.6079): 24%|βββ | 298/1250 [07:12<18:24, 1.16s/it]
Training 1/1 epoch (loss 2.6079): 24%|βββ | 299/1250 [07:12<19:17, 1.22s/it]
Training 1/1 epoch (loss 2.7754): 24%|βββ | 299/1250 [07:14<19:17, 1.22s/it]
Training 1/1 epoch (loss 2.7754): 24%|βββ | 300/1250 [07:14<23:26, 1.48s/it]
Training 1/1 epoch (loss 2.6451): 24%|βββ | 300/1250 [07:15<23:26, 1.48s/it]
Training 1/1 epoch (loss 2.6451): 24%|βββ | 301/1250 [07:15<21:10, 1.34s/it]
Training 1/1 epoch (loss 2.5133): 24%|βββ | 301/1250 [07:17<21:10, 1.34s/it]
Training 1/1 epoch (loss 2.5133): 24%|βββ | 302/1250 [07:17<24:29, 1.55s/it]
Training 1/1 epoch (loss 2.3830): 24%|βββ | 302/1250 [07:19<24:29, 1.55s/it]
Training 1/1 epoch (loss 2.3830): 24%|βββ | 303/1250 [07:19<24:22, 1.54s/it]
Training 1/1 epoch (loss 2.7914): 24%|βββ | 303/1250 [07:19<24:22, 1.54s/it]
Training 1/1 epoch (loss 2.7914): 24%|βββ | 304/1250 [07:19<19:55, 1.26s/it]
Training 1/1 epoch (loss 2.8872): 24%|βββ | 304/1250 [07:20<19:55, 1.26s/it]
Training 1/1 epoch (loss 2.8872): 24%|βββ | 305/1250 [07:20<19:45, 1.25s/it]
Training 1/1 epoch (loss 2.8286): 24%|βββ | 305/1250 [07:22<19:45, 1.25s/it]
Training 1/1 epoch (loss 2.8286): 24%|βββ | 306/1250 [07:22<22:54, 1.46s/it]
Training 1/1 epoch (loss 2.6829): 24%|βββ | 306/1250 [07:23<22:54, 1.46s/it]
Training 1/1 epoch (loss 2.6829): 25%|βββ | 307/1250 [07:23<18:19, 1.17s/it]
Training 1/1 epoch (loss 2.6400): 25%|βββ | 307/1250 [07:25<18:19, 1.17s/it]
Training 1/1 epoch (loss 2.6400): 25%|βββ | 308/1250 [07:25<20:25, 1.30s/it]
Training 1/1 epoch (loss 2.6151): 25%|βββ | 308/1250 [07:26<20:25, 1.30s/it]
Training 1/1 epoch (loss 2.6151): 25%|βββ | 309/1250 [07:26<20:45, 1.32s/it]
Training 1/1 epoch (loss 2.6162): 25%|βββ | 309/1250 [07:27<20:45, 1.32s/it]
Training 1/1 epoch (loss 2.6162): 25%|βββ | 310/1250 [07:27<20:49, 1.33s/it]
Training 1/1 epoch (loss 2.7687): 25%|βββ | 310/1250 [07:29<20:49, 1.33s/it]
Training 1/1 epoch (loss 2.7687): 25%|βββ | 311/1250 [07:29<20:36, 1.32s/it]
Training 1/1 epoch (loss 2.6015): 25%|βββ | 311/1250 [07:30<20:36, 1.32s/it]
Training 1/1 epoch (loss 2.6015): 25%|βββ | 312/1250 [07:30<19:45, 1.26s/it]
Training 1/1 epoch (loss 2.4530): 25%|βββ | 312/1250 [07:32<19:45, 1.26s/it]
Training 1/1 epoch (loss 2.4530): 25%|βββ | 313/1250 [07:32<25:32, 1.64s/it]
Training 1/1 epoch (loss 2.6035): 25%|βββ | 313/1250 [07:34<25:32, 1.64s/it]
Training 1/1 epoch (loss 2.6035): 25%|βββ | 314/1250 [07:34<28:28, 1.83s/it]
Training 1/1 epoch (loss 2.7767): 25%|βββ | 314/1250 [07:36<28:28, 1.83s/it]
Training 1/1 epoch (loss 2.7767): 25%|βββ | 315/1250 [07:36<26:10, 1.68s/it]
Training 1/1 epoch (loss 2.6400): 25%|βββ | 315/1250 [07:38<26:10, 1.68s/it]
Training 1/1 epoch (loss 2.6400): 25%|βββ | 316/1250 [07:38<26:51, 1.73s/it]
Training 1/1 epoch (loss 2.7536): 25%|βββ | 316/1250 [07:39<26:51, 1.73s/it]
Training 1/1 epoch (loss 2.7536): 25%|βββ | 317/1250 [07:39<24:16, 1.56s/it]
Training 1/1 epoch (loss 2.8466): 25%|βββ | 317/1250 [07:40<24:16, 1.56s/it]
Training 1/1 epoch (loss 2.8466): 25%|βββ | 318/1250 [07:40<24:13, 1.56s/it]
Training 1/1 epoch (loss 2.6306): 25%|βββ | 318/1250 [07:41<24:13, 1.56s/it]
Training 1/1 epoch (loss 2.6306): 26%|βββ | 319/1250 [07:41<21:42, 1.40s/it]
Training 1/1 epoch (loss 2.7607): 26%|βββ | 319/1250 [07:43<21:42, 1.40s/it]
Training 1/1 epoch (loss 2.7607): 26%|βββ | 320/1250 [07:43<22:23, 1.44s/it]
Training 1/1 epoch (loss 2.6189): 26%|βββ | 320/1250 [07:44<22:23, 1.44s/it]
Training 1/1 epoch (loss 2.6189): 26%|βββ | 321/1250 [07:44<20:52, 1.35s/it]
Training 1/1 epoch (loss 2.7357): 26%|βββ | 321/1250 [07:46<20:52, 1.35s/it]
Training 1/1 epoch (loss 2.7357): 26%|βββ | 322/1250 [07:46<25:10, 1.63s/it]
Training 1/1 epoch (loss 2.5757): 26%|βββ | 322/1250 [07:47<25:10, 1.63s/it]
Training 1/1 epoch (loss 2.5757): 26%|βββ | 323/1250 [07:47<22:27, 1.45s/it]
Training 1/1 epoch (loss 2.8577): 26%|βββ | 323/1250 [07:49<22:27, 1.45s/it]
Training 1/1 epoch (loss 2.8577): 26%|βββ | 324/1250 [07:49<21:23, 1.39s/it]
Training 1/1 epoch (loss 2.4644): 26%|βββ | 324/1250 [07:50<21:23, 1.39s/it]
Training 1/1 epoch (loss 2.4644): 26%|βββ | 325/1250 [07:50<21:58, 1.43s/it]
Training 1/1 epoch (loss 2.7614): 26%|βββ | 325/1250 [07:51<21:58, 1.43s/it]
Training 1/1 epoch (loss 2.7614): 26%|βββ | 326/1250 [07:51<18:50, 1.22s/it]
Training 1/1 epoch (loss 2.5824): 26%|βββ | 326/1250 [07:53<18:50, 1.22s/it]
Training 1/1 epoch (loss 2.5824): 26%|βββ | 327/1250 [07:53<23:59, 1.56s/it]
Training 1/1 epoch (loss 2.4134): 26%|βββ | 327/1250 [07:54<23:59, 1.56s/it]
Training 1/1 epoch (loss 2.4134): 26%|βββ | 328/1250 [07:54<21:11, 1.38s/it]
Training 1/1 epoch (loss 2.9840): 26%|βββ | 328/1250 [07:55<21:11, 1.38s/it]
Training 1/1 epoch (loss 2.9840): 26%|βββ | 329/1250 [07:55<18:57, 1.23s/it]
Training 1/1 epoch (loss 2.5788): 26%|βββ | 329/1250 [07:57<18:57, 1.23s/it]
Training 1/1 epoch (loss 2.5788): 26%|βββ | 330/1250 [07:57<21:42, 1.42s/it]
Training 1/1 epoch (loss 2.6663): 26%|βββ | 330/1250 [07:59<21:42, 1.42s/it]
Training 1/1 epoch (loss 2.6663): 26%|βββ | 331/1250 [07:59<23:51, 1.56s/it]
Training 1/1 epoch (loss 2.8483): 26%|βββ | 331/1250 [08:00<23:51, 1.56s/it]
Training 1/1 epoch (loss 2.8483): 27%|βββ | 332/1250 [08:00<21:26, 1.40s/it]
Training 1/1 epoch (loss 2.6233): 27%|βββ | 332/1250 [08:02<21:26, 1.40s/it]
Training 1/1 epoch (loss 2.6233): 27%|βββ | 333/1250 [08:02<23:07, 1.51s/it]
Training 1/1 epoch (loss 2.7689): 27%|βββ | 333/1250 [08:03<23:07, 1.51s/it]
Training 1/1 epoch (loss 2.7689): 27%|βββ | 334/1250 [08:03<23:18, 1.53s/it]
Training 1/1 epoch (loss 2.6906): 27%|βββ | 334/1250 [08:05<23:18, 1.53s/it]
Training 1/1 epoch (loss 2.6906): 27%|βββ | 335/1250 [08:05<23:00, 1.51s/it]
Training 1/1 epoch (loss 2.8282): 27%|βββ | 335/1250 [08:06<23:00, 1.51s/it]
Training 1/1 epoch (loss 2.8282): 27%|βββ | 336/1250 [08:06<22:53, 1.50s/it]
Training 1/1 epoch (loss 2.9948): 27%|βββ | 336/1250 [08:07<22:53, 1.50s/it]
Training 1/1 epoch (loss 2.9948): 27%|βββ | 337/1250 [08:07<19:50, 1.30s/it]
Training 1/1 epoch (loss 2.7692): 27%|βββ | 337/1250 [08:09<19:50, 1.30s/it]
Training 1/1 epoch (loss 2.7692): 27%|βββ | 338/1250 [08:09<24:50, 1.63s/it]
Training 1/1 epoch (loss 2.7384): 27%|βββ | 338/1250 [08:11<24:50, 1.63s/it]
Training 1/1 epoch (loss 2.7384): 27%|βββ | 339/1250 [08:11<24:34, 1.62s/it]
Training 1/1 epoch (loss 2.7233): 27%|βββ | 339/1250 [08:11<24:34, 1.62s/it]
Training 1/1 epoch (loss 2.7233): 27%|βββ | 340/1250 [08:11<19:35, 1.29s/it]
Training 1/1 epoch (loss 2.5614): 27%|βββ | 340/1250 [08:13<19:35, 1.29s/it]
Training 1/1 epoch (loss 2.5614): 27%|βββ | 341/1250 [08:13<22:57, 1.52s/it]
Training 1/1 epoch (loss 2.6407): 27%|βββ | 341/1250 [08:16<22:57, 1.52s/it]
Training 1/1 epoch (loss 2.6407): 27%|βββ | 342/1250 [08:16<26:27, 1.75s/it]
Training 1/1 epoch (loss 2.5607): 27%|βββ | 342/1250 [08:16<26:27, 1.75s/it]
Training 1/1 epoch (loss 2.5607): 27%|βββ | 343/1250 [08:16<21:21, 1.41s/it]
Training 1/1 epoch (loss 2.7151): 27%|βββ | 343/1250 [08:19<21:21, 1.41s/it]
Training 1/1 epoch (loss 2.7151): 28%|βββ | 344/1250 [08:19<24:42, 1.64s/it]
Training 1/1 epoch (loss 2.6691): 28%|βββ | 344/1250 [08:20<24:42, 1.64s/it]
Training 1/1 epoch (loss 2.6691): 28%|βββ | 345/1250 [08:20<24:38, 1.63s/it]
Training 1/1 epoch (loss 2.6168): 28%|βββ | 345/1250 [08:21<24:38, 1.63s/it]
Training 1/1 epoch (loss 2.6168): 28%|βββ | 346/1250 [08:21<21:56, 1.46s/it]
Training 1/1 epoch (loss 2.7113): 28%|βββ | 346/1250 [08:23<21:56, 1.46s/it]
Training 1/1 epoch (loss 2.7113): 28%|βββ | 347/1250 [08:23<23:36, 1.57s/it]
Training 1/1 epoch (loss 2.8231): 28%|βββ | 347/1250 [08:25<23:36, 1.57s/it]
Training 1/1 epoch (loss 2.8231): 28%|βββ | 348/1250 [08:25<24:14, 1.61s/it]
Training 1/1 epoch (loss 2.6229): 28%|βββ | 348/1250 [08:26<24:14, 1.61s/it]
Training 1/1 epoch (loss 2.6229): 28%|βββ | 349/1250 [08:26<20:10, 1.34s/it]
Training 1/1 epoch (loss 2.8108): 28%|βββ | 349/1250 [08:27<20:10, 1.34s/it]
Training 1/1 epoch (loss 2.8108): 28%|βββ | 350/1250 [08:27<22:58, 1.53s/it]
Training 1/1 epoch (loss 2.6641): 28%|βββ | 350/1250 [08:28<22:58, 1.53s/it]
Training 1/1 epoch (loss 2.6641): 28%|βββ | 351/1250 [08:28<19:37, 1.31s/it]
Training 1/1 epoch (loss 2.8325): 28%|βββ | 351/1250 [08:30<19:37, 1.31s/it]
Training 1/1 epoch (loss 2.8325): 28%|βββ | 352/1250 [08:30<19:41, 1.32s/it]
Training 1/1 epoch (loss 2.8720): 28%|βββ | 352/1250 [08:31<19:41, 1.32s/it]
Training 1/1 epoch (loss 2.8720): 28%|βββ | 353/1250 [08:31<18:14, 1.22s/it]
Training 1/1 epoch (loss 2.7364): 28%|βββ | 353/1250 [08:31<18:14, 1.22s/it]
Training 1/1 epoch (loss 2.7364): 28%|βββ | 354/1250 [08:31<15:25, 1.03s/it]
Training 1/1 epoch (loss 2.5654): 28%|βββ | 354/1250 [08:33<15:25, 1.03s/it]
Training 1/1 epoch (loss 2.5654): 28%|βββ | 355/1250 [08:33<19:10, 1.29s/it]
Training 1/1 epoch (loss 2.6753): 28%|βββ | 355/1250 [08:35<19:10, 1.29s/it]
Training 1/1 epoch (loss 2.6753): 28%|βββ | 356/1250 [08:35<24:16, 1.63s/it]
Training 1/1 epoch (loss 2.8613): 28%|βββ | 356/1250 [08:36<24:16, 1.63s/it]
Training 1/1 epoch (loss 2.8613): 29%|βββ | 357/1250 [08:36<18:49, 1.26s/it]
Training 1/1 epoch (loss 2.8234): 29%|βββ | 357/1250 [08:38<18:49, 1.26s/it]
Training 1/1 epoch (loss 2.8234): 29%|βββ | 358/1250 [08:38<20:17, 1.37s/it]
Training 1/1 epoch (loss 2.6897): 29%|βββ | 358/1250 [08:40<20:17, 1.37s/it]
Training 1/1 epoch (loss 2.6897): 29%|βββ | 359/1250 [08:40<24:34, 1.65s/it]
Training 1/1 epoch (loss 2.5079): 29%|βββ | 359/1250 [08:41<24:34, 1.65s/it]
Training 1/1 epoch (loss 2.5079): 29%|βββ | 360/1250 [08:41<23:15, 1.57s/it]
Training 1/1 epoch (loss 2.5793): 29%|βββ | 360/1250 [08:43<23:15, 1.57s/it]
Training 1/1 epoch (loss 2.5793): 29%|βββ | 361/1250 [08:43<25:43, 1.74s/it]
Training 1/1 epoch (loss 2.6932): 29%|βββ | 361/1250 [08:45<25:43, 1.74s/it]
Training 1/1 epoch (loss 2.6932): 29%|βββ | 362/1250 [08:45<23:09, 1.57s/it]
Training 1/1 epoch (loss 2.7206): 29%|βββ | 362/1250 [08:47<23:09, 1.57s/it]
Training 1/1 epoch (loss 2.7206): 29%|βββ | 363/1250 [08:47<25:21, 1.72s/it]
Training 1/1 epoch (loss 2.8265): 29%|βββ | 363/1250 [08:48<25:21, 1.72s/it]
Training 1/1 epoch (loss 2.8265): 29%|βββ | 364/1250 [08:48<25:54, 1.75s/it]
Training 1/1 epoch (loss 2.5881): 29%|βββ | 364/1250 [08:49<25:54, 1.75s/it]
Training 1/1 epoch (loss 2.5881): 29%|βββ | 365/1250 [08:49<20:53, 1.42s/it]
Training 1/1 epoch (loss 2.4634): 29%|βββ | 365/1250 [08:51<20:53, 1.42s/it]
Training 1/1 epoch (loss 2.4634): 29%|βββ | 366/1250 [08:51<23:28, 1.59s/it]
Training 1/1 epoch (loss 2.5737): 29%|βββ | 366/1250 [08:53<23:28, 1.59s/it]
Training 1/1 epoch (loss 2.5737): 29%|βββ | 367/1250 [08:53<23:09, 1.57s/it]
Training 1/1 epoch (loss 2.6291): 29%|βββ | 367/1250 [08:53<23:09, 1.57s/it]
Training 1/1 epoch (loss 2.6291): 29%|βββ | 368/1250 [08:53<18:38, 1.27s/it]
Training 1/1 epoch (loss 2.7368): 29%|βββ | 368/1250 [08:56<18:38, 1.27s/it]
Training 1/1 epoch (loss 2.7368): 30%|βββ | 369/1250 [08:56<23:43, 1.62s/it]
Training 1/1 epoch (loss 2.8339): 30%|βββ | 369/1250 [08:57<23:43, 1.62s/it]
Training 1/1 epoch (loss 2.8339): 30%|βββ | 370/1250 [08:57<23:26, 1.60s/it]
Training 1/1 epoch (loss 2.5652): 30%|βββ | 370/1250 [08:59<23:26, 1.60s/it]
Training 1/1 epoch (loss 2.5652): 30%|βββ | 371/1250 [08:59<22:45, 1.55s/it]
Training 1/1 epoch (loss 2.6569): 30%|βββ | 371/1250 [09:00<22:45, 1.55s/it]
Training 1/1 epoch (loss 2.6569): 30%|βββ | 372/1250 [09:00<23:19, 1.59s/it]
Training 1/1 epoch (loss 2.6393): 30%|βββ | 372/1250 [09:02<23:19, 1.59s/it]
Training 1/1 epoch (loss 2.6393): 30%|βββ | 373/1250 [09:02<22:12, 1.52s/it]
Training 1/1 epoch (loss 2.4232): 30%|βββ | 373/1250 [09:04<22:12, 1.52s/it]
Training 1/1 epoch (loss 2.4232): 30%|βββ | 374/1250 [09:04<25:29, 1.75s/it]
Training 1/1 epoch (loss 2.7027): 30%|βββ | 374/1250 [09:05<25:29, 1.75s/it]
Training 1/1 epoch (loss 2.7027): 30%|βββ | 375/1250 [09:05<23:35, 1.62s/it]
Training 1/1 epoch (loss 2.8797): 30%|βββ | 375/1250 [09:06<23:35, 1.62s/it]
Training 1/1 epoch (loss 2.8797): 30%|βββ | 376/1250 [09:06<19:32, 1.34s/it]
Training 1/1 epoch (loss 2.8009): 30%|βββ | 376/1250 [09:08<19:32, 1.34s/it]
Training 1/1 epoch (loss 2.8009): 30%|βββ | 377/1250 [09:08<22:40, 1.56s/it]
Training 1/1 epoch (loss 2.6659): 30%|βββ | 377/1250 [09:09<22:40, 1.56s/it]
Training 1/1 epoch (loss 2.6659): 30%|βββ | 378/1250 [09:09<22:05, 1.52s/it]
Training 1/1 epoch (loss 2.5181): 30%|βββ | 378/1250 [09:10<22:05, 1.52s/it]
Training 1/1 epoch (loss 2.5181): 30%|βββ | 379/1250 [09:10<19:00, 1.31s/it]
Training 1/1 epoch (loss 2.7345): 30%|βββ | 379/1250 [09:11<19:00, 1.31s/it]
Training 1/1 epoch (loss 2.7345): 30%|βββ | 380/1250 [09:11<17:57, 1.24s/it]
Training 1/1 epoch (loss 2.7507): 30%|βββ | 380/1250 [09:12<17:57, 1.24s/it]
Training 1/1 epoch (loss 2.7507): 30%|βββ | 381/1250 [09:12<17:26, 1.20s/it]
Training 1/1 epoch (loss 2.8318): 30%|βββ | 381/1250 [09:14<17:26, 1.20s/it]
Training 1/1 epoch (loss 2.8318): 31%|βββ | 382/1250 [09:14<17:51, 1.23s/it]
Training 1/1 epoch (loss 2.6918): 31%|βββ | 382/1250 [09:15<17:51, 1.23s/it]
Training 1/1 epoch (loss 2.6918): 31%|βββ | 383/1250 [09:15<19:17, 1.34s/it]
Training 1/1 epoch (loss 2.5524): 31%|βββ | 383/1250 [09:16<19:17, 1.34s/it]
Training 1/1 epoch (loss 2.5524): 31%|βββ | 384/1250 [09:16<18:41, 1.29s/it]
Training 1/1 epoch (loss 2.6362): 31%|βββ | 384/1250 [09:17<18:41, 1.29s/it]
Training 1/1 epoch (loss 2.6362): 31%|βββ | 385/1250 [09:17<16:30, 1.15s/it]
Training 1/1 epoch (loss 2.4848): 31%|βββ | 385/1250 [09:19<16:30, 1.15s/it]
Training 1/1 epoch (loss 2.4848): 31%|βββ | 386/1250 [09:19<20:04, 1.39s/it]
Training 1/1 epoch (loss 2.5539): 31%|βββ | 386/1250 [09:20<20:04, 1.39s/it]
Training 1/1 epoch (loss 2.5539): 31%|βββ | 387/1250 [09:20<17:43, 1.23s/it]
Training 1/1 epoch (loss 2.6427): 31%|βββ | 387/1250 [09:21<17:43, 1.23s/it]
Training 1/1 epoch (loss 2.6427): 31%|βββ | 388/1250 [09:21<16:23, 1.14s/it]
Training 1/1 epoch (loss 2.7234): 31%|βββ | 388/1250 [09:23<16:23, 1.14s/it]
Training 1/1 epoch (loss 2.7234): 31%|βββ | 389/1250 [09:23<18:34, 1.29s/it]
Training 1/1 epoch (loss 2.6589): 31%|βββ | 389/1250 [09:24<18:34, 1.29s/it]
Training 1/1 epoch (loss 2.6589): 31%|βββ | 390/1250 [09:24<19:11, 1.34s/it]
Training 1/1 epoch (loss 2.8580): 31%|βββ | 390/1250 [09:25<19:11, 1.34s/it]
Training 1/1 epoch (loss 2.8580): 31%|ββββ | 391/1250 [09:25<19:10, 1.34s/it]
Training 1/1 epoch (loss 2.7211): 31%|ββββ | 391/1250 [09:28<19:10, 1.34s/it]
Training 1/1 epoch (loss 2.7211): 31%|ββββ | 392/1250 [09:28<22:36, 1.58s/it]
Training 1/1 epoch (loss 2.8142): 31%|ββββ | 392/1250 [09:28<22:36, 1.58s/it]
Training 1/1 epoch (loss 2.8142): 31%|ββββ | 393/1250 [09:28<18:34, 1.30s/it]
Training 1/1 epoch (loss 2.5563): 31%|ββββ | 393/1250 [09:30<18:34, 1.30s/it]
Training 1/1 epoch (loss 2.5563): 32%|ββββ | 394/1250 [09:30<19:57, 1.40s/it]
Training 1/1 epoch (loss 2.4111): 32%|ββββ | 394/1250 [09:31<19:57, 1.40s/it]
Training 1/1 epoch (loss 2.4111): 32%|ββββ | 395/1250 [09:31<20:25, 1.43s/it]
Training 1/1 epoch (loss 2.6066): 32%|ββββ | 395/1250 [09:32<20:25, 1.43s/it]
Training 1/1 epoch (loss 2.6066): 32%|ββββ | 396/1250 [09:32<17:27, 1.23s/it]
Training 1/1 epoch (loss 2.4284): 32%|ββββ | 396/1250 [09:33<17:27, 1.23s/it]
Training 1/1 epoch (loss 2.4284): 32%|ββββ | 397/1250 [09:33<17:10, 1.21s/it]
Training 1/1 epoch (loss 2.5165): 32%|ββββ | 397/1250 [09:35<17:10, 1.21s/it]
Training 1/1 epoch (loss 2.5165): 32%|ββββ | 398/1250 [09:35<18:34, 1.31s/it]
Training 1/1 epoch (loss 2.7199): 32%|ββββ | 398/1250 [09:36<18:34, 1.31s/it]
Training 1/1 epoch (loss 2.7199): 32%|ββββ | 399/1250 [09:36<16:14, 1.14s/it]
Training 1/1 epoch (loss 2.7608): 32%|ββββ | 399/1250 [09:38<16:14, 1.14s/it]
Training 1/1 epoch (loss 2.7608): 32%|ββββ | 400/1250 [09:38<22:34, 1.59s/it]
Training 1/1 epoch (loss 2.5263): 32%|ββββ | 400/1250 [09:39<22:34, 1.59s/it]
Training 1/1 epoch (loss 2.5263): 32%|ββββ | 401/1250 [09:39<20:47, 1.47s/it]
Training 1/1 epoch (loss 2.8479): 32%|ββββ | 401/1250 [09:42<20:47, 1.47s/it]
Training 1/1 epoch (loss 2.8479): 32%|ββββ | 402/1250 [09:42<24:47, 1.75s/it]
Training 1/1 epoch (loss 2.7594): 32%|ββββ | 402/1250 [09:44<24:47, 1.75s/it]
Training 1/1 epoch (loss 2.7594): 32%|ββββ | 403/1250 [09:44<27:52, 1.98s/it]
Training 1/1 epoch (loss 2.5077): 32%|ββββ | 403/1250 [09:45<27:52, 1.98s/it]
Training 1/1 epoch (loss 2.5077): 32%|ββββ | 404/1250 [09:45<22:07, 1.57s/it]
Training 1/1 epoch (loss 2.6691): 32%|ββββ | 404/1250 [09:47<22:07, 1.57s/it]
Training 1/1 epoch (loss 2.6691): 32%|ββββ | 405/1250 [09:47<23:54, 1.70s/it]
Training 1/1 epoch (loss 2.6612): 32%|ββββ | 405/1250 [09:49<23:54, 1.70s/it]
Training 1/1 epoch (loss 2.6612): 32%|ββββ | 406/1250 [09:49<23:49, 1.69s/it]
Training 1/1 epoch (loss 2.5584): 32%|ββββ | 406/1250 [09:50<23:49, 1.69s/it]
Training 1/1 epoch (loss 2.5584): 33%|ββββ | 407/1250 [09:50<20:19, 1.45s/it]
Training 1/1 epoch (loss 2.6658): 33%|ββββ | 407/1250 [09:52<20:19, 1.45s/it]
Training 1/1 epoch (loss 2.6658): 33%|ββββ | 408/1250 [09:52<22:52, 1.63s/it]
Training 1/1 epoch (loss 2.8191): 33%|ββββ | 408/1250 [09:53<22:52, 1.63s/it]
Training 1/1 epoch (loss 2.8191): 33%|ββββ | 409/1250 [09:53<20:56, 1.49s/it]
Training 1/1 epoch (loss 2.8488): 33%|ββββ | 409/1250 [09:54<20:56, 1.49s/it]
Training 1/1 epoch (loss 2.8488): 33%|ββββ | 410/1250 [09:54<18:22, 1.31s/it]
Training 1/1 epoch (loss 2.6133): 33%|ββββ | 410/1250 [09:55<18:22, 1.31s/it]
Training 1/1 epoch (loss 2.6133): 33%|ββββ | 411/1250 [09:55<19:13, 1.38s/it]
Training 1/1 epoch (loss 2.4729): 33%|ββββ | 411/1250 [09:56<19:13, 1.38s/it]
Training 1/1 epoch (loss 2.4729): 33%|ββββ | 412/1250 [09:56<17:48, 1.28s/it]
Training 1/1 epoch (loss 2.6187): 33%|ββββ | 412/1250 [09:58<17:48, 1.28s/it]
Training 1/1 epoch (loss 2.6187): 33%|ββββ | 413/1250 [09:58<20:32, 1.47s/it]
Training 1/1 epoch (loss 2.7751): 33%|ββββ | 413/1250 [09:59<20:32, 1.47s/it]
Training 1/1 epoch (loss 2.7751): 33%|ββββ | 414/1250 [09:59<19:50, 1.42s/it]
Training 1/1 epoch (loss 2.6552): 33%|ββββ | 414/1250 [10:00<19:50, 1.42s/it]
Training 1/1 epoch (loss 2.6552): 33%|ββββ | 415/1250 [10:00<16:10, 1.16s/it]
Training 1/1 epoch (loss 2.7628): 33%|ββββ | 415/1250 [10:02<16:10, 1.16s/it]
Training 1/1 epoch (loss 2.7628): 33%|ββββ | 416/1250 [10:02<18:22, 1.32s/it]
Training 1/1 epoch (loss 2.8229): 33%|ββββ | 416/1250 [10:04<18:22, 1.32s/it]
Training 1/1 epoch (loss 2.8229): 33%|ββββ | 417/1250 [10:04<21:28, 1.55s/it]
Training 1/1 epoch (loss 2.6713): 33%|ββββ | 417/1250 [10:04<21:28, 1.55s/it]
Training 1/1 epoch (loss 2.6713): 33%|ββββ | 418/1250 [10:04<17:21, 1.25s/it]
Training 1/1 epoch (loss 2.8238): 33%|ββββ | 418/1250 [10:06<17:21, 1.25s/it]
Training 1/1 epoch (loss 2.8238): 34%|ββββ | 419/1250 [10:06<17:10, 1.24s/it]
Training 1/1 epoch (loss 2.6905): 34%|ββββ | 419/1250 [10:07<17:10, 1.24s/it]
Training 1/1 epoch (loss 2.6905): 34%|ββββ | 420/1250 [10:07<16:06, 1.16s/it]
Training 1/1 epoch (loss 2.6471): 34%|ββββ | 420/1250 [10:07<16:06, 1.16s/it]
Training 1/1 epoch (loss 2.6471): 34%|ββββ | 421/1250 [10:07<14:07, 1.02s/it]
Training 1/1 epoch (loss 2.6021): 34%|ββββ | 421/1250 [10:09<14:07, 1.02s/it]
Training 1/1 epoch (loss 2.6021): 34%|ββββ | 422/1250 [10:09<15:49, 1.15s/it]
Training 1/1 epoch (loss 2.6105): 34%|ββββ | 422/1250 [10:10<15:49, 1.15s/it]
Training 1/1 epoch (loss 2.6105): 34%|ββββ | 423/1250 [10:10<17:20, 1.26s/it]
Training 1/1 epoch (loss 2.6038): 34%|ββββ | 423/1250 [10:12<17:20, 1.26s/it]
Training 1/1 epoch (loss 2.6038): 34%|ββββ | 424/1250 [10:12<17:40, 1.28s/it]
Training 1/1 epoch (loss 2.8084): 34%|ββββ | 424/1250 [10:13<17:40, 1.28s/it]
Training 1/1 epoch (loss 2.8084): 34%|ββββ | 425/1250 [10:13<19:44, 1.44s/it]
Training 1/1 epoch (loss 2.8642): 34%|ββββ | 425/1250 [10:14<19:44, 1.44s/it]
Training 1/1 epoch (loss 2.8642): 34%|ββββ | 426/1250 [10:14<15:34, 1.13s/it]
Training 1/1 epoch (loss 2.6890): 34%|ββββ | 426/1250 [10:16<15:34, 1.13s/it]
Training 1/1 epoch (loss 2.6890): 34%|ββββ | 427/1250 [10:16<19:20, 1.41s/it]
Training 1/1 epoch (loss 2.6146): 34%|ββββ | 427/1250 [10:18<19:20, 1.41s/it]
Training 1/1 epoch (loss 2.6146): 34%|ββββ | 428/1250 [10:18<21:25, 1.56s/it]
Training 1/1 epoch (loss 2.6196): 34%|ββββ | 428/1250 [10:19<21:25, 1.56s/it]
Training 1/1 epoch (loss 2.6196): 34%|ββββ | 429/1250 [10:19<19:03, 1.39s/it]
Training 1/1 epoch (loss 2.4933): 34%|ββββ | 429/1250 [10:21<19:03, 1.39s/it]
Training 1/1 epoch (loss 2.4933): 34%|ββββ | 430/1250 [10:21<20:47, 1.52s/it]
Training 1/1 epoch (loss 2.6314): 34%|ββββ | 430/1250 [10:22<20:47, 1.52s/it]
Training 1/1 epoch (loss 2.6314): 34%|ββββ | 431/1250 [10:22<20:25, 1.50s/it]
Training 1/1 epoch (loss 2.7320): 34%|ββββ | 431/1250 [10:23<20:25, 1.50s/it]
Training 1/1 epoch (loss 2.7320): 35%|ββββ | 432/1250 [10:23<20:02, 1.47s/it]
Training 1/1 epoch (loss 2.6160): 35%|ββββ | 432/1250 [10:25<20:02, 1.47s/it]
Training 1/1 epoch (loss 2.6160): 35%|ββββ | 433/1250 [10:25<19:07, 1.40s/it]
Training 1/1 epoch (loss 2.7768): 35%|ββββ | 433/1250 [10:26<19:07, 1.40s/it]
Training 1/1 epoch (loss 2.7768): 35%|ββββ | 434/1250 [10:26<17:00, 1.25s/it]
Training 1/1 epoch (loss 2.7672): 35%|ββββ | 434/1250 [10:27<17:00, 1.25s/it]
Training 1/1 epoch (loss 2.7672): 35%|ββββ | 435/1250 [10:27<16:32, 1.22s/it]
Training 1/1 epoch (loss 2.8155): 35%|ββββ | 435/1250 [10:29<16:32, 1.22s/it]
Training 1/1 epoch (loss 2.8155): 35%|ββββ | 436/1250 [10:29<19:16, 1.42s/it]
Training 1/1 epoch (loss 2.6080): 35%|ββββ | 436/1250 [10:29<19:16, 1.42s/it]
Training 1/1 epoch (loss 2.6080): 35%|ββββ | 437/1250 [10:29<15:43, 1.16s/it]
Training 1/1 epoch (loss 2.5693): 35%|ββββ | 437/1250 [10:32<15:43, 1.16s/it]
Training 1/1 epoch (loss 2.5693): 35%|ββββ | 438/1250 [10:32<20:43, 1.53s/it]
Training 1/1 epoch (loss 2.7279): 35%|ββββ | 438/1250 [10:34<20:43, 1.53s/it]
Training 1/1 epoch (loss 2.7279): 35%|ββββ | 439/1250 [10:34<24:19, 1.80s/it]
Training 1/1 epoch (loss 2.9884): 35%|ββββ | 439/1250 [10:35<24:19, 1.80s/it]
Training 1/1 epoch (loss 2.9884): 35%|ββββ | 440/1250 [10:35<19:46, 1.46s/it]
Training 1/1 epoch (loss 2.5583): 35%|ββββ | 440/1250 [10:37<19:46, 1.46s/it]
Training 1/1 epoch (loss 2.5583): 35%|ββββ | 441/1250 [10:37<22:34, 1.67s/it]
Training 1/1 epoch (loss 2.6308): 35%|ββββ | 441/1250 [10:38<22:34, 1.67s/it]
Training 1/1 epoch (loss 2.6308): 35%|ββββ | 442/1250 [10:38<21:14, 1.58s/it]
Training 1/1 epoch (loss 2.6247): 35%|ββββ | 442/1250 [10:40<21:14, 1.58s/it]
Training 1/1 epoch (loss 2.6247): 35%|ββββ | 443/1250 [10:40<20:36, 1.53s/it]
Training 1/1 epoch (loss 2.8495): 35%|ββββ | 443/1250 [10:41<20:36, 1.53s/it]
Training 1/1 epoch (loss 2.8495): 36%|ββββ | 444/1250 [10:41<19:24, 1.45s/it]
Training 1/1 epoch (loss 2.6094): 36%|ββββ | 444/1250 [10:42<19:24, 1.45s/it]
Training 1/1 epoch (loss 2.6094): 36%|ββββ | 445/1250 [10:42<20:00, 1.49s/it]
Training 1/1 epoch (loss 2.5652): 36%|ββββ | 445/1250 [10:44<20:00, 1.49s/it]
Training 1/1 epoch (loss 2.5652): 36%|ββββ | 446/1250 [10:44<21:14, 1.58s/it]
Training 1/1 epoch (loss 2.6748): 36%|ββββ | 446/1250 [10:47<21:14, 1.58s/it]
Training 1/1 epoch (loss 2.6748): 36%|ββββ | 447/1250 [10:47<24:44, 1.85s/it]
Training 1/1 epoch (loss 2.5228): 36%|ββββ | 447/1250 [10:48<24:44, 1.85s/it]
Training 1/1 epoch (loss 2.5228): 36%|ββββ | 448/1250 [10:48<21:11, 1.58s/it]
Training 1/1 epoch (loss 2.7304): 36%|ββββ | 448/1250 [10:50<21:11, 1.58s/it]
Training 1/1 epoch (loss 2.7304): 36%|ββββ | 449/1250 [10:50<25:05, 1.88s/it]
Training 1/1 epoch (loss 2.6408): 36%|ββββ | 449/1250 [10:52<25:05, 1.88s/it]
Training 1/1 epoch (loss 2.6408): 36%|ββββ | 450/1250 [10:52<23:06, 1.73s/it]
Training 1/1 epoch (loss 2.6087): 36%|ββββ | 450/1250 [10:52<23:06, 1.73s/it]
Training 1/1 epoch (loss 2.6087): 36%|ββββ | 451/1250 [10:52<18:43, 1.41s/it]
Training 1/1 epoch (loss 2.5890): 36%|ββββ | 451/1250 [10:54<18:43, 1.41s/it]
Training 1/1 epoch (loss 2.5890): 36%|ββββ | 452/1250 [10:54<20:44, 1.56s/it]
Training 1/1 epoch (loss 2.7193): 36%|ββββ | 452/1250 [10:55<20:44, 1.56s/it]
Training 1/1 epoch (loss 2.7193): 36%|ββββ | 453/1250 [10:55<19:04, 1.44s/it]
Training 1/1 epoch (loss 2.6947): 36%|ββββ | 453/1250 [10:56<19:04, 1.44s/it]
Training 1/1 epoch (loss 2.6947): 36%|ββββ | 454/1250 [10:56<17:00, 1.28s/it]
Training 1/1 epoch (loss 2.6451): 36%|ββββ | 454/1250 [10:58<17:00, 1.28s/it]
Training 1/1 epoch (loss 2.6451): 36%|ββββ | 455/1250 [10:58<19:48, 1.50s/it]
Training 1/1 epoch (loss 2.7738): 36%|ββββ | 455/1250 [11:00<19:48, 1.50s/it]
Training 1/1 epoch (loss 2.7738): 36%|ββββ | 456/1250 [11:00<20:11, 1.53s/it]
Training 1/1 epoch (loss 2.8372): 36%|ββββ | 456/1250 [11:01<20:11, 1.53s/it]
Training 1/1 epoch (loss 2.8372): 37%|ββββ | 457/1250 [11:01<17:40, 1.34s/it]
Training 1/1 epoch (loss 2.6663): 37%|ββββ | 457/1250 [11:03<17:40, 1.34s/it]
Training 1/1 epoch (loss 2.6663): 37%|ββββ | 458/1250 [11:03<20:16, 1.54s/it]
Training 1/1 epoch (loss 2.9013): 37%|ββββ | 458/1250 [11:04<20:16, 1.54s/it]
Training 1/1 epoch (loss 2.9013): 37%|ββββ | 459/1250 [11:04<18:09, 1.38s/it]
Training 1/1 epoch (loss 2.7456): 37%|ββββ | 459/1250 [11:05<18:09, 1.38s/it]
Training 1/1 epoch (loss 2.7456): 37%|ββββ | 460/1250 [11:05<19:22, 1.47s/it]
Training 1/1 epoch (loss 2.5533): 37%|ββββ | 460/1250 [11:07<19:22, 1.47s/it]
Training 1/1 epoch (loss 2.5533): 37%|ββββ | 461/1250 [11:07<18:11, 1.38s/it]
Training 1/1 epoch (loss 2.5904): 37%|ββββ | 461/1250 [11:07<18:11, 1.38s/it]
Training 1/1 epoch (loss 2.5904): 37%|ββββ | 462/1250 [11:07<15:42, 1.20s/it]
Training 1/1 epoch (loss 2.7028): 37%|ββββ | 462/1250 [11:09<15:42, 1.20s/it]
Training 1/1 epoch (loss 2.7028): 37%|ββββ | 463/1250 [11:09<17:21, 1.32s/it]
Training 1/1 epoch (loss 2.5791): 37%|ββββ | 463/1250 [11:10<17:21, 1.32s/it]
Training 1/1 epoch (loss 2.5791): 37%|ββββ | 464/1250 [11:10<18:08, 1.38s/it]
Training 1/1 epoch (loss 2.3955): 37%|ββββ | 464/1250 [11:11<18:08, 1.38s/it]
Training 1/1 epoch (loss 2.3955): 37%|ββββ | 465/1250 [11:11<15:16, 1.17s/it]
Training 1/1 epoch (loss 2.5013): 37%|ββββ | 465/1250 [11:14<15:16, 1.17s/it]
Training 1/1 epoch (loss 2.5013): 37%|ββββ | 466/1250 [11:14<20:22, 1.56s/it]
Training 1/1 epoch (loss 2.5626): 37%|ββββ | 466/1250 [11:15<20:22, 1.56s/it]
Training 1/1 epoch (loss 2.5626): 37%|ββββ | 467/1250 [11:15<21:31, 1.65s/it]
Training 1/1 epoch (loss 2.7239): 37%|ββββ | 467/1250 [11:16<21:31, 1.65s/it]
Training 1/1 epoch (loss 2.7239): 37%|ββββ | 468/1250 [11:16<17:02, 1.31s/it]
Training 1/1 epoch (loss 2.7522): 37%|ββββ | 468/1250 [11:17<17:02, 1.31s/it]
Training 1/1 epoch (loss 2.7522): 38%|ββββ | 469/1250 [11:17<17:17, 1.33s/it]
Training 1/1 epoch (loss 2.7546): 38%|ββββ | 469/1250 [11:19<17:17, 1.33s/it]
Training 1/1 epoch (loss 2.7546): 38%|ββββ | 470/1250 [11:19<18:36, 1.43s/it]
Training 1/1 epoch (loss 2.7923): 38%|ββββ | 470/1250 [11:20<18:36, 1.43s/it]
Training 1/1 epoch (loss 2.7923): 38%|ββββ | 471/1250 [11:20<16:25, 1.27s/it]
Training 1/1 epoch (loss 2.5788): 38%|ββββ | 471/1250 [11:22<16:25, 1.27s/it]
Training 1/1 epoch (loss 2.5788): 38%|ββββ | 472/1250 [11:22<19:26, 1.50s/it]
Training 1/1 epoch (loss 2.5995): 38%|ββββ | 472/1250 [11:23<19:26, 1.50s/it]
Training 1/1 epoch (loss 2.5995): 38%|ββββ | 473/1250 [11:23<18:45, 1.45s/it]
Training 1/1 epoch (loss 2.5392): 38%|ββββ | 473/1250 [11:25<18:45, 1.45s/it]
Training 1/1 epoch (loss 2.5392): 38%|ββββ | 474/1250 [11:25<20:36, 1.59s/it]
Training 1/1 epoch (loss 2.6292): 38%|ββββ | 474/1250 [11:27<20:36, 1.59s/it]
Training 1/1 epoch (loss 2.6292): 38%|ββββ | 475/1250 [11:27<19:28, 1.51s/it]
Training 1/1 epoch (loss 2.7298): 38%|ββββ | 475/1250 [11:27<19:28, 1.51s/it]
Training 1/1 epoch (loss 2.7298): 38%|ββββ | 476/1250 [11:27<16:11, 1.26s/it]
Training 1/1 epoch (loss 2.7148): 38%|ββββ | 476/1250 [11:29<16:11, 1.26s/it]
Training 1/1 epoch (loss 2.7148): 38%|ββββ | 477/1250 [11:29<19:04, 1.48s/it]
Training 1/1 epoch (loss 2.8425): 38%|ββββ | 477/1250 [11:31<19:04, 1.48s/it]
Training 1/1 epoch (loss 2.8425): 38%|ββββ | 478/1250 [11:31<20:38, 1.60s/it]
Training 1/1 epoch (loss 2.6144): 38%|ββββ | 478/1250 [11:32<20:38, 1.60s/it]
Training 1/1 epoch (loss 2.6144): 38%|ββββ | 479/1250 [11:32<18:08, 1.41s/it]
Training 1/1 epoch (loss 2.6559): 38%|ββββ | 479/1250 [11:35<18:08, 1.41s/it]
Training 1/1 epoch (loss 2.6559): 38%|ββββ | 480/1250 [11:35<23:06, 1.80s/it]
Training 1/1 epoch (loss 2.6354): 38%|ββββ | 480/1250 [11:36<23:06, 1.80s/it]
Training 1/1 epoch (loss 2.6354): 38%|ββββ | 481/1250 [11:36<19:50, 1.55s/it]
Training 1/1 epoch (loss 2.8414): 38%|ββββ | 481/1250 [11:37<19:50, 1.55s/it]
Training 1/1 epoch (loss 2.8414): 39%|ββββ | 482/1250 [11:37<19:44, 1.54s/it]
Training 1/1 epoch (loss 2.5825): 39%|ββββ | 482/1250 [11:39<19:44, 1.54s/it]
Training 1/1 epoch (loss 2.5825): 39%|ββββ | 483/1250 [11:39<20:23, 1.60s/it]
Training 1/1 epoch (loss 2.4990): 39%|ββββ | 483/1250 [11:40<20:23, 1.60s/it]
Training 1/1 epoch (loss 2.4990): 39%|ββββ | 484/1250 [11:40<18:36, 1.46s/it]
Training 1/1 epoch (loss 2.5440): 39%|ββββ | 484/1250 [11:42<18:36, 1.46s/it]
Training 1/1 epoch (loss 2.5440): 39%|ββββ | 485/1250 [11:42<18:51, 1.48s/it]
Training 1/1 epoch (loss 2.6909): 39%|ββββ | 485/1250 [11:43<18:51, 1.48s/it]
Training 1/1 epoch (loss 2.6909): 39%|ββββ | 486/1250 [11:43<19:35, 1.54s/it]
Training 1/1 epoch (loss 2.6625): 39%|ββββ | 486/1250 [11:44<19:35, 1.54s/it]
Training 1/1 epoch (loss 2.6625): 39%|ββββ | 487/1250 [11:44<15:37, 1.23s/it]
Training 1/1 epoch (loss 2.5128): 39%|ββββ | 487/1250 [11:46<15:37, 1.23s/it]
Training 1/1 epoch (loss 2.5128): 39%|ββββ | 488/1250 [11:46<18:08, 1.43s/it]
Training 1/1 epoch (loss 2.6997): 39%|ββββ | 488/1250 [11:47<18:08, 1.43s/it]
Training 1/1 epoch (loss 2.6997): 39%|ββββ | 489/1250 [11:47<19:10, 1.51s/it]
Training 1/1 epoch (loss 2.8731): 39%|ββββ | 489/1250 [11:49<19:10, 1.51s/it]
Training 1/1 epoch (loss 2.8731): 39%|ββββ | 490/1250 [11:49<21:01, 1.66s/it]
Training 1/1 epoch (loss 2.6987): 39%|ββββ | 490/1250 [11:51<21:01, 1.66s/it]
Training 1/1 epoch (loss 2.6987): 39%|ββββ | 491/1250 [11:51<20:07, 1.59s/it]
Training 1/1 epoch (loss 2.6264): 39%|ββββ | 491/1250 [11:52<20:07, 1.59s/it]
Training 1/1 epoch (loss 2.6264): 39%|ββββ | 492/1250 [11:52<17:24, 1.38s/it]
Training 1/1 epoch (loss 2.7255): 39%|ββββ | 492/1250 [11:54<17:24, 1.38s/it]
Training 1/1 epoch (loss 2.7255): 39%|ββββ | 493/1250 [11:54<19:25, 1.54s/it]
Training 1/1 epoch (loss 2.4759): 39%|ββββ | 493/1250 [11:55<19:25, 1.54s/it]
Training 1/1 epoch (loss 2.4759): 40%|ββββ | 494/1250 [11:55<17:52, 1.42s/it]
Training 1/1 epoch (loss 2.7911): 40%|ββββ | 494/1250 [11:56<17:52, 1.42s/it]
Training 1/1 epoch (loss 2.7911): 40%|ββββ | 495/1250 [11:56<15:41, 1.25s/it]
Training 1/1 epoch (loss 2.7201): 40%|ββββ | 495/1250 [11:57<15:41, 1.25s/it]
Training 1/1 epoch (loss 2.7201): 40%|ββββ | 496/1250 [11:57<17:39, 1.40s/it]
Training 1/1 epoch (loss 2.5624): 40%|ββββ | 496/1250 [11:59<17:39, 1.40s/it]
Training 1/1 epoch (loss 2.5624): 40%|ββββ | 497/1250 [11:59<18:25, 1.47s/it]
Training 1/1 epoch (loss 2.7950): 40%|ββββ | 497/1250 [12:00<18:25, 1.47s/it]
Training 1/1 epoch (loss 2.7950): 40%|ββββ | 498/1250 [12:00<17:14, 1.38s/it]
Training 1/1 epoch (loss 2.4343): 40%|ββββ | 498/1250 [12:02<17:14, 1.38s/it]
Training 1/1 epoch (loss 2.4343): 40%|ββββ | 499/1250 [12:02<18:24, 1.47s/it]
Training 1/1 epoch (loss 2.7708): 40%|ββββ | 499/1250 [12:03<18:24, 1.47s/it]
Training 1/1 epoch (loss 2.7708): 40%|ββββ | 500/1250 [12:03<15:46, 1.26s/it]
Training 1/1 epoch (loss 2.7069): 40%|ββββ | 500/1250 [12:05<15:46, 1.26s/it]
Training 1/1 epoch (loss 2.7069): 40%|ββββ | 501/1250 [12:05<20:21, 1.63s/it]
Training 1/1 epoch (loss 2.8279): 40%|ββββ | 501/1250 [12:07<20:21, 1.63s/it]
Training 1/1 epoch (loss 2.8279): 40%|ββββ | 502/1250 [12:07<22:12, 1.78s/it]
Training 1/1 epoch (loss 2.9321): 40%|ββββ | 502/1250 [12:08<22:12, 1.78s/it]
Training 1/1 epoch (loss 2.9321): 40%|ββββ | 503/1250 [12:08<17:08, 1.38s/it]
Training 1/1 epoch (loss 2.5204): 40%|ββββ | 503/1250 [12:10<17:08, 1.38s/it]
Training 1/1 epoch (loss 2.5204): 40%|ββββ | 504/1250 [12:10<21:15, 1.71s/it]
Training 1/1 epoch (loss 2.8400): 40%|ββββ | 504/1250 [12:12<21:15, 1.71s/it]
Training 1/1 epoch (loss 2.8400): 40%|ββββ | 505/1250 [12:12<22:53, 1.84s/it]
Training 1/1 epoch (loss 2.8484): 40%|ββββ | 505/1250 [12:13<22:53, 1.84s/it]
Training 1/1 epoch (loss 2.8484): 40%|ββββ | 506/1250 [12:13<20:02, 1.62s/it]
Training 1/1 epoch (loss 2.6631): 40%|ββββ | 506/1250 [12:15<20:02, 1.62s/it]
Training 1/1 epoch (loss 2.6631): 41%|ββββ | 507/1250 [12:15<19:46, 1.60s/it]
Training 1/1 epoch (loss 2.6624): 41%|ββββ | 507/1250 [12:16<19:46, 1.60s/it]
Training 1/1 epoch (loss 2.6624): 41%|ββββ | 508/1250 [12:16<19:11, 1.55s/it]
Training 1/1 epoch (loss 2.7015): 41%|ββββ | 508/1250 [12:18<19:11, 1.55s/it]
Training 1/1 epoch (loss 2.7015): 41%|ββββ | 509/1250 [12:18<18:30, 1.50s/it]
Training 1/1 epoch (loss 2.5881): 41%|ββββ | 509/1250 [12:19<18:30, 1.50s/it]
Training 1/1 epoch (loss 2.5881): 41%|ββββ | 510/1250 [12:19<17:03, 1.38s/it]
Training 1/1 epoch (loss 2.6135): 41%|ββββ | 510/1250 [12:20<17:03, 1.38s/it]
Training 1/1 epoch (loss 2.6135): 41%|ββββ | 511/1250 [12:20<16:08, 1.31s/it]
Training 1/1 epoch (loss 2.6448): 41%|ββββ | 511/1250 [12:21<16:08, 1.31s/it]
Training 1/1 epoch (loss 2.6448): 41%|ββββ | 512/1250 [12:21<14:41, 1.19s/it]
Training 1/1 epoch (loss 2.6987): 41%|ββββ | 512/1250 [12:22<14:41, 1.19s/it]
Training 1/1 epoch (loss 2.6987): 41%|ββββ | 513/1250 [12:22<13:34, 1.11s/it]
Training 1/1 epoch (loss 2.5636): 41%|ββββ | 513/1250 [12:23<13:34, 1.11s/it]
Training 1/1 epoch (loss 2.5636): 41%|ββββ | 514/1250 [12:23<13:07, 1.07s/it]
Training 1/1 epoch (loss 2.5543): 41%|ββββ | 514/1250 [12:24<13:07, 1.07s/it]
Training 1/1 epoch (loss 2.5543): 41%|ββββ | 515/1250 [12:24<13:55, 1.14s/it]
Training 1/1 epoch (loss 2.6529): 41%|ββββ | 515/1250 [12:26<13:55, 1.14s/it]
Training 1/1 epoch (loss 2.6529): 41%|βββββ | 516/1250 [12:26<16:27, 1.34s/it]
Training 1/1 epoch (loss 2.7930): 41%|βββββ | 516/1250 [12:27<16:27, 1.34s/it]
Training 1/1 epoch (loss 2.7930): 41%|βββββ | 517/1250 [12:27<14:30, 1.19s/it]
Training 1/1 epoch (loss 2.6596): 41%|βββββ | 517/1250 [12:28<14:30, 1.19s/it]
Training 1/1 epoch (loss 2.6596): 41%|βββββ | 518/1250 [12:28<15:20, 1.26s/it]
Training 1/1 epoch (loss 2.6193): 41%|βββββ | 518/1250 [12:30<15:20, 1.26s/it]
Training 1/1 epoch (loss 2.6193): 42%|βββββ | 519/1250 [12:30<18:25, 1.51s/it]
Training 1/1 epoch (loss 2.5132): 42%|βββββ | 519/1250 [12:31<18:25, 1.51s/it]
Training 1/1 epoch (loss 2.5132): 42%|βββββ | 520/1250 [12:31<15:34, 1.28s/it]
Training 1/1 epoch (loss 2.7778): 42%|βββββ | 520/1250 [12:34<15:34, 1.28s/it]
Training 1/1 epoch (loss 2.7778): 42%|βββββ | 521/1250 [12:34<19:44, 1.62s/it]
Training 1/1 epoch (loss 2.7397): 42%|βββββ | 521/1250 [12:35<19:44, 1.62s/it]
Training 1/1 epoch (loss 2.7397): 42%|βββββ | 522/1250 [12:35<18:46, 1.55s/it]
Training 1/1 epoch (loss 2.8839): 42%|βββββ | 522/1250 [12:35<18:46, 1.55s/it]
Training 1/1 epoch (loss 2.8839): 42%|βββββ | 523/1250 [12:35<14:29, 1.20s/it]
Training 1/1 epoch (loss 2.7314): 42%|βββββ | 523/1250 [12:37<14:29, 1.20s/it]
Training 1/1 epoch (loss 2.7314): 42%|βββββ | 524/1250 [12:37<17:06, 1.41s/it]
Training 1/1 epoch (loss 2.7087): 42%|βββββ | 524/1250 [12:39<17:06, 1.41s/it]
Training 1/1 epoch (loss 2.7087): 42%|βββββ | 525/1250 [12:39<17:49, 1.48s/it]
Training 1/1 epoch (loss 2.6593): 42%|βββββ | 525/1250 [12:40<17:49, 1.48s/it]
Training 1/1 epoch (loss 2.6593): 42%|βββββ | 526/1250 [12:40<15:05, 1.25s/it]
Training 1/1 epoch (loss 2.5829): 42%|βββββ | 526/1250 [12:41<15:05, 1.25s/it]
Training 1/1 epoch (loss 2.5829): 42%|βββββ | 527/1250 [12:41<15:37, 1.30s/it]
Training 1/1 epoch (loss 2.7203): 42%|βββββ | 527/1250 [12:43<15:37, 1.30s/it]
Training 1/1 epoch (loss 2.7203): 42%|βββββ | 528/1250 [12:43<16:33, 1.38s/it]
Training 1/1 epoch (loss 2.5864): 42%|βββββ | 528/1250 [12:44<16:33, 1.38s/it]
Training 1/1 epoch (loss 2.5864): 42%|βββββ | 529/1250 [12:44<16:25, 1.37s/it]
Training 1/1 epoch (loss 2.7065): 42%|βββββ | 529/1250 [12:46<16:25, 1.37s/it]
Training 1/1 epoch (loss 2.7065): 42%|βββββ | 530/1250 [12:46<19:44, 1.65s/it]
Training 1/1 epoch (loss 2.4687): 42%|βββββ | 530/1250 [12:47<19:44, 1.65s/it]
Training 1/1 epoch (loss 2.4687): 42%|βββββ | 531/1250 [12:47<16:50, 1.41s/it]
Training 1/1 epoch (loss 2.6429): 42%|βββββ | 531/1250 [12:48<16:50, 1.41s/it]
Training 1/1 epoch (loss 2.6429): 43%|βββββ | 532/1250 [12:48<16:45, 1.40s/it]
Training 1/1 epoch (loss 2.7205): 43%|βββββ | 532/1250 [12:51<16:45, 1.40s/it]
Training 1/1 epoch (loss 2.7205): 43%|βββββ | 533/1250 [12:51<20:45, 1.74s/it]
Training 1/1 epoch (loss 2.8042): 43%|βββββ | 533/1250 [12:52<20:45, 1.74s/it]
Training 1/1 epoch (loss 2.8042): 43%|βββββ | 534/1250 [12:52<16:39, 1.40s/it]
Training 1/1 epoch (loss 2.8914): 43%|βββββ | 534/1250 [12:53<16:39, 1.40s/it]
Training 1/1 epoch (loss 2.8914): 43%|βββββ | 535/1250 [12:53<16:43, 1.40s/it]
Training 1/1 epoch (loss 2.8479): 43%|βββββ | 535/1250 [12:55<16:43, 1.40s/it]
Training 1/1 epoch (loss 2.8479): 43%|βββββ | 536/1250 [12:55<19:55, 1.67s/it]
Training 1/1 epoch (loss 2.8805): 43%|βββββ | 536/1250 [12:56<19:55, 1.67s/it]
Training 1/1 epoch (loss 2.8805): 43%|βββββ | 537/1250 [12:56<18:05, 1.52s/it]
Training 1/1 epoch (loss 2.5748): 43%|βββββ | 537/1250 [12:59<18:05, 1.52s/it]
Training 1/1 epoch (loss 2.5748): 43%|βββββ | 538/1250 [12:59<21:45, 1.83s/it]
Training 1/1 epoch (loss 2.7588): 43%|βββββ | 538/1250 [13:00<21:45, 1.83s/it]
Training 1/1 epoch (loss 2.7588): 43%|βββββ | 539/1250 [13:00<18:19, 1.55s/it]
Training 1/1 epoch (loss 2.8386): 43%|βββββ | 539/1250 [13:02<18:19, 1.55s/it]
Training 1/1 epoch (loss 2.8386): 43%|βββββ | 540/1250 [13:02<19:40, 1.66s/it]
Training 1/1 epoch (loss 2.6906): 43%|βββββ | 540/1250 [13:04<19:40, 1.66s/it]
Training 1/1 epoch (loss 2.6906): 43%|βββββ | 541/1250 [13:04<20:12, 1.71s/it]
Training 1/1 epoch (loss 2.6882): 43%|βββββ | 541/1250 [13:05<20:12, 1.71s/it]
Training 1/1 epoch (loss 2.6882): 43%|βββββ | 542/1250 [13:05<17:42, 1.50s/it]
Training 1/1 epoch (loss 2.5770): 43%|βββββ | 542/1250 [13:07<17:42, 1.50s/it]
Training 1/1 epoch (loss 2.5770): 43%|βββββ | 543/1250 [13:07<20:51, 1.77s/it]
Training 1/1 epoch (loss 2.6081): 43%|βββββ | 543/1250 [13:09<20:51, 1.77s/it]
Training 1/1 epoch (loss 2.6081): 44%|βββββ | 544/1250 [13:09<21:50, 1.86s/it]
Training 1/1 epoch (loss 2.7029): 44%|βββββ | 544/1250 [13:10<21:50, 1.86s/it]
Training 1/1 epoch (loss 2.7029): 44%|βββββ | 545/1250 [13:10<18:26, 1.57s/it]
Training 1/1 epoch (loss 2.7848): 44%|βββββ | 545/1250 [13:12<18:26, 1.57s/it]
Training 1/1 epoch (loss 2.7848): 44%|βββββ | 546/1250 [13:12<20:38, 1.76s/it]
Training 1/1 epoch (loss 2.7225): 44%|βββββ | 546/1250 [13:13<20:38, 1.76s/it]
Training 1/1 epoch (loss 2.7225): 44%|βββββ | 547/1250 [13:13<19:03, 1.63s/it]
Training 1/1 epoch (loss 2.8150): 44%|βββββ | 547/1250 [13:14<19:03, 1.63s/it]
Training 1/1 epoch (loss 2.8150): 44%|βββββ | 548/1250 [13:14<15:25, 1.32s/it]
Training 1/1 epoch (loss 2.9203): 44%|βββββ | 548/1250 [13:16<15:25, 1.32s/it]
Training 1/1 epoch (loss 2.9203): 44%|βββββ | 549/1250 [13:16<17:01, 1.46s/it]
Training 1/1 epoch (loss 2.7547): 44%|βββββ | 549/1250 [13:18<17:01, 1.46s/it]
Training 1/1 epoch (loss 2.7547): 44%|βββββ | 550/1250 [13:18<18:13, 1.56s/it]
Training 1/1 epoch (loss 2.6111): 44%|βββββ | 550/1250 [13:18<18:13, 1.56s/it]
Training 1/1 epoch (loss 2.6111): 44%|βββββ | 551/1250 [13:18<15:31, 1.33s/it]
Training 1/1 epoch (loss 2.5161): 44%|βββββ | 551/1250 [13:20<15:31, 1.33s/it]
Training 1/1 epoch (loss 2.5161): 44%|βββββ | 552/1250 [13:20<17:34, 1.51s/it]
Training 1/1 epoch (loss 2.8199): 44%|βββββ | 552/1250 [13:21<17:34, 1.51s/it]
Training 1/1 epoch (loss 2.8199): 44%|βββββ | 553/1250 [13:21<14:44, 1.27s/it]
Training 1/1 epoch (loss 2.6862): 44%|βββββ | 553/1250 [13:23<14:44, 1.27s/it]
Training 1/1 epoch (loss 2.6862): 44%|βββββ | 554/1250 [13:23<18:34, 1.60s/it]
Training 1/1 epoch (loss 2.4726): 44%|βββββ | 554/1250 [13:25<18:34, 1.60s/it]
Training 1/1 epoch (loss 2.4726): 44%|βββββ | 555/1250 [13:25<19:13, 1.66s/it]
Training 1/1 epoch (loss 2.6531): 44%|βββββ | 555/1250 [13:26<19:13, 1.66s/it]
Training 1/1 epoch (loss 2.6531): 44%|βββββ | 556/1250 [13:26<15:08, 1.31s/it]
Training 1/1 epoch (loss 2.8455): 44%|βββββ | 556/1250 [13:28<15:08, 1.31s/it]
Training 1/1 epoch (loss 2.8455): 45%|βββββ | 557/1250 [13:28<16:40, 1.44s/it]
Training 1/1 epoch (loss 2.7231): 45%|βββββ | 557/1250 [13:29<16:40, 1.44s/it]
Training 1/1 epoch (loss 2.7231): 45%|βββββ | 558/1250 [13:29<18:08, 1.57s/it]
Training 1/1 epoch (loss 2.7855): 45%|βββββ | 558/1250 [13:30<18:08, 1.57s/it]
Training 1/1 epoch (loss 2.7855): 45%|βββββ | 559/1250 [13:30<15:05, 1.31s/it]
Training 1/1 epoch (loss 2.7884): 45%|βββββ | 559/1250 [13:31<15:05, 1.31s/it]
Training 1/1 epoch (loss 2.7884): 45%|βββββ | 560/1250 [13:31<14:38, 1.27s/it]
Training 1/1 epoch (loss 2.5296): 45%|βββββ | 560/1250 [13:33<14:38, 1.27s/it]
Training 1/1 epoch (loss 2.5296): 45%|βββββ | 561/1250 [13:33<15:44, 1.37s/it]
Training 1/1 epoch (loss 2.7699): 45%|βββββ | 561/1250 [13:34<15:44, 1.37s/it]
Training 1/1 epoch (loss 2.7699): 45%|βββββ | 562/1250 [13:34<14:18, 1.25s/it]
Training 1/1 epoch (loss 2.7943): 45%|βββββ | 562/1250 [13:35<14:18, 1.25s/it]
Training 1/1 epoch (loss 2.7943): 45%|βββββ | 563/1250 [13:35<15:22, 1.34s/it]
Training 1/1 epoch (loss 2.8348): 45%|βββββ | 563/1250 [13:37<15:22, 1.34s/it]
Training 1/1 epoch (loss 2.8348): 45%|βββββ | 564/1250 [13:37<14:44, 1.29s/it]
Training 1/1 epoch (loss 2.7618): 45%|βββββ | 564/1250 [13:37<14:44, 1.29s/it]
Training 1/1 epoch (loss 2.7618): 45%|βββββ | 565/1250 [13:37<13:18, 1.17s/it]
Training 1/1 epoch (loss 2.7600): 45%|βββββ | 565/1250 [13:40<13:18, 1.17s/it]
Training 1/1 epoch (loss 2.7600): 45%|βββββ | 566/1250 [13:40<17:52, 1.57s/it]
Training 1/1 epoch (loss 2.7205): 45%|βββββ | 566/1250 [13:41<17:52, 1.57s/it]
Training 1/1 epoch (loss 2.7205): 45%|βββββ | 567/1250 [13:41<16:50, 1.48s/it]
Training 1/1 epoch (loss 2.6841): 45%|βββββ | 567/1250 [13:43<16:50, 1.48s/it]
Training 1/1 epoch (loss 2.6841): 45%|βββββ | 568/1250 [13:43<17:59, 1.58s/it]
Training 1/1 epoch (loss 2.7964): 45%|βββββ | 568/1250 [13:44<17:59, 1.58s/it]
Training 1/1 epoch (loss 2.7964): 46%|βββββ | 569/1250 [13:44<17:28, 1.54s/it]
Training 1/1 epoch (loss 2.6470): 46%|βββββ | 569/1250 [13:45<17:28, 1.54s/it]
Training 1/1 epoch (loss 2.6470): 46%|βββββ | 570/1250 [13:45<14:18, 1.26s/it]
Training 1/1 epoch (loss 2.6930): 46%|βββββ | 570/1250 [13:47<14:18, 1.26s/it]
Training 1/1 epoch (loss 2.6930): 46%|βββββ | 571/1250 [13:47<15:27, 1.37s/it]
Training 1/1 epoch (loss 2.8290): 46%|βββββ | 571/1250 [13:49<15:27, 1.37s/it]
Training 1/1 epoch (loss 2.8290): 46%|βββββ | 572/1250 [13:49<18:20, 1.62s/it]
Training 1/1 epoch (loss 2.7141): 46%|βββββ | 572/1250 [13:49<18:20, 1.62s/it]
Training 1/1 epoch (loss 2.7141): 46%|βββββ | 573/1250 [13:49<14:36, 1.29s/it]
Training 1/1 epoch (loss 2.7469): 46%|βββββ | 573/1250 [13:52<14:36, 1.29s/it]
Training 1/1 epoch (loss 2.7469): 46%|βββββ | 574/1250 [13:52<18:31, 1.64s/it]
Training 1/1 epoch (loss 2.8200): 46%|βββββ | 574/1250 [13:54<18:31, 1.64s/it]
Training 1/1 epoch (loss 2.8200): 46%|βββββ | 575/1250 [13:54<19:49, 1.76s/it]
Training 1/1 epoch (loss 2.6578): 46%|βββββ | 575/1250 [13:55<19:49, 1.76s/it]
Training 1/1 epoch (loss 2.6578): 46%|βββββ | 576/1250 [13:55<17:30, 1.56s/it]
Training 1/1 epoch (loss 2.6227): 46%|βββββ | 576/1250 [13:56<17:30, 1.56s/it]
Training 1/1 epoch (loss 2.6227): 46%|βββββ | 577/1250 [13:56<16:45, 1.49s/it]
Training 1/1 epoch (loss 2.8172): 46%|βββββ | 577/1250 [13:57<16:45, 1.49s/it]
Training 1/1 epoch (loss 2.8172): 46%|βββββ | 578/1250 [13:57<15:07, 1.35s/it]
Training 1/1 epoch (loss 2.9136): 46%|βββββ | 578/1250 [14:00<15:07, 1.35s/it]
Training 1/1 epoch (loss 2.9136): 46%|βββββ | 579/1250 [14:00<18:05, 1.62s/it]
Training 1/1 epoch (loss 2.8637): 46%|βββββ | 579/1250 [14:01<18:05, 1.62s/it]
Training 1/1 epoch (loss 2.8637): 46%|βββββ | 580/1250 [14:01<18:38, 1.67s/it]
Training 1/1 epoch (loss 2.6864): 46%|βββββ | 580/1250 [14:02<18:38, 1.67s/it]
Training 1/1 epoch (loss 2.6864): 46%|βββββ | 581/1250 [14:02<14:42, 1.32s/it]
Training 1/1 epoch (loss 2.5894): 46%|βββββ | 581/1250 [14:04<14:42, 1.32s/it]
Training 1/1 epoch (loss 2.5894): 47%|βββββ | 582/1250 [14:04<16:31, 1.48s/it]
Training 1/1 epoch (loss 2.5470): 47%|βββββ | 582/1250 [14:05<16:31, 1.48s/it]
Training 1/1 epoch (loss 2.5470): 47%|βββββ | 583/1250 [14:05<15:43, 1.41s/it]
Training 1/1 epoch (loss 2.6041): 47%|βββββ | 583/1250 [14:06<15:43, 1.41s/it]
Training 1/1 epoch (loss 2.6041): 47%|βββββ | 584/1250 [14:06<13:35, 1.22s/it]
Training 1/1 epoch (loss 2.7909): 47%|βββββ | 584/1250 [14:08<13:35, 1.22s/it]
Training 1/1 epoch (loss 2.7909): 47%|βββββ | 585/1250 [14:08<15:43, 1.42s/it]
Training 1/1 epoch (loss 2.5880): 47%|βββββ | 585/1250 [14:10<15:43, 1.42s/it]
Training 1/1 epoch (loss 2.5880): 47%|βββββ | 586/1250 [14:10<17:48, 1.61s/it]
Training 1/1 epoch (loss 2.6550): 47%|βββββ | 586/1250 [14:11<17:48, 1.61s/it]
Training 1/1 epoch (loss 2.6550): 47%|βββββ | 587/1250 [14:11<15:14, 1.38s/it]
Training 1/1 epoch (loss 2.4435): 47%|βββββ | 587/1250 [14:12<15:14, 1.38s/it]
Training 1/1 epoch (loss 2.4435): 47%|βββββ | 588/1250 [14:12<16:47, 1.52s/it]
Training 1/1 epoch (loss 2.5411): 47%|βββββ | 588/1250 [14:14<16:47, 1.52s/it]
Training 1/1 epoch (loss 2.5411): 47%|βββββ | 589/1250 [14:14<16:49, 1.53s/it]
Training 1/1 epoch (loss 2.6979): 47%|βββββ | 589/1250 [14:15<16:49, 1.53s/it]
Training 1/1 epoch (loss 2.6979): 47%|βββββ | 590/1250 [14:15<15:58, 1.45s/it]
Training 1/1 epoch (loss 2.5763): 47%|βββββ | 590/1250 [14:18<15:58, 1.45s/it]
Training 1/1 epoch (loss 2.5763): 47%|βββββ | 591/1250 [14:18<19:18, 1.76s/it]
Training 1/1 epoch (loss 2.7215): 47%|βββββ | 591/1250 [14:19<19:18, 1.76s/it]
Training 1/1 epoch (loss 2.7215): 47%|βββββ | 592/1250 [14:19<17:05, 1.56s/it]
Training 1/1 epoch (loss 2.8266): 47%|βββββ | 592/1250 [14:21<17:05, 1.56s/it]
Training 1/1 epoch (loss 2.8266): 47%|βββββ | 593/1250 [14:21<19:57, 1.82s/it]
Training 1/1 epoch (loss 2.6069): 47%|βββββ | 593/1250 [14:23<19:57, 1.82s/it]
Training 1/1 epoch (loss 2.6069): 48%|βββββ | 594/1250 [14:23<20:56, 1.92s/it]
Training 1/1 epoch (loss 2.5879): 48%|βββββ | 594/1250 [14:24<20:56, 1.92s/it]
Training 1/1 epoch (loss 2.5879): 48%|βββββ | 595/1250 [14:24<16:51, 1.54s/it]
Training 1/1 epoch (loss 2.6303): 48%|βββββ | 595/1250 [14:26<16:51, 1.54s/it]
Training 1/1 epoch (loss 2.6303): 48%|βββββ | 596/1250 [14:26<17:38, 1.62s/it]
Training 1/1 epoch (loss 2.7333): 48%|βββββ | 596/1250 [14:27<17:38, 1.62s/it]
Training 1/1 epoch (loss 2.7333): 48%|βββββ | 597/1250 [14:27<17:01, 1.56s/it]
Training 1/1 epoch (loss 2.6795): 48%|βββββ | 597/1250 [14:28<17:01, 1.56s/it]
Training 1/1 epoch (loss 2.6795): 48%|βββββ | 598/1250 [14:28<14:41, 1.35s/it]
Training 1/1 epoch (loss 2.7003): 48%|βββββ | 598/1250 [14:30<14:41, 1.35s/it]
Training 1/1 epoch (loss 2.7003): 48%|βββββ | 599/1250 [14:30<16:33, 1.53s/it]
Training 1/1 epoch (loss 2.6887): 48%|βββββ | 599/1250 [14:31<16:33, 1.53s/it]
Training 1/1 epoch (loss 2.6887): 48%|βββββ | 600/1250 [14:31<14:41, 1.36s/it]
Training 1/1 epoch (loss 2.8890): 48%|βββββ | 600/1250 [14:32<14:41, 1.36s/it]
Training 1/1 epoch (loss 2.8890): 48%|βββββ | 601/1250 [14:32<14:11, 1.31s/it]
Training 1/1 epoch (loss 2.5971): 48%|βββββ | 601/1250 [14:34<14:11, 1.31s/it]
Training 1/1 epoch (loss 2.5971): 48%|βββββ | 602/1250 [14:34<16:15, 1.50s/it]
Training 1/1 epoch (loss 2.5735): 48%|βββββ | 602/1250 [14:35<16:15, 1.50s/it]
Training 1/1 epoch (loss 2.5735): 48%|βββββ | 603/1250 [14:35<13:43, 1.27s/it]
Training 1/1 epoch (loss 2.7785): 48%|βββββ | 603/1250 [14:36<13:43, 1.27s/it]
Training 1/1 epoch (loss 2.7785): 48%|βββββ | 604/1250 [14:36<14:14, 1.32s/it]
Training 1/1 epoch (loss 2.6707): 48%|βββββ | 604/1250 [14:38<14:14, 1.32s/it]
Training 1/1 epoch (loss 2.6707): 48%|βββββ | 605/1250 [14:38<15:40, 1.46s/it]
Training 1/1 epoch (loss 2.7932): 48%|βββββ | 605/1250 [14:39<15:40, 1.46s/it]
Training 1/1 epoch (loss 2.7932): 48%|βββββ | 606/1250 [14:39<12:29, 1.16s/it]
Training 1/1 epoch (loss 2.5722): 48%|βββββ | 606/1250 [14:41<12:29, 1.16s/it]
Training 1/1 epoch (loss 2.5722): 49%|βββββ | 607/1250 [14:41<16:31, 1.54s/it]
Training 1/1 epoch (loss 2.8843): 49%|βββββ | 607/1250 [14:43<16:31, 1.54s/it]
Training 1/1 epoch (loss 2.8843): 49%|βββββ | 608/1250 [14:43<16:59, 1.59s/it]
Training 1/1 epoch (loss 2.6326): 49%|βββββ | 608/1250 [14:43<16:59, 1.59s/it]
Training 1/1 epoch (loss 2.6326): 49%|βββββ | 609/1250 [14:43<14:05, 1.32s/it]
Training 1/1 epoch (loss 2.7642): 49%|βββββ | 609/1250 [14:45<14:05, 1.32s/it]
Training 1/1 epoch (loss 2.7642): 49%|βββββ | 610/1250 [14:45<15:35, 1.46s/it]
Training 1/1 epoch (loss 2.7424): 49%|βββββ | 610/1250 [14:47<15:35, 1.46s/it]
Training 1/1 epoch (loss 2.7424): 49%|βββββ | 611/1250 [14:47<15:41, 1.47s/it]
Training 1/1 epoch (loss 2.5728): 49%|βββββ | 611/1250 [14:48<15:41, 1.47s/it]
Training 1/1 epoch (loss 2.5728): 49%|βββββ | 612/1250 [14:48<15:07, 1.42s/it]
Training 1/1 epoch (loss 2.8065): 49%|βββββ | 612/1250 [14:50<15:07, 1.42s/it]
Training 1/1 epoch (loss 2.8065): 49%|βββββ | 613/1250 [14:50<16:50, 1.59s/it]
Training 1/1 epoch (loss 2.8513): 49%|βββββ | 613/1250 [14:51<16:50, 1.59s/it]
Training 1/1 epoch (loss 2.8513): 49%|βββββ | 614/1250 [14:51<15:06, 1.43s/it]
Training 1/1 epoch (loss 2.7036): 49%|βββββ | 614/1250 [14:52<15:06, 1.43s/it]
Training 1/1 epoch (loss 2.7036): 49%|βββββ | 615/1250 [14:52<14:42, 1.39s/it]
Training 1/1 epoch (loss 2.7736): 49%|βββββ | 615/1250 [14:54<14:42, 1.39s/it]
Training 1/1 epoch (loss 2.7736): 49%|βββββ | 616/1250 [14:54<15:17, 1.45s/it]
Training 1/1 epoch (loss 2.3511): 49%|βββββ | 616/1250 [14:55<15:17, 1.45s/it]
Training 1/1 epoch (loss 2.3511): 49%|βββββ | 617/1250 [14:55<13:44, 1.30s/it]
Training 1/1 epoch (loss 2.6886): 49%|βββββ | 617/1250 [14:56<13:44, 1.30s/it]
Training 1/1 epoch (loss 2.6886): 49%|βββββ | 618/1250 [14:56<13:26, 1.28s/it]
Training 1/1 epoch (loss 2.6158): 49%|βββββ | 618/1250 [14:57<13:26, 1.28s/it]
Training 1/1 epoch (loss 2.6158): 50%|βββββ | 619/1250 [14:57<13:37, 1.30s/it]
Training 1/1 epoch (loss 2.6185): 50%|βββββ | 619/1250 [14:58<13:37, 1.30s/it]
Training 1/1 epoch (loss 2.6185): 50%|βββββ | 620/1250 [14:58<10:52, 1.04s/it]
Training 1/1 epoch (loss 2.5569): 50%|βββββ | 620/1250 [15:00<10:52, 1.04s/it]
Training 1/1 epoch (loss 2.5569): 50%|βββββ | 621/1250 [15:00<14:14, 1.36s/it]
Training 1/1 epoch (loss 2.8364): 50%|βββββ | 621/1250 [15:02<14:14, 1.36s/it]
Training 1/1 epoch (loss 2.8364): 50%|βββββ | 622/1250 [15:02<15:09, 1.45s/it]
Training 1/1 epoch (loss 2.7262): 50%|βββββ | 622/1250 [15:02<15:09, 1.45s/it]
Training 1/1 epoch (loss 2.7262): 50%|βββββ | 623/1250 [15:02<12:38, 1.21s/it]
Training 1/1 epoch (loss 2.6499): 50%|βββββ | 623/1250 [15:05<12:38, 1.21s/it]
Training 1/1 epoch (loss 2.6499): 50%|βββββ | 624/1250 [15:05<16:37, 1.59s/it]
Training 1/1 epoch (loss 2.7123): 50%|βββββ | 624/1250 [15:06<16:37, 1.59s/it]
Training 1/1 epoch (loss 2.7123): 50%|βββββ | 625/1250 [15:06<15:16, 1.47s/it]
Training 1/1 epoch (loss 2.7603): 50%|βββββ | 625/1250 [15:07<15:16, 1.47s/it]
Training 1/1 epoch (loss 2.7603): 50%|βββββ | 626/1250 [15:07<12:56, 1.24s/it]
Training 1/1 epoch (loss 2.4679): 50%|βββββ | 626/1250 [15:08<12:56, 1.24s/it]
Training 1/1 epoch (loss 2.4679): 50%|βββββ | 627/1250 [15:08<13:27, 1.30s/it]
Training 1/1 epoch (loss 2.6600): 50%|βββββ | 627/1250 [15:09<13:27, 1.30s/it]
Training 1/1 epoch (loss 2.6600): 50%|βββββ | 628/1250 [15:09<12:30, 1.21s/it]
Training 1/1 epoch (loss 2.8506): 50%|βββββ | 628/1250 [15:10<12:30, 1.21s/it]
Training 1/1 epoch (loss 2.8506): 50%|βββββ | 629/1250 [15:10<11:40, 1.13s/it]
Training 1/1 epoch (loss 2.4235): 50%|βββββ | 629/1250 [15:13<11:40, 1.13s/it]
Training 1/1 epoch (loss 2.4235): 50%|βββββ | 630/1250 [15:13<15:42, 1.52s/it]
Training 1/1 epoch (loss 2.6785): 50%|βββββ | 630/1250 [15:14<15:42, 1.52s/it]
Training 1/1 epoch (loss 2.6785): 50%|βββββ | 631/1250 [15:14<15:42, 1.52s/it]
Training 1/1 epoch (loss 2.8479): 50%|βββββ | 631/1250 [15:16<15:42, 1.52s/it]
Training 1/1 epoch (loss 2.8479): 51%|βββββ | 632/1250 [15:16<16:05, 1.56s/it]
Training 1/1 epoch (loss 2.7007): 51%|βββββ | 632/1250 [15:18<16:05, 1.56s/it]
Training 1/1 epoch (loss 2.7007): 51%|βββββ | 633/1250 [15:18<18:48, 1.83s/it]
Training 1/1 epoch (loss 2.6779): 51%|βββββ | 633/1250 [15:19<18:48, 1.83s/it]
Training 1/1 epoch (loss 2.6779): 51%|βββββ | 634/1250 [15:19<15:28, 1.51s/it]
Training 1/1 epoch (loss 2.5353): 51%|βββββ | 634/1250 [15:21<15:28, 1.51s/it]
Training 1/1 epoch (loss 2.5353): 51%|βββββ | 635/1250 [15:21<18:16, 1.78s/it]
Training 1/1 epoch (loss 2.4935): 51%|βββββ | 635/1250 [15:23<18:16, 1.78s/it]
Training 1/1 epoch (loss 2.4935): 51%|βββββ | 636/1250 [15:23<17:03, 1.67s/it]
Training 1/1 epoch (loss 2.5703): 51%|βββββ | 636/1250 [15:24<17:03, 1.67s/it]
Training 1/1 epoch (loss 2.5703): 51%|βββββ | 637/1250 [15:24<15:05, 1.48s/it]
Training 1/1 epoch (loss 2.5328): 51%|βββββ | 637/1250 [15:26<15:05, 1.48s/it]
Training 1/1 epoch (loss 2.5328): 51%|βββββ | 638/1250 [15:26<17:11, 1.69s/it]
Training 1/1 epoch (loss 2.8745): 51%|βββββ | 638/1250 [15:27<17:11, 1.69s/it]
Training 1/1 epoch (loss 2.8745): 51%|βββββ | 639/1250 [15:27<15:16, 1.50s/it]
Training 1/1 epoch (loss 2.6576): 51%|βββββ | 639/1250 [15:28<15:16, 1.50s/it]
Training 1/1 epoch (loss 2.6576): 51%|βββββ | 640/1250 [15:28<13:56, 1.37s/it]
Training 1/1 epoch (loss 2.6165): 51%|βββββ | 640/1250 [15:30<13:56, 1.37s/it]
Training 1/1 epoch (loss 2.6165): 51%|ββββββ | 641/1250 [15:30<14:32, 1.43s/it]
Training 1/1 epoch (loss 2.6343): 51%|ββββββ | 641/1250 [15:31<14:32, 1.43s/it]
Training 1/1 epoch (loss 2.6343): 51%|ββββββ | 642/1250 [15:31<12:53, 1.27s/it]
Training 1/1 epoch (loss 2.6751): 51%|ββββββ | 642/1250 [15:32<12:53, 1.27s/it]
Training 1/1 epoch (loss 2.6751): 51%|ββββββ | 643/1250 [15:32<14:29, 1.43s/it]
Training 1/1 epoch (loss 2.6685): 51%|ββββββ | 643/1250 [15:34<14:29, 1.43s/it]
Training 1/1 epoch (loss 2.6685): 52%|ββββββ | 644/1250 [15:34<15:50, 1.57s/it]
Training 1/1 epoch (loss 2.5447): 52%|ββββββ | 644/1250 [15:35<15:50, 1.57s/it]
Training 1/1 epoch (loss 2.5447): 52%|ββββββ | 645/1250 [15:35<12:31, 1.24s/it]
Training 1/1 epoch (loss 2.2722): 52%|ββββββ | 645/1250 [15:37<12:31, 1.24s/it]
Training 1/1 epoch (loss 2.2722): 52%|ββββββ | 646/1250 [15:37<14:08, 1.40s/it]
Training 1/1 epoch (loss 2.6533): 52%|ββββββ | 646/1250 [15:39<14:08, 1.40s/it]
Training 1/1 epoch (loss 2.6533): 52%|ββββββ | 647/1250 [15:39<16:44, 1.67s/it]
Training 1/1 epoch (loss 2.8162): 52%|ββββββ | 647/1250 [15:40<16:44, 1.67s/it]
Training 1/1 epoch (loss 2.8162): 52%|ββββββ | 648/1250 [15:40<15:14, 1.52s/it]
Training 1/1 epoch (loss 2.5195): 52%|ββββββ | 648/1250 [15:42<15:14, 1.52s/it]
Training 1/1 epoch (loss 2.5195): 52%|ββββββ | 649/1250 [15:42<16:30, 1.65s/it]
Training 1/1 epoch (loss 2.8052): 52%|ββββββ | 649/1250 [15:43<16:30, 1.65s/it]
Training 1/1 epoch (loss 2.8052): 52%|ββββββ | 650/1250 [15:43<13:34, 1.36s/it]
Training 1/1 epoch (loss 2.7059): 52%|ββββββ | 650/1250 [15:45<13:34, 1.36s/it]
Training 1/1 epoch (loss 2.7059): 52%|ββββββ | 651/1250 [15:45<15:25, 1.55s/it]
Training 1/1 epoch (loss 2.5560): 52%|ββββββ | 651/1250 [15:47<15:25, 1.55s/it]
Training 1/1 epoch (loss 2.5560): 52%|ββββββ | 652/1250 [15:47<17:16, 1.73s/it]
Training 1/1 epoch (loss 2.5467): 52%|ββββββ | 652/1250 [15:48<17:16, 1.73s/it]
Training 1/1 epoch (loss 2.5467): 52%|ββββββ | 653/1250 [15:48<14:35, 1.47s/it]
Training 1/1 epoch (loss 2.4835): 52%|ββββββ | 653/1250 [15:50<14:35, 1.47s/it]
Training 1/1 epoch (loss 2.4835): 52%|ββββββ | 654/1250 [15:50<16:36, 1.67s/it]
Training 1/1 epoch (loss 2.6986): 52%|ββββββ | 654/1250 [15:52<16:36, 1.67s/it]
Training 1/1 epoch (loss 2.6986): 52%|ββββββ | 655/1250 [15:52<17:49, 1.80s/it]
Training 1/1 epoch (loss 2.8668): 52%|ββββββ | 655/1250 [15:53<17:49, 1.80s/it]
Training 1/1 epoch (loss 2.8668): 52%|ββββββ | 656/1250 [15:53<14:33, 1.47s/it]
Training 1/1 epoch (loss 2.8108): 52%|ββββββ | 656/1250 [15:54<14:33, 1.47s/it]
Training 1/1 epoch (loss 2.8108): 53%|ββββββ | 657/1250 [15:54<15:50, 1.60s/it]
Training 1/1 epoch (loss 2.8269): 53%|ββββββ | 657/1250 [15:56<15:50, 1.60s/it]
Training 1/1 epoch (loss 2.8269): 53%|ββββββ | 658/1250 [15:56<15:11, 1.54s/it]
Training 1/1 epoch (loss 2.5625): 53%|ββββββ | 658/1250 [15:56<15:11, 1.54s/it]
Training 1/1 epoch (loss 2.5625): 53%|ββββββ | 659/1250 [15:56<12:28, 1.27s/it]
Training 1/1 epoch (loss 2.3504): 53%|ββββββ | 659/1250 [15:59<12:28, 1.27s/it]
Training 1/1 epoch (loss 2.3504): 53%|ββββββ | 660/1250 [15:59<15:44, 1.60s/it]
Training 1/1 epoch (loss 2.5120): 53%|ββββββ | 660/1250 [16:00<15:44, 1.60s/it]
Training 1/1 epoch (loss 2.5120): 53%|ββββββ | 661/1250 [16:00<13:56, 1.42s/it]
Training 1/1 epoch (loss 2.6010): 53%|ββββββ | 661/1250 [16:01<13:56, 1.42s/it]
Training 1/1 epoch (loss 2.6010): 53%|ββββββ | 662/1250 [16:01<13:28, 1.37s/it]
Training 1/1 epoch (loss 2.6845): 53%|ββββββ | 662/1250 [16:03<13:28, 1.37s/it]
Training 1/1 epoch (loss 2.6845): 53%|ββββββ | 663/1250 [16:03<13:33, 1.39s/it]
Training 1/1 epoch (loss 2.7650): 53%|ββββββ | 663/1250 [16:04<13:33, 1.39s/it]
Training 1/1 epoch (loss 2.7650): 53%|ββββββ | 664/1250 [16:04<12:33, 1.29s/it]
Training 1/1 epoch (loss 2.7083): 53%|ββββββ | 664/1250 [16:05<12:33, 1.29s/it]
Training 1/1 epoch (loss 2.7083): 53%|ββββββ | 665/1250 [16:05<13:46, 1.41s/it]
Training 1/1 epoch (loss 2.6369): 53%|ββββββ | 665/1250 [16:08<13:46, 1.41s/it]
Training 1/1 epoch (loss 2.6369): 53%|ββββββ | 666/1250 [16:08<16:07, 1.66s/it]
Training 1/1 epoch (loss 2.8546): 53%|ββββββ | 666/1250 [16:09<16:07, 1.66s/it]
Training 1/1 epoch (loss 2.8546): 53%|ββββββ | 667/1250 [16:09<14:21, 1.48s/it]
Training 1/1 epoch (loss 2.7554): 53%|ββββββ | 667/1250 [16:11<14:21, 1.48s/it]
Training 1/1 epoch (loss 2.7554): 53%|ββββββ | 668/1250 [16:11<15:37, 1.61s/it]
Training 1/1 epoch (loss 2.7035): 53%|ββββββ | 668/1250 [16:12<15:37, 1.61s/it]
Training 1/1 epoch (loss 2.7035): 54%|ββββββ | 669/1250 [16:12<15:31, 1.60s/it]
Training 1/1 epoch (loss 2.9590): 54%|ββββββ | 669/1250 [16:13<15:31, 1.60s/it]
Training 1/1 epoch (loss 2.9590): 54%|ββββββ | 670/1250 [16:13<12:10, 1.26s/it]
Training 1/1 epoch (loss 2.9710): 54%|ββββββ | 670/1250 [16:14<12:10, 1.26s/it]
Training 1/1 epoch (loss 2.9710): 54%|ββββββ | 671/1250 [16:14<13:46, 1.43s/it]
Training 1/1 epoch (loss 2.8468): 54%|ββββββ | 671/1250 [16:16<13:46, 1.43s/it]
Training 1/1 epoch (loss 2.8468): 54%|ββββββ | 672/1250 [16:16<13:25, 1.39s/it]
Training 1/1 epoch (loss 2.5578): 54%|ββββββ | 672/1250 [16:17<13:25, 1.39s/it]
Training 1/1 epoch (loss 2.5578): 54%|ββββββ | 673/1250 [16:17<12:45, 1.33s/it]
Training 1/1 epoch (loss 2.7980): 54%|ββββββ | 673/1250 [16:19<12:45, 1.33s/it]
Training 1/1 epoch (loss 2.7980): 54%|ββββββ | 674/1250 [16:19<14:11, 1.48s/it]
Training 1/1 epoch (loss 2.4641): 54%|ββββββ | 674/1250 [16:20<14:11, 1.48s/it]
Training 1/1 epoch (loss 2.4641): 54%|ββββββ | 675/1250 [16:20<13:12, 1.38s/it]
Training 1/1 epoch (loss 2.7195): 54%|ββββββ | 675/1250 [16:21<13:12, 1.38s/it]
Training 1/1 epoch (loss 2.7195): 54%|ββββββ | 676/1250 [16:21<12:12, 1.28s/it]
Training 1/1 epoch (loss 2.5117): 54%|ββββββ | 676/1250 [16:23<12:12, 1.28s/it]
Training 1/1 epoch (loss 2.5117): 54%|ββββββ | 677/1250 [16:23<15:49, 1.66s/it]
Training 1/1 epoch (loss 2.7658): 54%|ββββββ | 677/1250 [16:25<15:49, 1.66s/it]
Training 1/1 epoch (loss 2.7658): 54%|ββββββ | 678/1250 [16:25<14:44, 1.55s/it]
Training 1/1 epoch (loss 2.7613): 54%|ββββββ | 678/1250 [16:26<14:44, 1.55s/it]
Training 1/1 epoch (loss 2.7613): 54%|ββββββ | 679/1250 [16:26<14:10, 1.49s/it]
Training 1/1 epoch (loss 2.6677): 54%|ββββββ | 679/1250 [16:28<14:10, 1.49s/it]
Training 1/1 epoch (loss 2.6677): 54%|ββββββ | 680/1250 [16:28<15:27, 1.63s/it]
Training 1/1 epoch (loss 2.5732): 54%|ββββββ | 680/1250 [16:29<15:27, 1.63s/it]
Training 1/1 epoch (loss 2.5732): 54%|ββββββ | 681/1250 [16:29<13:02, 1.37s/it]
Training 1/1 epoch (loss 2.7917): 54%|ββββββ | 681/1250 [16:30<13:02, 1.37s/it]
Training 1/1 epoch (loss 2.7917): 55%|ββββββ | 682/1250 [16:30<13:39, 1.44s/it]
Training 1/1 epoch (loss 2.7716): 55%|ββββββ | 682/1250 [16:32<13:39, 1.44s/it]
Training 1/1 epoch (loss 2.7716): 55%|ββββββ | 683/1250 [16:32<13:52, 1.47s/it]
Training 1/1 epoch (loss 2.7420): 55%|ββββββ | 683/1250 [16:33<13:52, 1.47s/it]
Training 1/1 epoch (loss 2.7420): 55%|ββββββ | 684/1250 [16:33<12:27, 1.32s/it]
Training 1/1 epoch (loss 2.4869): 55%|ββββββ | 684/1250 [16:34<12:27, 1.32s/it]
Training 1/1 epoch (loss 2.4869): 55%|ββββββ | 685/1250 [16:34<12:29, 1.33s/it]
Training 1/1 epoch (loss 2.7400): 55%|ββββββ | 685/1250 [16:36<12:29, 1.33s/it]
Training 1/1 epoch (loss 2.7400): 55%|ββββββ | 686/1250 [16:36<13:16, 1.41s/it]
Training 1/1 epoch (loss 2.6179): 55%|ββββββ | 686/1250 [16:37<13:16, 1.41s/it]
Training 1/1 epoch (loss 2.6179): 55%|ββββββ | 687/1250 [16:37<12:53, 1.37s/it]
Training 1/1 epoch (loss 2.7339): 55%|ββββββ | 687/1250 [16:39<12:53, 1.37s/it]
Training 1/1 epoch (loss 2.7339): 55%|ββββββ | 688/1250 [16:39<12:57, 1.38s/it]
Training 1/1 epoch (loss 2.7796): 55%|ββββββ | 688/1250 [16:40<12:57, 1.38s/it]
Training 1/1 epoch (loss 2.7796): 55%|ββββββ | 689/1250 [16:40<12:58, 1.39s/it]
Training 1/1 epoch (loss 2.8004): 55%|ββββββ | 689/1250 [16:42<12:58, 1.39s/it]
Training 1/1 epoch (loss 2.8004): 55%|ββββββ | 690/1250 [16:42<15:03, 1.61s/it]
Training 1/1 epoch (loss 2.6168): 55%|ββββββ | 690/1250 [16:44<15:03, 1.61s/it]
Training 1/1 epoch (loss 2.6168): 55%|ββββββ | 691/1250 [16:44<16:16, 1.75s/it]
Training 1/1 epoch (loss 2.7259): 55%|ββββββ | 691/1250 [16:45<16:16, 1.75s/it]
Training 1/1 epoch (loss 2.7259): 55%|ββββββ | 692/1250 [16:45<12:56, 1.39s/it]
Training 1/1 epoch (loss 2.7067): 55%|ββββββ | 692/1250 [16:47<12:56, 1.39s/it]
Training 1/1 epoch (loss 2.7067): 55%|ββββββ | 693/1250 [16:47<14:55, 1.61s/it]
Training 1/1 epoch (loss 2.8842): 55%|ββββββ | 693/1250 [16:49<14:55, 1.61s/it]
Training 1/1 epoch (loss 2.8842): 56%|ββββββ | 694/1250 [16:49<17:14, 1.86s/it]
Training 1/1 epoch (loss 2.6375): 56%|ββββββ | 694/1250 [16:50<17:14, 1.86s/it]
Training 1/1 epoch (loss 2.6375): 56%|ββββββ | 695/1250 [16:50<13:30, 1.46s/it]
Training 1/1 epoch (loss 2.9049): 56%|ββββββ | 695/1250 [16:52<13:30, 1.46s/it]
Training 1/1 epoch (loss 2.9049): 56%|ββββββ | 696/1250 [16:52<14:12, 1.54s/it]
Training 1/1 epoch (loss 2.6689): 56%|ββββββ | 696/1250 [16:53<14:12, 1.54s/it]
Training 1/1 epoch (loss 2.6689): 56%|ββββββ | 697/1250 [16:53<14:29, 1.57s/it]
Training 1/1 epoch (loss 2.5341): 56%|ββββββ | 697/1250 [16:54<14:29, 1.57s/it]
Training 1/1 epoch (loss 2.5341): 56%|ββββββ | 698/1250 [16:54<12:15, 1.33s/it]
Training 1/1 epoch (loss 2.8390): 56%|ββββββ | 698/1250 [16:55<12:15, 1.33s/it]
Training 1/1 epoch (loss 2.8390): 56%|ββββββ | 699/1250 [16:55<11:51, 1.29s/it]
Training 1/1 epoch (loss 2.9260): 56%|ββββββ | 699/1250 [16:57<11:51, 1.29s/it]
Training 1/1 epoch (loss 2.9260): 56%|ββββββ | 700/1250 [16:57<13:36, 1.48s/it]
Training 1/1 epoch (loss 2.8206): 56%|ββββββ | 700/1250 [16:58<13:36, 1.48s/it]
Training 1/1 epoch (loss 2.8206): 56%|ββββββ | 701/1250 [16:58<12:03, 1.32s/it]
Training 1/1 epoch (loss 2.6184): 56%|ββββββ | 701/1250 [17:00<12:03, 1.32s/it]
Training 1/1 epoch (loss 2.6184): 56%|ββββββ | 702/1250 [17:00<14:37, 1.60s/it]
Training 1/1 epoch (loss 2.7334): 56%|ββββββ | 702/1250 [17:01<14:37, 1.60s/it]
Training 1/1 epoch (loss 2.7334): 56%|ββββββ | 703/1250 [17:01<12:44, 1.40s/it]
Training 1/1 epoch (loss 2.5352): 56%|ββββββ | 703/1250 [17:02<12:44, 1.40s/it]
Training 1/1 epoch (loss 2.5352): 56%|ββββββ | 704/1250 [17:02<12:21, 1.36s/it]
Training 1/1 epoch (loss 2.7107): 56%|ββββββ | 704/1250 [17:05<12:21, 1.36s/it]
Training 1/1 epoch (loss 2.7107): 56%|ββββββ | 705/1250 [17:05<15:09, 1.67s/it]
Training 1/1 epoch (loss 2.8003): 56%|ββββββ | 705/1250 [17:06<15:09, 1.67s/it]
Training 1/1 epoch (loss 2.8003): 56%|ββββββ | 706/1250 [17:06<13:01, 1.44s/it]
Training 1/1 epoch (loss 2.7759): 56%|ββββββ | 706/1250 [17:07<13:01, 1.44s/it]
Training 1/1 epoch (loss 2.7759): 57%|ββββββ | 707/1250 [17:07<11:47, 1.30s/it]
Training 1/1 epoch (loss 2.8062): 57%|ββββββ | 707/1250 [17:08<11:47, 1.30s/it]
Training 1/1 epoch (loss 2.8062): 57%|ββββββ | 708/1250 [17:08<12:17, 1.36s/it]
Training 1/1 epoch (loss 2.8475): 57%|ββββββ | 708/1250 [17:09<12:17, 1.36s/it]
Training 1/1 epoch (loss 2.8475): 57%|ββββββ | 709/1250 [17:09<11:14, 1.25s/it]
Training 1/1 epoch (loss 2.6570): 57%|ββββββ | 709/1250 [17:11<11:14, 1.25s/it]
Training 1/1 epoch (loss 2.6570): 57%|ββββββ | 710/1250 [17:11<13:34, 1.51s/it]
Training 1/1 epoch (loss 2.8123): 57%|ββββββ | 710/1250 [17:13<13:34, 1.51s/it]
Training 1/1 epoch (loss 2.8123): 57%|ββββββ | 711/1250 [17:13<14:30, 1.62s/it]
Training 1/1 epoch (loss 2.8120): 57%|ββββββ | 711/1250 [17:14<14:30, 1.62s/it]
Training 1/1 epoch (loss 2.8120): 57%|ββββββ | 712/1250 [17:14<11:53, 1.33s/it]
Training 1/1 epoch (loss 2.5476): 57%|ββββββ | 712/1250 [17:15<11:53, 1.33s/it]
Training 1/1 epoch (loss 2.5476): 57%|ββββββ | 713/1250 [17:15<12:37, 1.41s/it]
Training 1/1 epoch (loss 2.8905): 57%|ββββββ | 713/1250 [17:17<12:37, 1.41s/it]
Training 1/1 epoch (loss 2.8905): 57%|ββββββ | 714/1250 [17:17<12:59, 1.45s/it]
Training 1/1 epoch (loss 2.7893): 57%|ββββββ | 714/1250 [17:18<12:59, 1.45s/it]
Training 1/1 epoch (loss 2.7893): 57%|ββββββ | 715/1250 [17:18<11:29, 1.29s/it]
Training 1/1 epoch (loss 2.4270): 57%|ββββββ | 715/1250 [17:20<11:29, 1.29s/it]
Training 1/1 epoch (loss 2.4270): 57%|ββββββ | 716/1250 [17:20<13:31, 1.52s/it]
Training 1/1 epoch (loss 2.7385): 57%|ββββββ | 716/1250 [17:22<13:31, 1.52s/it]
Training 1/1 epoch (loss 2.7385): 57%|ββββββ | 717/1250 [17:22<13:57, 1.57s/it]
Training 1/1 epoch (loss 2.7463): 57%|ββββββ | 717/1250 [17:23<13:57, 1.57s/it]
Training 1/1 epoch (loss 2.7463): 57%|ββββββ | 718/1250 [17:23<13:36, 1.53s/it]
Training 1/1 epoch (loss 2.7833): 57%|ββββββ | 718/1250 [17:24<13:36, 1.53s/it]
Training 1/1 epoch (loss 2.7833): 58%|ββββββ | 719/1250 [17:24<12:42, 1.44s/it]
Training 1/1 epoch (loss 2.8388): 58%|ββββββ | 719/1250 [17:26<12:42, 1.44s/it]
Training 1/1 epoch (loss 2.8388): 58%|ββββββ | 720/1250 [17:26<12:31, 1.42s/it]
Training 1/1 epoch (loss 2.6094): 58%|ββββββ | 720/1250 [17:27<12:31, 1.42s/it]
Training 1/1 epoch (loss 2.6094): 58%|ββββββ | 721/1250 [17:27<12:06, 1.37s/it]
Training 1/1 epoch (loss 2.6391): 58%|ββββββ | 721/1250 [17:29<12:06, 1.37s/it]
Training 1/1 epoch (loss 2.6391): 58%|ββββββ | 722/1250 [17:29<13:31, 1.54s/it]
Training 1/1 epoch (loss 2.4534): 58%|ββββββ | 722/1250 [17:30<13:31, 1.54s/it]
Training 1/1 epoch (loss 2.4534): 58%|ββββββ | 723/1250 [17:30<12:36, 1.43s/it]
Training 1/1 epoch (loss 2.6166): 58%|ββββββ | 723/1250 [17:32<12:36, 1.43s/it]
Training 1/1 epoch (loss 2.6166): 58%|ββββββ | 724/1250 [17:32<13:50, 1.58s/it]
Training 1/1 epoch (loss 2.9780): 58%|ββββββ | 724/1250 [17:34<13:50, 1.58s/it]
Training 1/1 epoch (loss 2.9780): 58%|ββββββ | 725/1250 [17:34<14:10, 1.62s/it]
Training 1/1 epoch (loss 2.6208): 58%|ββββββ | 725/1250 [17:34<14:10, 1.62s/it]
Training 1/1 epoch (loss 2.6208): 58%|ββββββ | 726/1250 [17:34<11:44, 1.34s/it]
Training 1/1 epoch (loss 2.5828): 58%|ββββββ | 726/1250 [17:36<11:44, 1.34s/it]
Training 1/1 epoch (loss 2.5828): 58%|ββββββ | 727/1250 [17:36<12:32, 1.44s/it]
Training 1/1 epoch (loss 2.5279): 58%|ββββββ | 727/1250 [17:39<12:32, 1.44s/it]
Training 1/1 epoch (loss 2.5279): 58%|ββββββ | 728/1250 [17:39<16:02, 1.84s/it]
Training 1/1 epoch (loss 2.7652): 58%|ββββββ | 728/1250 [17:40<16:02, 1.84s/it]
Training 1/1 epoch (loss 2.7652): 58%|ββββββ | 729/1250 [17:40<13:03, 1.50s/it]
Training 1/1 epoch (loss 2.7808): 58%|ββββββ | 729/1250 [17:42<13:03, 1.50s/it]
Training 1/1 epoch (loss 2.7808): 58%|ββββββ | 730/1250 [17:42<15:25, 1.78s/it]
Training 1/1 epoch (loss 2.6770): 58%|ββββββ | 730/1250 [17:43<15:25, 1.78s/it]
Training 1/1 epoch (loss 2.6770): 58%|ββββββ | 731/1250 [17:43<14:06, 1.63s/it]
Training 1/1 epoch (loss 2.4044): 58%|ββββββ | 731/1250 [17:44<14:06, 1.63s/it]
Training 1/1 epoch (loss 2.4044): 59%|ββββββ | 732/1250 [17:44<12:06, 1.40s/it]
Training 1/1 epoch (loss 2.5580): 59%|ββββββ | 732/1250 [17:46<12:06, 1.40s/it]
Training 1/1 epoch (loss 2.5580): 59%|ββββββ | 733/1250 [17:46<12:52, 1.49s/it]
Training 1/1 epoch (loss 2.6852): 59%|ββββββ | 733/1250 [17:47<12:52, 1.49s/it]
Training 1/1 epoch (loss 2.6852): 59%|ββββββ | 734/1250 [17:47<13:16, 1.54s/it]
Training 1/1 epoch (loss 2.6075): 59%|ββββββ | 734/1250 [17:49<13:16, 1.54s/it]
Training 1/1 epoch (loss 2.6075): 59%|ββββββ | 735/1250 [17:49<13:28, 1.57s/it]
Training 1/1 epoch (loss 2.8567): 59%|ββββββ | 735/1250 [17:51<13:28, 1.57s/it]
Training 1/1 epoch (loss 2.8567): 59%|ββββββ | 736/1250 [17:51<14:14, 1.66s/it]
Training 1/1 epoch (loss 2.5498): 59%|ββββββ | 736/1250 [17:52<14:14, 1.66s/it]
Training 1/1 epoch (loss 2.5498): 59%|ββββββ | 737/1250 [17:52<12:28, 1.46s/it]
Training 1/1 epoch (loss 2.6243): 59%|ββββββ | 737/1250 [17:54<12:28, 1.46s/it]
Training 1/1 epoch (loss 2.6243): 59%|ββββββ | 738/1250 [17:54<13:05, 1.53s/it]
Training 1/1 epoch (loss 2.7106): 59%|ββββββ | 738/1250 [17:56<13:05, 1.53s/it]
Training 1/1 epoch (loss 2.7106): 59%|ββββββ | 739/1250 [17:56<13:46, 1.62s/it]
Training 1/1 epoch (loss 2.6395): 59%|ββββββ | 739/1250 [17:56<13:46, 1.62s/it]
Training 1/1 epoch (loss 2.6395): 59%|ββββββ | 740/1250 [17:56<11:13, 1.32s/it]
Training 1/1 epoch (loss 2.6119): 59%|ββββββ | 740/1250 [17:59<11:13, 1.32s/it]
Training 1/1 epoch (loss 2.6119): 59%|ββββββ | 741/1250 [17:59<14:08, 1.67s/it]
Training 1/1 epoch (loss 2.7640): 59%|ββββββ | 741/1250 [18:00<14:08, 1.67s/it]
Training 1/1 epoch (loss 2.7640): 59%|ββββββ | 742/1250 [18:00<14:10, 1.67s/it]
Training 1/1 epoch (loss 2.6974): 59%|ββββββ | 742/1250 [18:01<14:10, 1.67s/it]
Training 1/1 epoch (loss 2.6974): 59%|ββββββ | 743/1250 [18:01<12:43, 1.51s/it]
Training 1/1 epoch (loss 2.7708): 59%|ββββββ | 743/1250 [18:04<12:43, 1.51s/it]
Training 1/1 epoch (loss 2.7708): 60%|ββββββ | 744/1250 [18:04<15:30, 1.84s/it]
Training 1/1 epoch (loss 2.6775): 60%|ββββββ | 744/1250 [18:05<15:30, 1.84s/it]
Training 1/1 epoch (loss 2.6775): 60%|ββββββ | 745/1250 [18:05<12:47, 1.52s/it]
Training 1/1 epoch (loss 2.7355): 60%|ββββββ | 745/1250 [18:07<12:47, 1.52s/it]
Training 1/1 epoch (loss 2.7355): 60%|ββββββ | 746/1250 [18:07<13:47, 1.64s/it]
Training 1/1 epoch (loss 2.5513): 60%|ββββββ | 746/1250 [18:08<13:47, 1.64s/it]
Training 1/1 epoch (loss 2.5513): 60%|ββββββ | 747/1250 [18:08<13:27, 1.61s/it]
Training 1/1 epoch (loss 2.7434): 60%|ββββββ | 747/1250 [18:09<13:27, 1.61s/it]
Training 1/1 epoch (loss 2.7434): 60%|ββββββ | 748/1250 [18:09<10:55, 1.31s/it]
Training 1/1 epoch (loss 2.5549): 60%|ββββββ | 748/1250 [18:11<10:55, 1.31s/it]
Training 1/1 epoch (loss 2.5549): 60%|ββββββ | 749/1250 [18:11<13:34, 1.63s/it]
Training 1/1 epoch (loss 2.7719): 60%|ββββββ | 749/1250 [18:14<13:34, 1.63s/it]
Training 1/1 epoch (loss 2.7719): 60%|ββββββ | 750/1250 [18:14<15:08, 1.82s/it]
Training 1/1 epoch (loss 2.4958): 60%|ββββββ | 750/1250 [18:14<15:08, 1.82s/it]
Training 1/1 epoch (loss 2.4958): 60%|ββββββ | 751/1250 [18:14<12:05, 1.45s/it]
Training 1/1 epoch (loss 2.7137): 60%|ββββββ | 751/1250 [18:16<12:05, 1.45s/it]
Training 1/1 epoch (loss 2.7137): 60%|ββββββ | 752/1250 [18:16<14:22, 1.73s/it]
Training 1/1 epoch (loss 2.8674): 60%|ββββββ | 752/1250 [18:18<14:22, 1.73s/it]
Training 1/1 epoch (loss 2.8674): 60%|ββββββ | 753/1250 [18:18<13:03, 1.58s/it]
Training 1/1 epoch (loss 2.8685): 60%|ββββββ | 753/1250 [18:19<13:03, 1.58s/it]
Training 1/1 epoch (loss 2.8685): 60%|ββββββ | 754/1250 [18:19<12:21, 1.49s/it]
Training 1/1 epoch (loss 2.6901): 60%|ββββββ | 754/1250 [18:21<12:21, 1.49s/it]
Training 1/1 epoch (loss 2.6901): 60%|ββββββ | 755/1250 [18:21<13:40, 1.66s/it]
Training 1/1 epoch (loss 2.9012): 60%|ββββββ | 755/1250 [18:22<13:40, 1.66s/it]
Training 1/1 epoch (loss 2.9012): 60%|ββββββ | 756/1250 [18:22<11:44, 1.43s/it]
Training 1/1 epoch (loss 2.6917): 60%|ββββββ | 756/1250 [18:23<11:44, 1.43s/it]
Training 1/1 epoch (loss 2.6917): 61%|ββββββ | 757/1250 [18:23<10:23, 1.26s/it]
Training 1/1 epoch (loss 2.6225): 61%|ββββββ | 757/1250 [18:24<10:23, 1.26s/it]
Training 1/1 epoch (loss 2.6225): 61%|ββββββ | 758/1250 [18:24<09:46, 1.19s/it]
Training 1/1 epoch (loss 2.6599): 61%|ββββββ | 758/1250 [18:25<09:46, 1.19s/it]
Training 1/1 epoch (loss 2.6599): 61%|ββββββ | 759/1250 [18:25<09:57, 1.22s/it]
Training 1/1 epoch (loss 2.7252): 61%|ββββββ | 759/1250 [18:27<09:57, 1.22s/it]
Training 1/1 epoch (loss 2.7252): 61%|ββββββ | 760/1250 [18:27<10:42, 1.31s/it]
Training 1/1 epoch (loss 2.7934): 61%|ββββββ | 760/1250 [18:28<10:42, 1.31s/it]
Training 1/1 epoch (loss 2.7934): 61%|ββββββ | 761/1250 [18:28<10:52, 1.33s/it]
Training 1/1 epoch (loss 2.5667): 61%|ββββββ | 761/1250 [18:28<10:52, 1.33s/it]
Training 1/1 epoch (loss 2.5667): 61%|ββββββ | 762/1250 [18:28<08:44, 1.07s/it]
Training 1/1 epoch (loss 2.5832): 61%|ββββββ | 762/1250 [18:30<08:44, 1.07s/it]
Training 1/1 epoch (loss 2.5832): 61%|ββββββ | 763/1250 [18:30<10:12, 1.26s/it]
Training 1/1 epoch (loss 2.7112): 61%|ββββββ | 763/1250 [18:32<10:12, 1.26s/it]
Training 1/1 epoch (loss 2.7112): 61%|ββββββ | 764/1250 [18:32<11:42, 1.45s/it]
Training 1/1 epoch (loss 2.7594): 61%|ββββββ | 764/1250 [18:32<11:42, 1.45s/it]
Training 1/1 epoch (loss 2.7594): 61%|ββββββ | 765/1250 [18:32<09:05, 1.12s/it]
Training 1/1 epoch (loss 2.4180): 61%|ββββββ | 765/1250 [18:35<09:05, 1.12s/it]
Training 1/1 epoch (loss 2.4180): 61%|βββββββ | 766/1250 [18:35<12:17, 1.52s/it]
Training 1/1 epoch (loss 2.4931): 61%|βββββββ | 766/1250 [18:37<12:17, 1.52s/it]
Training 1/1 epoch (loss 2.4931): 61%|βββββββ | 767/1250 [18:37<13:28, 1.67s/it]
Training 1/1 epoch (loss 2.7024): 61%|βββββββ | 767/1250 [18:38<13:28, 1.67s/it]
Training 1/1 epoch (loss 2.7024): 61%|βββββββ | 768/1250 [18:38<11:25, 1.42s/it]
Training 1/1 epoch (loss 2.8654): 61%|βββββββ | 768/1250 [18:39<11:25, 1.42s/it]
Training 1/1 epoch (loss 2.8654): 62%|βββββββ | 769/1250 [18:39<11:25, 1.43s/it]
Training 1/1 epoch (loss 2.7055): 62%|βββββββ | 769/1250 [18:40<11:25, 1.43s/it]
Training 1/1 epoch (loss 2.7055): 62%|βββββββ | 770/1250 [18:40<10:40, 1.34s/it]
Training 1/1 epoch (loss 2.7548): 62%|βββββββ | 770/1250 [18:41<10:40, 1.34s/it]
Training 1/1 epoch (loss 2.7548): 62%|βββββββ | 771/1250 [18:41<10:11, 1.28s/it]
Training 1/1 epoch (loss 2.7491): 62%|βββββββ | 771/1250 [18:43<10:11, 1.28s/it]
Training 1/1 epoch (loss 2.7491): 62%|βββββββ | 772/1250 [18:43<10:28, 1.31s/it]
Training 1/1 epoch (loss 2.6481): 62%|βββββββ | 772/1250 [18:44<10:28, 1.31s/it]
Training 1/1 epoch (loss 2.6481): 62%|βββββββ | 773/1250 [18:44<10:29, 1.32s/it]
Training 1/1 epoch (loss 2.6286): 62%|βββββββ | 773/1250 [18:45<10:29, 1.32s/it]
Training 1/1 epoch (loss 2.6286): 62%|βββββββ | 774/1250 [18:45<10:00, 1.26s/it]
Training 1/1 epoch (loss 2.7683): 62%|βββββββ | 774/1250 [18:48<10:00, 1.26s/it]
Training 1/1 epoch (loss 2.7683): 62%|βββββββ | 775/1250 [18:48<12:12, 1.54s/it]
Training 1/1 epoch (loss 2.5821): 62%|βββββββ | 775/1250 [18:48<12:12, 1.54s/it]
Training 1/1 epoch (loss 2.5821): 62%|βββββββ | 776/1250 [18:48<10:31, 1.33s/it]
Training 1/1 epoch (loss 2.7377): 62%|βββββββ | 776/1250 [18:50<10:31, 1.33s/it]
Training 1/1 epoch (loss 2.7377): 62%|βββββββ | 777/1250 [18:50<11:47, 1.50s/it]
Training 1/1 epoch (loss 2.7868): 62%|βββββββ | 777/1250 [18:52<11:47, 1.50s/it]
Training 1/1 epoch (loss 2.7868): 62%|βββββββ | 778/1250 [18:52<12:12, 1.55s/it]
Training 1/1 epoch (loss 2.6263): 62%|βββββββ | 778/1250 [18:53<12:12, 1.55s/it]
Training 1/1 epoch (loss 2.6263): 62%|βββββββ | 779/1250 [18:53<10:15, 1.31s/it]
Training 1/1 epoch (loss 2.9277): 62%|βββββββ | 779/1250 [18:55<10:15, 1.31s/it]
Training 1/1 epoch (loss 2.9277): 62%|βββββββ | 780/1250 [18:55<11:45, 1.50s/it]
Training 1/1 epoch (loss 2.5390): 62%|βββββββ | 780/1250 [18:55<11:45, 1.50s/it]
Training 1/1 epoch (loss 2.5390): 62%|βββββββ | 781/1250 [18:55<10:14, 1.31s/it]
Training 1/1 epoch (loss 2.7323): 62%|βββββββ | 781/1250 [18:57<10:14, 1.31s/it]
Training 1/1 epoch (loss 2.7323): 63%|βββββββ | 782/1250 [18:57<11:33, 1.48s/it]
Training 1/1 epoch (loss 2.7287): 63%|βββββββ | 782/1250 [18:59<11:33, 1.48s/it]
Training 1/1 epoch (loss 2.7287): 63%|βββββββ | 783/1250 [18:59<11:58, 1.54s/it]
Training 1/1 epoch (loss 2.5468): 63%|βββββββ | 783/1250 [19:00<11:58, 1.54s/it]
Training 1/1 epoch (loss 2.5468): 63%|βββββββ | 784/1250 [19:00<10:31, 1.36s/it]
Training 1/1 epoch (loss 2.8516): 63%|βββββββ | 784/1250 [19:02<10:31, 1.36s/it]
Training 1/1 epoch (loss 2.8516): 63%|βββββββ | 785/1250 [19:02<13:03, 1.68s/it]
Training 1/1 epoch (loss 2.7413): 63%|βββββββ | 785/1250 [19:04<13:03, 1.68s/it]
Training 1/1 epoch (loss 2.7413): 63%|βββββββ | 786/1250 [19:04<13:06, 1.69s/it]
Training 1/1 epoch (loss 2.7555): 63%|βββββββ | 786/1250 [19:05<13:06, 1.69s/it]
Training 1/1 epoch (loss 2.7555): 63%|βββββββ | 787/1250 [19:05<12:11, 1.58s/it]
Training 1/1 epoch (loss 2.5185): 63%|βββββββ | 787/1250 [19:07<12:11, 1.58s/it]
Training 1/1 epoch (loss 2.5185): 63%|βββββββ | 788/1250 [19:07<10:58, 1.43s/it]
Training 1/1 epoch (loss 2.4803): 63%|βββββββ | 788/1250 [19:07<10:58, 1.43s/it]
Training 1/1 epoch (loss 2.4803): 63%|βββββββ | 789/1250 [19:07<09:38, 1.25s/it]
Training 1/1 epoch (loss 2.5914): 63%|βββββββ | 789/1250 [19:09<09:38, 1.25s/it]
Training 1/1 epoch (loss 2.5914): 63%|βββββββ | 790/1250 [19:09<09:57, 1.30s/it]
Training 1/1 epoch (loss 2.6958): 63%|βββββββ | 790/1250 [19:10<09:57, 1.30s/it]
Training 1/1 epoch (loss 2.6958): 63%|βββββββ | 791/1250 [19:10<10:17, 1.34s/it]
Training 1/1 epoch (loss 2.6997): 63%|βββββββ | 791/1250 [19:11<10:17, 1.34s/it]
Training 1/1 epoch (loss 2.6997): 63%|βββββββ | 792/1250 [19:11<09:48, 1.29s/it]
Training 1/1 epoch (loss 2.4924): 63%|βββββββ | 792/1250 [19:13<09:48, 1.29s/it]
Training 1/1 epoch (loss 2.4924): 63%|βββββββ | 793/1250 [19:13<11:14, 1.48s/it]
Training 1/1 epoch (loss 2.6961): 63%|βββββββ | 793/1250 [19:14<11:14, 1.48s/it]
Training 1/1 epoch (loss 2.6961): 64%|βββββββ | 794/1250 [19:14<09:27, 1.24s/it]
Training 1/1 epoch (loss 2.8206): 64%|βββββββ | 794/1250 [19:16<09:27, 1.24s/it]
Training 1/1 epoch (loss 2.8206): 64%|βββββββ | 795/1250 [19:16<10:47, 1.42s/it]
Training 1/1 epoch (loss 2.6409): 64%|βββββββ | 795/1250 [19:18<10:47, 1.42s/it]
Training 1/1 epoch (loss 2.6409): 64%|βββββββ | 796/1250 [19:18<12:00, 1.59s/it]
Training 1/1 epoch (loss 2.5993): 64%|βββββββ | 796/1250 [19:18<12:00, 1.59s/it]
Training 1/1 epoch (loss 2.5993): 64%|βββββββ | 797/1250 [19:18<09:33, 1.27s/it]
Training 1/1 epoch (loss 2.8178): 64%|βββββββ | 797/1250 [19:19<09:33, 1.27s/it]
Training 1/1 epoch (loss 2.8178): 64%|βββββββ | 798/1250 [19:19<08:26, 1.12s/it]
Training 1/1 epoch (loss 2.7899): 64%|βββββββ | 798/1250 [19:21<08:26, 1.12s/it]
Training 1/1 epoch (loss 2.7899): 64%|βββββββ | 799/1250 [19:21<10:57, 1.46s/it]
Training 1/1 epoch (loss 2.7329): 64%|βββββββ | 799/1250 [19:22<10:57, 1.46s/it]
Training 1/1 epoch (loss 2.7329): 64%|βββββββ | 800/1250 [19:22<09:04, 1.21s/it]
Training 1/1 epoch (loss 2.6719): 64%|βββββββ | 800/1250 [19:24<09:04, 1.21s/it]
Training 1/1 epoch (loss 2.6719): 64%|βββββββ | 801/1250 [19:24<11:48, 1.58s/it]
Training 1/1 epoch (loss 2.8071): 64%|βββββββ | 801/1250 [19:26<11:48, 1.58s/it]
Training 1/1 epoch (loss 2.8071): 64%|βββββββ | 802/1250 [19:26<11:45, 1.57s/it]
Training 1/1 epoch (loss 2.6972): 64%|βββββββ | 802/1250 [19:27<11:45, 1.57s/it]
Training 1/1 epoch (loss 2.6972): 64%|βββββββ | 803/1250 [19:27<09:47, 1.31s/it]
Training 1/1 epoch (loss 2.5904): 64%|βββββββ | 803/1250 [19:29<09:47, 1.31s/it]
Training 1/1 epoch (loss 2.5904): 64%|βββββββ | 804/1250 [19:29<11:43, 1.58s/it]
Training 1/1 epoch (loss 2.6998): 64%|βββββββ | 804/1250 [19:30<11:43, 1.58s/it]
Training 1/1 epoch (loss 2.6998): 64%|βββββββ | 805/1250 [19:30<10:53, 1.47s/it]
Training 1/1 epoch (loss 2.6776): 64%|βββββββ | 805/1250 [19:31<10:53, 1.47s/it]
Training 1/1 epoch (loss 2.6776): 64%|βββββββ | 806/1250 [19:31<10:14, 1.38s/it]
Training 1/1 epoch (loss 2.6948): 64%|βββββββ | 806/1250 [19:33<10:14, 1.38s/it]
Training 1/1 epoch (loss 2.6948): 65%|βββββββ | 807/1250 [19:33<11:06, 1.51s/it]
Training 1/1 epoch (loss 2.5902): 65%|βββββββ | 807/1250 [19:35<11:06, 1.51s/it]
Training 1/1 epoch (loss 2.5902): 65%|βββββββ | 808/1250 [19:35<10:59, 1.49s/it]
Training 1/1 epoch (loss 2.6219): 65%|βββββββ | 808/1250 [19:35<10:59, 1.49s/it]
Training 1/1 epoch (loss 2.6219): 65%|βββββββ | 809/1250 [19:35<09:25, 1.28s/it]
Training 1/1 epoch (loss 2.7517): 65%|βββββββ | 809/1250 [19:37<09:25, 1.28s/it]
Training 1/1 epoch (loss 2.7517): 65%|βββββββ | 810/1250 [19:37<10:35, 1.44s/it]
Training 1/1 epoch (loss 2.6864): 65%|βββββββ | 810/1250 [19:38<10:35, 1.44s/it]
Training 1/1 epoch (loss 2.6864): 65%|βββββββ | 811/1250 [19:38<09:20, 1.28s/it]
Training 1/1 epoch (loss 2.6373): 65%|βββββββ | 811/1250 [19:39<09:20, 1.28s/it]
Training 1/1 epoch (loss 2.6373): 65%|βββββββ | 812/1250 [19:39<09:34, 1.31s/it]
Training 1/1 epoch (loss 2.6046): 65%|βββββββ | 812/1250 [19:41<09:34, 1.31s/it]
Training 1/1 epoch (loss 2.6046): 65%|βββββββ | 813/1250 [19:41<10:48, 1.48s/it]
Training 1/1 epoch (loss 2.7451): 65%|βββββββ | 813/1250 [19:43<10:48, 1.48s/it]
Training 1/1 epoch (loss 2.7451): 65%|βββββββ | 814/1250 [19:43<10:13, 1.41s/it]
Training 1/1 epoch (loss 2.7245): 65%|βββββββ | 814/1250 [19:44<10:13, 1.41s/it]
Training 1/1 epoch (loss 2.7245): 65%|βββββββ | 815/1250 [19:44<10:45, 1.48s/it]
Training 1/1 epoch (loss 2.8191): 65%|βββββββ | 815/1250 [19:46<10:45, 1.48s/it]
Training 1/1 epoch (loss 2.8191): 65%|βββββββ | 816/1250 [19:46<12:14, 1.69s/it]
Training 1/1 epoch (loss 2.7213): 65%|βββββββ | 816/1250 [19:47<12:14, 1.69s/it]
Training 1/1 epoch (loss 2.7213): 65%|βββββββ | 817/1250 [19:47<09:42, 1.35s/it]
Training 1/1 epoch (loss 2.5633): 65%|βββββββ | 817/1250 [19:48<09:42, 1.35s/it]
Training 1/1 epoch (loss 2.5633): 65%|βββββββ | 818/1250 [19:48<10:00, 1.39s/it]
Training 1/1 epoch (loss 2.8654): 65%|βββββββ | 818/1250 [19:50<10:00, 1.39s/it]
Training 1/1 epoch (loss 2.8654): 66%|βββββββ | 819/1250 [19:50<09:52, 1.37s/it]
Training 1/1 epoch (loss 2.6678): 66%|βββββββ | 819/1250 [19:51<09:52, 1.37s/it]
Training 1/1 epoch (loss 2.6678): 66%|βββββββ | 820/1250 [19:51<09:29, 1.32s/it]
Training 1/1 epoch (loss 2.5536): 66%|βββββββ | 820/1250 [19:53<09:29, 1.32s/it]
Training 1/1 epoch (loss 2.5536): 66%|βββββββ | 821/1250 [19:53<11:12, 1.57s/it]
Training 1/1 epoch (loss 2.5230): 66%|βββββββ | 821/1250 [19:54<11:12, 1.57s/it]
Training 1/1 epoch (loss 2.5230): 66%|βββββββ | 822/1250 [19:54<09:56, 1.39s/it]
Training 1/1 epoch (loss 2.5456): 66%|βββββββ | 822/1250 [19:56<09:56, 1.39s/it]
Training 1/1 epoch (loss 2.5456): 66%|βββββββ | 823/1250 [19:56<11:11, 1.57s/it]
Training 1/1 epoch (loss 2.5544): 66%|βββββββ | 823/1250 [19:59<11:11, 1.57s/it]
Training 1/1 epoch (loss 2.5544): 66%|βββββββ | 824/1250 [19:59<13:06, 1.85s/it]
Training 1/1 epoch (loss 2.6037): 66%|βββββββ | 824/1250 [19:59<13:06, 1.85s/it]
Training 1/1 epoch (loss 2.6037): 66%|βββββββ | 825/1250 [19:59<10:43, 1.51s/it]
Training 1/1 epoch (loss 2.8248): 66%|βββββββ | 825/1250 [20:02<10:43, 1.51s/it]
Training 1/1 epoch (loss 2.8248): 66%|βββββββ | 826/1250 [20:02<12:38, 1.79s/it]
Training 1/1 epoch (loss 2.7406): 66%|βββββββ | 826/1250 [20:03<12:38, 1.79s/it]
Training 1/1 epoch (loss 2.7406): 66%|βββββββ | 827/1250 [20:03<10:45, 1.53s/it]
Training 1/1 epoch (loss 2.6210): 66%|βββββββ | 827/1250 [20:04<10:45, 1.53s/it]
Training 1/1 epoch (loss 2.6210): 66%|βββββββ | 828/1250 [20:04<10:03, 1.43s/it]
Training 1/1 epoch (loss 2.6626): 66%|βββββββ | 828/1250 [20:06<10:03, 1.43s/it]
Training 1/1 epoch (loss 2.6626): 66%|βββββββ | 829/1250 [20:06<11:17, 1.61s/it]
Training 1/1 epoch (loss 2.5908): 66%|βββββββ | 829/1250 [20:07<11:17, 1.61s/it]
Training 1/1 epoch (loss 2.5908): 66%|βββββββ | 830/1250 [20:07<09:36, 1.37s/it]
Training 1/1 epoch (loss 2.6566): 66%|βββββββ | 830/1250 [20:09<09:36, 1.37s/it]
Training 1/1 epoch (loss 2.6566): 66%|βββββββ | 831/1250 [20:09<11:07, 1.59s/it]
Training 1/1 epoch (loss 2.8076): 66%|βββββββ | 831/1250 [20:11<11:07, 1.59s/it]
Training 1/1 epoch (loss 2.8076): 67%|βββββββ | 832/1250 [20:11<12:54, 1.85s/it]
Training 1/1 epoch (loss 2.6334): 67%|βββββββ | 832/1250 [20:12<12:54, 1.85s/it]
Training 1/1 epoch (loss 2.6334): 67%|βββββββ | 833/1250 [20:12<10:24, 1.50s/it]
Training 1/1 epoch (loss 2.7484): 67%|βββββββ | 833/1250 [20:14<10:24, 1.50s/it]
Training 1/1 epoch (loss 2.7484): 67%|βββββββ | 834/1250 [20:14<11:24, 1.65s/it]
Training 1/1 epoch (loss 2.9280): 67%|βββββββ | 834/1250 [20:15<11:24, 1.65s/it]
Training 1/1 epoch (loss 2.9280): 67%|βββββββ | 835/1250 [20:15<10:28, 1.52s/it]
Training 1/1 epoch (loss 2.7084): 67%|βββββββ | 835/1250 [20:16<10:28, 1.52s/it]
Training 1/1 epoch (loss 2.7084): 67%|βββββββ | 836/1250 [20:16<09:04, 1.32s/it]
Training 1/1 epoch (loss 2.6488): 67%|βββββββ | 836/1250 [20:17<09:04, 1.32s/it]
Training 1/1 epoch (loss 2.6488): 67%|βββββββ | 837/1250 [20:17<08:45, 1.27s/it]
Training 1/1 epoch (loss 2.7718): 67%|βββββββ | 837/1250 [20:19<08:45, 1.27s/it]
Training 1/1 epoch (loss 2.7718): 67%|βββββββ | 838/1250 [20:19<09:17, 1.35s/it]
Training 1/1 epoch (loss 2.7501): 67%|βββββββ | 838/1250 [20:20<09:17, 1.35s/it]
Training 1/1 epoch (loss 2.7501): 67%|βββββββ | 839/1250 [20:20<09:05, 1.33s/it]
Training 1/1 epoch (loss 2.4763): 67%|βββββββ | 839/1250 [20:22<09:05, 1.33s/it]
Training 1/1 epoch (loss 2.4763): 67%|βββββββ | 840/1250 [20:22<10:03, 1.47s/it]
Training 1/1 epoch (loss 2.6610): 67%|βββββββ | 840/1250 [20:23<10:03, 1.47s/it]
Training 1/1 epoch (loss 2.6610): 67%|βββββββ | 841/1250 [20:23<08:42, 1.28s/it]
Training 1/1 epoch (loss 2.7635): 67%|βββββββ | 841/1250 [20:25<08:42, 1.28s/it]
Training 1/1 epoch (loss 2.7635): 67%|βββββββ | 842/1250 [20:25<10:10, 1.50s/it]
Training 1/1 epoch (loss 2.6070): 67%|βββββββ | 842/1250 [20:27<10:10, 1.50s/it]
Training 1/1 epoch (loss 2.6070): 67%|βββββββ | 843/1250 [20:27<11:07, 1.64s/it]
Training 1/1 epoch (loss 2.7478): 67%|βββββββ | 843/1250 [20:27<11:07, 1.64s/it]
Training 1/1 epoch (loss 2.7478): 68%|βββββββ | 844/1250 [20:27<08:53, 1.31s/it]
Training 1/1 epoch (loss 2.6853): 68%|βββββββ | 844/1250 [20:30<08:53, 1.31s/it]
Training 1/1 epoch (loss 2.6853): 68%|βββββββ | 845/1250 [20:30<11:12, 1.66s/it]
Training 1/1 epoch (loss 2.7749): 68%|βββββββ | 845/1250 [20:32<11:12, 1.66s/it]
Training 1/1 epoch (loss 2.7749): 68%|βββββββ | 846/1250 [20:32<11:44, 1.74s/it]
Training 1/1 epoch (loss 2.5871): 68%|βββββββ | 846/1250 [20:32<11:44, 1.74s/it]
Training 1/1 epoch (loss 2.5871): 68%|βββββββ | 847/1250 [20:32<09:12, 1.37s/it]
Training 1/1 epoch (loss 2.6719): 68%|βββββββ | 847/1250 [20:35<09:12, 1.37s/it]
Training 1/1 epoch (loss 2.6719): 68%|βββββββ | 848/1250 [20:35<12:04, 1.80s/it]
Training 1/1 epoch (loss 2.7285): 68%|βββββββ | 848/1250 [20:36<12:04, 1.80s/it]
Training 1/1 epoch (loss 2.7285): 68%|βββββββ | 849/1250 [20:36<11:00, 1.65s/it]
Training 1/1 epoch (loss 2.8668): 68%|βββββββ | 849/1250 [20:37<11:00, 1.65s/it]
Training 1/1 epoch (loss 2.8668): 68%|βββββββ | 850/1250 [20:37<09:47, 1.47s/it]
Training 1/1 epoch (loss 2.6040): 68%|βββββββ | 850/1250 [20:40<09:47, 1.47s/it]
Training 1/1 epoch (loss 2.6040): 68%|βββββββ | 851/1250 [20:40<11:53, 1.79s/it]
Training 1/1 epoch (loss 2.5228): 68%|βββββββ | 851/1250 [20:41<11:53, 1.79s/it]
Training 1/1 epoch (loss 2.5228): 68%|βββββββ | 852/1250 [20:41<10:40, 1.61s/it]
Training 1/1 epoch (loss 2.8685): 68%|βββββββ | 852/1250 [20:42<10:40, 1.61s/it]
Training 1/1 epoch (loss 2.8685): 68%|βββββββ | 853/1250 [20:42<09:01, 1.36s/it]
Training 1/1 epoch (loss 2.5838): 68%|βββββββ | 853/1250 [20:44<09:01, 1.36s/it]
Training 1/1 epoch (loss 2.5838): 68%|βββββββ | 854/1250 [20:44<09:58, 1.51s/it]
Training 1/1 epoch (loss 2.8192): 68%|βββββββ | 854/1250 [20:45<09:58, 1.51s/it]
Training 1/1 epoch (loss 2.8192): 68%|βββββββ | 855/1250 [20:45<09:42, 1.47s/it]
Training 1/1 epoch (loss 2.4231): 68%|βββββββ | 855/1250 [20:47<09:42, 1.47s/it]
Training 1/1 epoch (loss 2.4231): 68%|βββββββ | 856/1250 [20:47<11:15, 1.72s/it]
Training 1/1 epoch (loss 2.7046): 68%|βββββββ | 856/1250 [20:50<11:15, 1.72s/it]
Training 1/1 epoch (loss 2.7046): 69%|βββββββ | 857/1250 [20:50<12:44, 1.95s/it]
Training 1/1 epoch (loss 2.6511): 69%|βββββββ | 857/1250 [20:50<12:44, 1.95s/it]
Training 1/1 epoch (loss 2.6511): 69%|βββββββ | 858/1250 [20:50<09:40, 1.48s/it]
Training 1/1 epoch (loss 2.6272): 69%|βββββββ | 858/1250 [20:52<09:40, 1.48s/it]
Training 1/1 epoch (loss 2.6272): 69%|βββββββ | 859/1250 [20:52<09:42, 1.49s/it]
Training 1/1 epoch (loss 2.6787): 69%|βββββββ | 859/1250 [20:53<09:42, 1.49s/it]
Training 1/1 epoch (loss 2.6787): 69%|βββββββ | 860/1250 [20:53<08:46, 1.35s/it]
Training 1/1 epoch (loss 2.7306): 69%|βββββββ | 860/1250 [20:53<08:46, 1.35s/it]
Training 1/1 epoch (loss 2.7306): 69%|βββββββ | 861/1250 [20:53<07:12, 1.11s/it]
Training 1/1 epoch (loss 2.6572): 69%|βββββββ | 861/1250 [20:54<07:12, 1.11s/it]
Training 1/1 epoch (loss 2.6572): 69%|βββββββ | 862/1250 [20:54<07:29, 1.16s/it]
Training 1/1 epoch (loss 2.6747): 69%|βββββββ | 862/1250 [20:56<07:29, 1.16s/it]
Training 1/1 epoch (loss 2.6747): 69%|βββββββ | 863/1250 [20:56<08:46, 1.36s/it]
Training 1/1 epoch (loss 2.6220): 69%|βββββββ | 863/1250 [20:57<08:46, 1.36s/it]
Training 1/1 epoch (loss 2.6220): 69%|βββββββ | 864/1250 [20:57<07:09, 1.11s/it]
Training 1/1 epoch (loss 2.6189): 69%|βββββββ | 864/1250 [20:58<07:09, 1.11s/it]
Training 1/1 epoch (loss 2.6189): 69%|βββββββ | 865/1250 [20:58<07:34, 1.18s/it]
Training 1/1 epoch (loss 2.5555): 69%|βββββββ | 865/1250 [20:59<07:34, 1.18s/it]
Training 1/1 epoch (loss 2.5555): 69%|βββββββ | 866/1250 [20:59<07:00, 1.10s/it]
Training 1/1 epoch (loss 2.7253): 69%|βββββββ | 866/1250 [21:00<07:00, 1.10s/it]
Training 1/1 epoch (loss 2.7253): 69%|βββββββ | 867/1250 [21:00<07:36, 1.19s/it]
Training 1/1 epoch (loss 2.8992): 69%|βββββββ | 867/1250 [21:02<07:36, 1.19s/it]
Training 1/1 epoch (loss 2.8992): 69%|βββββββ | 868/1250 [21:02<08:34, 1.35s/it]
Training 1/1 epoch (loss 2.5155): 69%|βββββββ | 868/1250 [21:03<08:34, 1.35s/it]
Training 1/1 epoch (loss 2.5155): 70%|βββββββ | 869/1250 [21:03<08:23, 1.32s/it]
Training 1/1 epoch (loss 2.5539): 70%|βββββββ | 869/1250 [21:04<08:23, 1.32s/it]
Training 1/1 epoch (loss 2.5539): 70%|βββββββ | 870/1250 [21:04<07:37, 1.20s/it]
Training 1/1 epoch (loss 2.5812): 70%|βββββββ | 870/1250 [21:06<07:37, 1.20s/it]
Training 1/1 epoch (loss 2.5812): 70%|βββββββ | 871/1250 [21:06<08:52, 1.41s/it]
Training 1/1 epoch (loss 2.6973): 70%|βββββββ | 871/1250 [21:07<08:52, 1.41s/it]
Training 1/1 epoch (loss 2.6973): 70%|βββββββ | 872/1250 [21:07<08:22, 1.33s/it]
Training 1/1 epoch (loss 2.8565): 70%|βββββββ | 872/1250 [21:09<08:22, 1.33s/it]
Training 1/1 epoch (loss 2.8565): 70%|βββββββ | 873/1250 [21:09<09:04, 1.45s/it]
Training 1/1 epoch (loss 2.6369): 70%|βββββββ | 873/1250 [21:11<09:04, 1.45s/it]
Training 1/1 epoch (loss 2.6369): 70%|βββββββ | 874/1250 [21:11<09:11, 1.47s/it]
Training 1/1 epoch (loss 2.6899): 70%|βββββββ | 874/1250 [21:11<09:11, 1.47s/it]
Training 1/1 epoch (loss 2.6899): 70%|βββββββ | 875/1250 [21:11<07:38, 1.22s/it]
Training 1/1 epoch (loss 2.7327): 70%|βββββββ | 875/1250 [21:13<07:38, 1.22s/it]
Training 1/1 epoch (loss 2.7327): 70%|βββββββ | 876/1250 [21:13<08:24, 1.35s/it]
Training 1/1 epoch (loss 2.7875): 70%|βββββββ | 876/1250 [21:15<08:24, 1.35s/it]
Training 1/1 epoch (loss 2.7875): 70%|βββββββ | 877/1250 [21:15<10:21, 1.67s/it]
Training 1/1 epoch (loss 2.8844): 70%|βββββββ | 877/1250 [21:16<10:21, 1.67s/it]
Training 1/1 epoch (loss 2.8844): 70%|βββββββ | 878/1250 [21:16<08:16, 1.33s/it]
Training 1/1 epoch (loss 2.7399): 70%|βββββββ | 878/1250 [21:18<08:16, 1.33s/it]
Training 1/1 epoch (loss 2.7399): 70%|βββββββ | 879/1250 [21:18<09:15, 1.50s/it]
Training 1/1 epoch (loss 2.8027): 70%|βββββββ | 879/1250 [21:20<09:15, 1.50s/it]
Training 1/1 epoch (loss 2.8027): 70%|βββββββ | 880/1250 [21:20<10:31, 1.71s/it]
Training 1/1 epoch (loss 2.5869): 70%|βββββββ | 880/1250 [21:21<10:31, 1.71s/it]
Training 1/1 epoch (loss 2.5869): 70%|βββββββ | 881/1250 [21:21<08:42, 1.42s/it]
Training 1/1 epoch (loss 2.7144): 70%|βββββββ | 881/1250 [21:23<08:42, 1.42s/it]
Training 1/1 epoch (loss 2.7144): 71%|βββββββ | 882/1250 [21:23<10:32, 1.72s/it]
Training 1/1 epoch (loss 2.8267): 71%|βββββββ | 882/1250 [21:25<10:32, 1.72s/it]
Training 1/1 epoch (loss 2.8267): 71%|βββββββ | 883/1250 [21:25<10:34, 1.73s/it]
Training 1/1 epoch (loss 2.7582): 71%|βββββββ | 883/1250 [21:26<10:34, 1.73s/it]
Training 1/1 epoch (loss 2.7582): 71%|βββββββ | 884/1250 [21:26<08:53, 1.46s/it]
Training 1/1 epoch (loss 2.6409): 71%|βββββββ | 884/1250 [21:28<08:53, 1.46s/it]
Training 1/1 epoch (loss 2.6409): 71%|βββββββ | 885/1250 [21:28<09:43, 1.60s/it]
Training 1/1 epoch (loss 2.7405): 71%|βββββββ | 885/1250 [21:29<09:43, 1.60s/it]
Training 1/1 epoch (loss 2.7405): 71%|βββββββ | 886/1250 [21:29<09:05, 1.50s/it]
Training 1/1 epoch (loss 2.6895): 71%|βββββββ | 886/1250 [21:30<09:05, 1.50s/it]
Training 1/1 epoch (loss 2.6895): 71%|βββββββ | 887/1250 [21:30<09:15, 1.53s/it]
Training 1/1 epoch (loss 2.6292): 71%|βββββββ | 887/1250 [21:32<09:15, 1.53s/it]
Training 1/1 epoch (loss 2.6292): 71%|βββββββ | 888/1250 [21:32<09:07, 1.51s/it]
Training 1/1 epoch (loss 2.6776): 71%|βββββββ | 888/1250 [21:33<09:07, 1.51s/it]
Training 1/1 epoch (loss 2.6776): 71%|βββββββ | 889/1250 [21:33<07:28, 1.24s/it]
Training 1/1 epoch (loss 2.6891): 71%|βββββββ | 889/1250 [21:34<07:28, 1.24s/it]
Training 1/1 epoch (loss 2.6891): 71%|βββββββ | 890/1250 [21:34<07:47, 1.30s/it]
Training 1/1 epoch (loss 2.8467): 71%|βββββββ | 890/1250 [21:36<07:47, 1.30s/it]
Training 1/1 epoch (loss 2.8467): 71%|ββββββββ | 891/1250 [21:36<09:06, 1.52s/it]
Training 1/1 epoch (loss 2.6832): 71%|ββββββββ | 891/1250 [21:37<09:06, 1.52s/it]
Training 1/1 epoch (loss 2.6832): 71%|ββββββββ | 892/1250 [21:37<07:18, 1.23s/it]
Training 1/1 epoch (loss 2.5314): 71%|ββββββββ | 892/1250 [21:38<07:18, 1.23s/it]
Training 1/1 epoch (loss 2.5314): 71%|ββββββββ | 893/1250 [21:38<08:24, 1.41s/it]
Training 1/1 epoch (loss 2.5042): 71%|ββββββββ | 893/1250 [21:40<08:24, 1.41s/it]
Training 1/1 epoch (loss 2.5042): 72%|ββββββββ | 894/1250 [21:40<08:57, 1.51s/it]
Training 1/1 epoch (loss 2.9929): 72%|ββββββββ | 894/1250 [21:41<08:57, 1.51s/it]
Training 1/1 epoch (loss 2.9929): 72%|ββββββββ | 895/1250 [21:41<07:54, 1.34s/it]
Training 1/1 epoch (loss 2.3838): 72%|ββββββββ | 895/1250 [21:43<07:54, 1.34s/it]
Training 1/1 epoch (loss 2.3838): 72%|ββββββββ | 896/1250 [21:43<08:40, 1.47s/it]
Training 1/1 epoch (loss 2.6608): 72%|ββββββββ | 896/1250 [21:44<08:40, 1.47s/it]
Training 1/1 epoch (loss 2.6608): 72%|ββββββββ | 897/1250 [21:44<08:08, 1.38s/it]
Training 1/1 epoch (loss 2.7744): 72%|ββββββββ | 897/1250 [21:45<08:08, 1.38s/it]
Training 1/1 epoch (loss 2.7744): 72%|ββββββββ | 898/1250 [21:45<07:09, 1.22s/it]
Training 1/1 epoch (loss 2.7042): 72%|ββββββββ | 898/1250 [21:47<07:09, 1.22s/it]
Training 1/1 epoch (loss 2.7042): 72%|ββββββββ | 899/1250 [21:47<07:53, 1.35s/it]
Training 1/1 epoch (loss 2.8693): 72%|ββββββββ | 899/1250 [21:47<07:53, 1.35s/it]
Training 1/1 epoch (loss 2.8693): 72%|ββββββββ | 900/1250 [21:47<06:56, 1.19s/it]
Training 1/1 epoch (loss 2.7175): 72%|ββββββββ | 900/1250 [21:49<06:56, 1.19s/it]
Training 1/1 epoch (loss 2.7175): 72%|ββββββββ | 901/1250 [21:49<08:01, 1.38s/it]
Training 1/1 epoch (loss 2.7248): 72%|ββββββββ | 901/1250 [21:51<08:01, 1.38s/it]
Training 1/1 epoch (loss 2.7248): 72%|ββββββββ | 902/1250 [21:51<08:51, 1.53s/it]
Training 1/1 epoch (loss 2.6919): 72%|ββββββββ | 902/1250 [21:52<08:51, 1.53s/it]
Training 1/1 epoch (loss 2.6919): 72%|ββββββββ | 903/1250 [21:52<06:58, 1.21s/it]
Training 1/1 epoch (loss 2.7169): 72%|ββββββββ | 903/1250 [21:54<06:58, 1.21s/it]
Training 1/1 epoch (loss 2.7169): 72%|ββββββββ | 904/1250 [21:54<08:37, 1.50s/it]
Training 1/1 epoch (loss 2.5795): 72%|ββββββββ | 904/1250 [21:56<08:37, 1.50s/it]
Training 1/1 epoch (loss 2.5795): 72%|ββββββββ | 905/1250 [21:56<09:30, 1.65s/it]
Training 1/1 epoch (loss 2.5783): 72%|ββββββββ | 905/1250 [21:56<09:30, 1.65s/it]
Training 1/1 epoch (loss 2.5783): 72%|ββββββββ | 906/1250 [21:56<07:44, 1.35s/it]
Training 1/1 epoch (loss 2.5576): 72%|ββββββββ | 906/1250 [21:58<07:44, 1.35s/it]
Training 1/1 epoch (loss 2.5576): 73%|ββββββββ | 907/1250 [21:58<08:20, 1.46s/it]
Training 1/1 epoch (loss 2.5391): 73%|ββββββββ | 907/1250 [22:00<08:20, 1.46s/it]
Training 1/1 epoch (loss 2.5391): 73%|ββββββββ | 908/1250 [22:00<08:41, 1.52s/it]
Training 1/1 epoch (loss 2.8032): 73%|ββββββββ | 908/1250 [22:00<08:41, 1.52s/it]
Training 1/1 epoch (loss 2.8032): 73%|ββββββββ | 909/1250 [22:00<06:54, 1.21s/it]
Training 1/1 epoch (loss 2.5881): 73%|ββββββββ | 909/1250 [22:03<06:54, 1.21s/it]
Training 1/1 epoch (loss 2.5881): 73%|ββββββββ | 910/1250 [22:03<08:55, 1.57s/it]
Training 1/1 epoch (loss 2.4540): 73%|ββββββββ | 910/1250 [22:04<08:55, 1.57s/it]
Training 1/1 epoch (loss 2.4540): 73%|ββββββββ | 911/1250 [22:04<08:54, 1.58s/it]
Training 1/1 epoch (loss 2.7098): 73%|ββββββββ | 911/1250 [22:06<08:54, 1.58s/it]
Training 1/1 epoch (loss 2.7098): 73%|ββββββββ | 912/1250 [22:06<08:20, 1.48s/it]
Training 1/1 epoch (loss 2.5207): 73%|ββββββββ | 912/1250 [22:07<08:20, 1.48s/it]
Training 1/1 epoch (loss 2.5207): 73%|ββββββββ | 913/1250 [22:07<08:47, 1.57s/it]
Training 1/1 epoch (loss 2.6465): 73%|ββββββββ | 913/1250 [22:08<08:47, 1.57s/it]
Training 1/1 epoch (loss 2.6465): 73%|ββββββββ | 914/1250 [22:08<08:06, 1.45s/it]
Training 1/1 epoch (loss 2.7619): 73%|ββββββββ | 914/1250 [22:11<08:06, 1.45s/it]
Training 1/1 epoch (loss 2.7619): 73%|ββββββββ | 915/1250 [22:11<09:12, 1.65s/it]
Training 1/1 epoch (loss 2.7077): 73%|ββββββββ | 915/1250 [22:13<09:12, 1.65s/it]
Training 1/1 epoch (loss 2.7077): 73%|ββββββββ | 916/1250 [22:13<09:47, 1.76s/it]
Training 1/1 epoch (loss 2.4952): 73%|ββββββββ | 916/1250 [22:13<09:47, 1.76s/it]
Training 1/1 epoch (loss 2.4952): 73%|ββββββββ | 917/1250 [22:13<07:35, 1.37s/it]
Training 1/1 epoch (loss 2.8712): 73%|ββββββββ | 917/1250 [22:15<07:35, 1.37s/it]
Training 1/1 epoch (loss 2.8712): 73%|ββββββββ | 918/1250 [22:15<09:23, 1.70s/it]
Training 1/1 epoch (loss 2.6132): 73%|ββββββββ | 918/1250 [22:17<09:23, 1.70s/it]
Training 1/1 epoch (loss 2.6132): 74%|ββββββββ | 919/1250 [22:17<09:31, 1.73s/it]
Training 1/1 epoch (loss 2.7621): 74%|ββββββββ | 919/1250 [22:18<09:31, 1.73s/it]
Training 1/1 epoch (loss 2.7621): 74%|ββββββββ | 920/1250 [22:18<07:57, 1.45s/it]
Training 1/1 epoch (loss 2.6635): 74%|ββββββββ | 920/1250 [22:20<07:57, 1.45s/it]
Training 1/1 epoch (loss 2.6635): 74%|ββββββββ | 921/1250 [22:20<08:34, 1.57s/it]
Training 1/1 epoch (loss 2.6139): 74%|ββββββββ | 921/1250 [22:21<08:34, 1.57s/it]
Training 1/1 epoch (loss 2.6139): 74%|ββββββββ | 922/1250 [22:21<08:32, 1.56s/it]
Training 1/1 epoch (loss 2.5961): 74%|ββββββββ | 922/1250 [22:23<08:32, 1.56s/it]
Training 1/1 epoch (loss 2.5961): 74%|ββββββββ | 923/1250 [22:23<08:18, 1.53s/it]
Training 1/1 epoch (loss 2.8740): 74%|ββββββββ | 923/1250 [22:25<08:18, 1.53s/it]
Training 1/1 epoch (loss 2.8740): 74%|ββββββββ | 924/1250 [22:25<08:48, 1.62s/it]
Training 1/1 epoch (loss 2.7074): 74%|ββββββββ | 924/1250 [22:26<08:48, 1.62s/it]
Training 1/1 epoch (loss 2.7074): 74%|ββββββββ | 925/1250 [22:26<07:33, 1.40s/it]
Training 1/1 epoch (loss 2.6904): 74%|ββββββββ | 925/1250 [22:27<07:33, 1.40s/it]
Training 1/1 epoch (loss 2.6904): 74%|ββββββββ | 926/1250 [22:27<07:58, 1.48s/it]
Training 1/1 epoch (loss 2.5773): 74%|ββββββββ | 926/1250 [22:29<07:58, 1.48s/it]
Training 1/1 epoch (loss 2.5773): 74%|ββββββββ | 927/1250 [22:29<09:04, 1.68s/it]
Training 1/1 epoch (loss 2.9245): 74%|ββββββββ | 927/1250 [22:30<09:04, 1.68s/it]
Training 1/1 epoch (loss 2.9245): 74%|ββββββββ | 928/1250 [22:30<07:23, 1.38s/it]
Training 1/1 epoch (loss 2.7632): 74%|ββββββββ | 928/1250 [22:32<07:23, 1.38s/it]
Training 1/1 epoch (loss 2.7632): 74%|ββββββββ | 929/1250 [22:32<07:28, 1.40s/it]
Training 1/1 epoch (loss 2.8952): 74%|ββββββββ | 929/1250 [22:33<07:28, 1.40s/it]
Training 1/1 epoch (loss 2.8952): 74%|ββββββββ | 930/1250 [22:33<07:37, 1.43s/it]
Training 1/1 epoch (loss 2.7696): 74%|ββββββββ | 930/1250 [22:34<07:37, 1.43s/it]
Training 1/1 epoch (loss 2.7696): 74%|ββββββββ | 931/1250 [22:34<06:52, 1.29s/it]
Training 1/1 epoch (loss 2.7186): 74%|ββββββββ | 931/1250 [22:36<06:52, 1.29s/it]
Training 1/1 epoch (loss 2.7186): 75%|ββββββββ | 932/1250 [22:36<07:50, 1.48s/it]
Training 1/1 epoch (loss 2.7051): 75%|ββββββββ | 932/1250 [22:37<07:50, 1.48s/it]
Training 1/1 epoch (loss 2.7051): 75%|ββββββββ | 933/1250 [22:37<07:39, 1.45s/it]
Training 1/1 epoch (loss 2.5770): 75%|ββββββββ | 933/1250 [22:38<07:39, 1.45s/it]
Training 1/1 epoch (loss 2.5770): 75%|ββββββββ | 934/1250 [22:38<07:08, 1.36s/it]
Training 1/1 epoch (loss 2.5845): 75%|ββββββββ | 934/1250 [22:40<07:08, 1.36s/it]
Training 1/1 epoch (loss 2.5845): 75%|ββββββββ | 935/1250 [22:40<07:09, 1.36s/it]
Training 1/1 epoch (loss 2.7230): 75%|ββββββββ | 935/1250 [22:41<07:09, 1.36s/it]
Training 1/1 epoch (loss 2.7230): 75%|ββββββββ | 936/1250 [22:41<06:51, 1.31s/it]
Training 1/1 epoch (loss 2.4744): 75%|ββββββββ | 936/1250 [22:42<06:51, 1.31s/it]
Training 1/1 epoch (loss 2.4744): 75%|ββββββββ | 937/1250 [22:42<06:34, 1.26s/it]
Training 1/1 epoch (loss 2.5986): 75%|ββββββββ | 937/1250 [22:44<06:34, 1.26s/it]
Training 1/1 epoch (loss 2.5986): 75%|ββββββββ | 938/1250 [22:44<07:26, 1.43s/it]
Training 1/1 epoch (loss 2.7346): 75%|ββββββββ | 938/1250 [22:45<07:26, 1.43s/it]
Training 1/1 epoch (loss 2.7346): 75%|ββββββββ | 939/1250 [22:45<06:41, 1.29s/it]
Training 1/1 epoch (loss 2.7104): 75%|ββββββββ | 939/1250 [22:47<06:41, 1.29s/it]
Training 1/1 epoch (loss 2.7104): 75%|ββββββββ | 940/1250 [22:47<07:15, 1.41s/it]
Training 1/1 epoch (loss 2.6661): 75%|ββββββββ | 940/1250 [22:48<07:15, 1.41s/it]
Training 1/1 epoch (loss 2.6661): 75%|ββββββββ | 941/1250 [22:48<07:15, 1.41s/it]
Training 1/1 epoch (loss 2.6440): 75%|ββββββββ | 941/1250 [22:49<07:15, 1.41s/it]
Training 1/1 epoch (loss 2.6440): 75%|ββββββββ | 942/1250 [22:49<06:33, 1.28s/it]
Training 1/1 epoch (loss 2.4992): 75%|ββββββββ | 942/1250 [22:51<06:33, 1.28s/it]
Training 1/1 epoch (loss 2.4992): 75%|ββββββββ | 943/1250 [22:51<07:25, 1.45s/it]
Training 1/1 epoch (loss 2.4048): 75%|ββββββββ | 943/1250 [22:53<07:25, 1.45s/it]
Training 1/1 epoch (loss 2.4048): 76%|ββββββββ | 944/1250 [22:53<07:47, 1.53s/it]
Training 1/1 epoch (loss 2.6602): 76%|ββββββββ | 944/1250 [22:53<07:47, 1.53s/it]
Training 1/1 epoch (loss 2.6602): 76%|ββββββββ | 945/1250 [22:53<06:19, 1.24s/it]
Training 1/1 epoch (loss 2.7099): 76%|ββββββββ | 945/1250 [22:56<06:19, 1.24s/it]
Training 1/1 epoch (loss 2.7099): 76%|ββββββββ | 946/1250 [22:56<08:08, 1.61s/it]
Training 1/1 epoch (loss 2.6009): 76%|ββββββββ | 946/1250 [22:57<08:08, 1.61s/it]
Training 1/1 epoch (loss 2.6009): 76%|ββββββββ | 947/1250 [22:57<08:07, 1.61s/it]
Training 1/1 epoch (loss 2.5541): 76%|ββββββββ | 947/1250 [22:58<08:07, 1.61s/it]
Training 1/1 epoch (loss 2.5541): 76%|ββββββββ | 948/1250 [22:58<06:48, 1.35s/it]
Training 1/1 epoch (loss 2.7309): 76%|ββββββββ | 948/1250 [23:00<06:48, 1.35s/it]
Training 1/1 epoch (loss 2.7309): 76%|ββββββββ | 949/1250 [23:00<07:31, 1.50s/it]
Training 1/1 epoch (loss 2.5531): 76%|ββββββββ | 949/1250 [23:01<07:31, 1.50s/it]
Training 1/1 epoch (loss 2.5531): 76%|ββββββββ | 950/1250 [23:01<07:00, 1.40s/it]
Training 1/1 epoch (loss 2.4705): 76%|ββββββββ | 950/1250 [23:02<07:00, 1.40s/it]
Training 1/1 epoch (loss 2.4705): 76%|ββββββββ | 951/1250 [23:02<06:59, 1.40s/it]
Training 1/1 epoch (loss 2.7709): 76%|ββββββββ | 951/1250 [23:04<06:59, 1.40s/it]
Training 1/1 epoch (loss 2.7709): 76%|ββββββββ | 952/1250 [23:04<06:59, 1.41s/it]
Training 1/1 epoch (loss 2.5363): 76%|ββββββββ | 952/1250 [23:05<06:59, 1.41s/it]
Training 1/1 epoch (loss 2.5363): 76%|ββββββββ | 953/1250 [23:05<06:56, 1.40s/it]
Training 1/1 epoch (loss 2.6200): 76%|ββββββββ | 953/1250 [23:08<06:56, 1.40s/it]
Training 1/1 epoch (loss 2.6200): 76%|ββββββββ | 954/1250 [23:08<08:13, 1.67s/it]
Training 1/1 epoch (loss 2.5895): 76%|ββββββββ | 954/1250 [23:10<08:13, 1.67s/it]
Training 1/1 epoch (loss 2.5895): 76%|ββββββββ | 955/1250 [23:10<08:58, 1.82s/it]
Training 1/1 epoch (loss 2.6738): 76%|ββββββββ | 955/1250 [23:10<08:58, 1.82s/it]
Training 1/1 epoch (loss 2.6738): 76%|ββββββββ | 956/1250 [23:10<06:53, 1.41s/it]
Training 1/1 epoch (loss 2.8415): 76%|ββββββββ | 956/1250 [23:12<06:53, 1.41s/it]
Training 1/1 epoch (loss 2.8415): 77%|ββββββββ | 957/1250 [23:12<07:55, 1.62s/it]
Training 1/1 epoch (loss 2.6346): 77%|ββββββββ | 957/1250 [23:14<07:55, 1.62s/it]
Training 1/1 epoch (loss 2.6346): 77%|ββββββββ | 958/1250 [23:14<07:42, 1.58s/it]
Training 1/1 epoch (loss 2.9087): 77%|ββββββββ | 958/1250 [23:14<07:42, 1.58s/it]
Training 1/1 epoch (loss 2.9087): 77%|ββββββββ | 959/1250 [23:14<06:17, 1.30s/it]
Training 1/1 epoch (loss 2.4456): 77%|ββββββββ | 959/1250 [23:17<06:17, 1.30s/it]
Training 1/1 epoch (loss 2.4456): 77%|ββββββββ | 960/1250 [23:17<07:34, 1.57s/it]
Training 1/1 epoch (loss 2.4316): 77%|ββββββββ | 960/1250 [23:18<07:34, 1.57s/it]
Training 1/1 epoch (loss 2.4316): 77%|ββββββββ | 961/1250 [23:18<07:17, 1.52s/it]
Training 1/1 epoch (loss 2.4105): 77%|ββββββββ | 961/1250 [23:19<07:17, 1.52s/it]
Training 1/1 epoch (loss 2.4105): 77%|ββββββββ | 962/1250 [23:19<06:45, 1.41s/it]
Training 1/1 epoch (loss 2.8177): 77%|ββββββββ | 962/1250 [23:21<06:45, 1.41s/it]
Training 1/1 epoch (loss 2.8177): 77%|ββββββββ | 963/1250 [23:21<08:02, 1.68s/it]
Training 1/1 epoch (loss 2.7751): 77%|ββββββββ | 963/1250 [23:23<08:02, 1.68s/it]
Training 1/1 epoch (loss 2.7751): 77%|ββββββββ | 964/1250 [23:23<08:10, 1.71s/it]
Training 1/1 epoch (loss 2.4118): 77%|ββββββββ | 964/1250 [23:24<08:10, 1.71s/it]
Training 1/1 epoch (loss 2.4118): 77%|ββββββββ | 965/1250 [23:24<06:55, 1.46s/it]
Training 1/1 epoch (loss 2.6406): 77%|ββββββββ | 965/1250 [23:26<06:55, 1.46s/it]
Training 1/1 epoch (loss 2.6406): 77%|ββββββββ | 966/1250 [23:26<07:46, 1.64s/it]
Training 1/1 epoch (loss 2.6151): 77%|ββββββββ | 966/1250 [23:27<07:46, 1.64s/it]
Training 1/1 epoch (loss 2.6151): 77%|ββββββββ | 967/1250 [23:27<06:49, 1.45s/it]
Training 1/1 epoch (loss 2.6655): 77%|ββββββββ | 967/1250 [23:28<06:49, 1.45s/it]
Training 1/1 epoch (loss 2.6655): 77%|ββββββββ | 968/1250 [23:28<06:26, 1.37s/it]
Training 1/1 epoch (loss 2.9446): 77%|ββββββββ | 968/1250 [23:30<06:26, 1.37s/it]
Training 1/1 epoch (loss 2.9446): 78%|ββββββββ | 969/1250 [23:30<07:18, 1.56s/it]
Training 1/1 epoch (loss 2.5759): 78%|ββββββββ | 969/1250 [23:31<07:18, 1.56s/it]
Training 1/1 epoch (loss 2.5759): 78%|ββββββββ | 970/1250 [23:31<06:03, 1.30s/it]
Training 1/1 epoch (loss 2.9871): 78%|ββββββββ | 970/1250 [23:33<06:03, 1.30s/it]
Training 1/1 epoch (loss 2.9871): 78%|ββββββββ | 971/1250 [23:33<06:31, 1.40s/it]
Training 1/1 epoch (loss 2.6434): 78%|ββββββββ | 971/1250 [23:35<06:31, 1.40s/it]
Training 1/1 epoch (loss 2.6434): 78%|ββββββββ | 972/1250 [23:35<07:09, 1.54s/it]
Training 1/1 epoch (loss 2.6663): 78%|ββββββββ | 972/1250 [23:35<07:09, 1.54s/it]
Training 1/1 epoch (loss 2.6663): 78%|ββββββββ | 973/1250 [23:35<06:07, 1.33s/it]
Training 1/1 epoch (loss 2.5222): 78%|ββββββββ | 973/1250 [23:37<06:07, 1.33s/it]
Training 1/1 epoch (loss 2.5222): 78%|ββββββββ | 974/1250 [23:37<06:52, 1.50s/it]
Training 1/1 epoch (loss 2.6620): 78%|ββββββββ | 974/1250 [23:38<06:52, 1.50s/it]
Training 1/1 epoch (loss 2.6620): 78%|ββββββββ | 975/1250 [23:38<06:22, 1.39s/it]
Training 1/1 epoch (loss 2.5488): 78%|ββββββββ | 975/1250 [23:39<06:22, 1.39s/it]
Training 1/1 epoch (loss 2.5488): 78%|ββββββββ | 976/1250 [23:39<05:33, 1.22s/it]
Training 1/1 epoch (loss 2.7312): 78%|ββββββββ | 976/1250 [23:41<05:33, 1.22s/it]
Training 1/1 epoch (loss 2.7312): 78%|ββββββββ | 977/1250 [23:41<05:57, 1.31s/it]
Training 1/1 epoch (loss 2.7678): 78%|ββββββββ | 977/1250 [23:42<05:57, 1.31s/it]
Training 1/1 epoch (loss 2.7678): 78%|ββββββββ | 978/1250 [23:42<05:32, 1.22s/it]
Training 1/1 epoch (loss 2.9379): 78%|ββββββββ | 978/1250 [23:43<05:32, 1.22s/it]
Training 1/1 epoch (loss 2.9379): 78%|ββββββββ | 979/1250 [23:43<05:45, 1.27s/it]
Training 1/1 epoch (loss 2.7322): 78%|ββββββββ | 979/1250 [23:45<05:45, 1.27s/it]
Training 1/1 epoch (loss 2.7322): 78%|ββββββββ | 980/1250 [23:45<06:24, 1.42s/it]
Training 1/1 epoch (loss 2.6855): 78%|ββββββββ | 980/1250 [23:46<06:24, 1.42s/it]
Training 1/1 epoch (loss 2.6855): 78%|ββββββββ | 981/1250 [23:46<05:17, 1.18s/it]
Training 1/1 epoch (loss 2.7068): 78%|ββββββββ | 981/1250 [23:47<05:17, 1.18s/it]
Training 1/1 epoch (loss 2.7068): 79%|ββββββββ | 982/1250 [23:47<06:08, 1.37s/it]
Training 1/1 epoch (loss 2.7421): 79%|ββββββββ | 982/1250 [23:49<06:08, 1.37s/it]
Training 1/1 epoch (loss 2.7421): 79%|ββββββββ | 983/1250 [23:49<06:48, 1.53s/it]
Training 1/1 epoch (loss 2.7475): 79%|ββββββββ | 983/1250 [23:50<06:48, 1.53s/it]
Training 1/1 epoch (loss 2.7475): 79%|ββββββββ | 984/1250 [23:50<05:23, 1.22s/it]
Training 1/1 epoch (loss 2.8004): 79%|ββββββββ | 984/1250 [23:51<05:23, 1.22s/it]
Training 1/1 epoch (loss 2.8004): 79%|ββββββββ | 985/1250 [23:51<05:40, 1.29s/it]
Training 1/1 epoch (loss 2.5415): 79%|ββββββββ | 985/1250 [23:52<05:40, 1.29s/it]
Training 1/1 epoch (loss 2.5415): 79%|ββββββββ | 986/1250 [23:52<05:27, 1.24s/it]
Training 1/1 epoch (loss 2.7016): 79%|ββββββββ | 986/1250 [23:53<05:27, 1.24s/it]
Training 1/1 epoch (loss 2.7016): 79%|ββββββββ | 987/1250 [23:53<05:10, 1.18s/it]
Training 1/1 epoch (loss 2.6342): 79%|ββββββββ | 987/1250 [23:55<05:10, 1.18s/it]
Training 1/1 epoch (loss 2.6342): 79%|ββββββββ | 988/1250 [23:55<05:59, 1.37s/it]
Training 1/1 epoch (loss 2.8827): 79%|ββββββββ | 988/1250 [23:56<05:59, 1.37s/it]
Training 1/1 epoch (loss 2.8827): 79%|ββββββββ | 989/1250 [23:56<05:24, 1.24s/it]
Training 1/1 epoch (loss 2.5466): 79%|ββββββββ | 989/1250 [23:57<05:24, 1.24s/it]
Training 1/1 epoch (loss 2.5466): 79%|ββββββββ | 990/1250 [23:57<04:49, 1.11s/it]
Training 1/1 epoch (loss 2.5959): 79%|ββββββββ | 990/1250 [23:59<04:49, 1.11s/it]
Training 1/1 epoch (loss 2.5959): 79%|ββββββββ | 991/1250 [23:59<06:36, 1.53s/it]
Training 1/1 epoch (loss 2.3451): 79%|ββββββββ | 991/1250 [24:01<06:36, 1.53s/it]
Training 1/1 epoch (loss 2.3451): 79%|ββββββββ | 992/1250 [24:01<06:01, 1.40s/it]
Training 1/1 epoch (loss 2.6118): 79%|ββββββββ | 992/1250 [24:02<06:01, 1.40s/it]
Training 1/1 epoch (loss 2.6118): 79%|ββββββββ | 993/1250 [24:02<06:33, 1.53s/it]
Training 1/1 epoch (loss 2.5164): 79%|ββββββββ | 993/1250 [24:04<06:33, 1.53s/it]
Training 1/1 epoch (loss 2.5164): 80%|ββββββββ | 994/1250 [24:04<06:34, 1.54s/it]
Training 1/1 epoch (loss 2.4160): 80%|ββββββββ | 994/1250 [24:05<06:34, 1.54s/it]
Training 1/1 epoch (loss 2.4160): 80%|ββββββββ | 995/1250 [24:05<05:22, 1.27s/it]
Training 1/1 epoch (loss 2.6015): 80%|ββββββββ | 995/1250 [24:07<05:22, 1.27s/it]
Training 1/1 epoch (loss 2.6015): 80%|ββββββββ | 996/1250 [24:07<06:37, 1.57s/it]
Training 1/1 epoch (loss 2.7502): 80%|ββββββββ | 996/1250 [24:09<06:37, 1.57s/it]
Training 1/1 epoch (loss 2.7502): 80%|ββββββββ | 997/1250 [24:09<06:53, 1.63s/it]
Training 1/1 epoch (loss 2.6610): 80%|ββββββββ | 997/1250 [24:09<06:53, 1.63s/it]
Training 1/1 epoch (loss 2.6610): 80%|ββββββββ | 998/1250 [24:09<05:35, 1.33s/it]
Training 1/1 epoch (loss 2.6480): 80%|ββββββββ | 998/1250 [24:11<05:35, 1.33s/it]
Training 1/1 epoch (loss 2.6480): 80%|ββββββββ | 999/1250 [24:11<05:37, 1.34s/it]
Training 1/1 epoch (loss 2.6515): 80%|ββββββββ | 999/1250 [24:13<05:37, 1.34s/it]
Training 1/1 epoch (loss 2.6515): 80%|ββββββββ | 1000/1250 [24:13<07:07, 1.71s/it]
Training 1/1 epoch (loss 2.7356): 80%|ββββββββ | 1000/1250 [24:14<07:07, 1.71s/it]
Training 1/1 epoch (loss 2.7356): 80%|ββββββββ | 1001/1250 [24:14<05:49, 1.41s/it]
Training 1/1 epoch (loss 2.4858): 80%|ββββββββ | 1001/1250 [24:16<05:49, 1.41s/it]
Training 1/1 epoch (loss 2.4858): 80%|ββββββββ | 1002/1250 [24:16<06:09, 1.49s/it]
Training 1/1 epoch (loss 2.5631): 80%|ββββββββ | 1002/1250 [24:17<06:09, 1.49s/it]
Training 1/1 epoch (loss 2.5631): 80%|ββββββββ | 1003/1250 [24:17<06:21, 1.54s/it]
Training 1/1 epoch (loss 2.5232): 80%|ββββββββ | 1003/1250 [24:18<06:21, 1.54s/it]
Training 1/1 epoch (loss 2.5232): 80%|ββββββββ | 1004/1250 [24:18<05:54, 1.44s/it]
Training 1/1 epoch (loss 2.7478): 80%|ββββββββ | 1004/1250 [24:20<05:54, 1.44s/it]
Training 1/1 epoch (loss 2.7478): 80%|ββββββββ | 1005/1250 [24:20<06:11, 1.52s/it]
Training 1/1 epoch (loss 2.6779): 80%|ββββββββ | 1005/1250 [24:21<06:11, 1.52s/it]
Training 1/1 epoch (loss 2.6779): 80%|ββββββββ | 1006/1250 [24:21<05:51, 1.44s/it]
Training 1/1 epoch (loss 2.7407): 80%|ββββββββ | 1006/1250 [24:23<05:51, 1.44s/it]
Training 1/1 epoch (loss 2.7407): 81%|ββββββββ | 1007/1250 [24:23<06:17, 1.55s/it]
Training 1/1 epoch (loss 2.7041): 81%|ββββββββ | 1007/1250 [24:26<06:17, 1.55s/it]
Training 1/1 epoch (loss 2.7041): 81%|ββββββββ | 1008/1250 [24:26<07:21, 1.83s/it]
Training 1/1 epoch (loss 2.7995): 81%|ββββββββ | 1008/1250 [24:27<07:21, 1.83s/it]
Training 1/1 epoch (loss 2.7995): 81%|ββββββββ | 1009/1250 [24:27<06:15, 1.56s/it]
Training 1/1 epoch (loss 2.7193): 81%|ββββββββ | 1009/1250 [24:28<06:15, 1.56s/it]
Training 1/1 epoch (loss 2.7193): 81%|ββββββββ | 1010/1250 [24:28<06:28, 1.62s/it]
Training 1/1 epoch (loss 2.8081): 81%|ββββββββ | 1010/1250 [24:31<06:28, 1.62s/it]
Training 1/1 epoch (loss 2.8081): 81%|ββββββββ | 1011/1250 [24:31<07:32, 1.89s/it]
Training 1/1 epoch (loss 2.6990): 81%|ββββββββ | 1011/1250 [24:31<07:32, 1.89s/it]
Training 1/1 epoch (loss 2.6990): 81%|ββββββββ | 1012/1250 [24:31<05:47, 1.46s/it]
Training 1/1 epoch (loss 2.5106): 81%|ββββββββ | 1012/1250 [24:33<05:47, 1.46s/it]
Training 1/1 epoch (loss 2.5106): 81%|ββββββββ | 1013/1250 [24:33<05:42, 1.44s/it]
Training 1/1 epoch (loss 2.4768): 81%|ββββββββ | 1013/1250 [24:35<05:42, 1.44s/it]
Training 1/1 epoch (loss 2.4768): 81%|ββββββββ | 1014/1250 [24:35<06:48, 1.73s/it]
Training 1/1 epoch (loss 2.5940): 81%|ββββββββ | 1014/1250 [24:36<06:48, 1.73s/it]
Training 1/1 epoch (loss 2.5940): 81%|ββββββββ | 1015/1250 [24:36<05:30, 1.41s/it]
Training 1/1 epoch (loss 2.6798): 81%|ββββββββ | 1015/1250 [24:38<05:30, 1.41s/it]
Training 1/1 epoch (loss 2.6798): 81%|βββββββββ | 1016/1250 [24:38<05:59, 1.54s/it]
Training 1/1 epoch (loss 2.6696): 81%|βββββββββ | 1016/1250 [24:39<05:59, 1.54s/it]
Training 1/1 epoch (loss 2.6696): 81%|βββββββββ | 1017/1250 [24:39<05:52, 1.51s/it]
Training 1/1 epoch (loss 2.7546): 81%|βββββββββ | 1017/1250 [24:42<05:52, 1.51s/it]
Training 1/1 epoch (loss 2.7546): 81%|βββββββββ | 1018/1250 [24:42<06:54, 1.79s/it]
Training 1/1 epoch (loss 2.6964): 81%|βββββββββ | 1018/1250 [24:43<06:54, 1.79s/it]
Training 1/1 epoch (loss 2.6964): 82%|βββββββββ | 1019/1250 [24:43<06:59, 1.82s/it]
Training 1/1 epoch (loss 2.8658): 82%|βββββββββ | 1019/1250 [24:44<06:59, 1.82s/it]
Training 1/1 epoch (loss 2.8658): 82%|βββββββββ | 1020/1250 [24:44<05:23, 1.41s/it]
Training 1/1 epoch (loss 2.6041): 82%|βββββββββ | 1020/1250 [24:46<05:23, 1.41s/it]
Training 1/1 epoch (loss 2.6041): 82%|βββββββββ | 1021/1250 [24:46<05:35, 1.46s/it]
Training 1/1 epoch (loss 2.8706): 82%|βββββββββ | 1021/1250 [24:47<05:35, 1.46s/it]
Training 1/1 epoch (loss 2.8706): 82%|βββββββββ | 1022/1250 [24:47<05:21, 1.41s/it]
Training 1/1 epoch (loss 2.5774): 82%|βββββββββ | 1022/1250 [24:47<05:21, 1.41s/it]
Training 1/1 epoch (loss 2.5774): 82%|βββββββββ | 1023/1250 [24:47<04:20, 1.15s/it]
Training 1/1 epoch (loss 2.6447): 82%|βββββββββ | 1023/1250 [24:49<04:20, 1.15s/it]
Training 1/1 epoch (loss 2.6447): 82%|βββββββββ | 1024/1250 [24:49<04:24, 1.17s/it]
Training 1/1 epoch (loss 2.5478): 82%|βββββββββ | 1024/1250 [24:51<04:24, 1.17s/it]
Training 1/1 epoch (loss 2.5478): 82%|βββββββββ | 1025/1250 [24:51<05:28, 1.46s/it]
Training 1/1 epoch (loss 2.6995): 82%|βββββββββ | 1025/1250 [24:51<05:28, 1.46s/it]
Training 1/1 epoch (loss 2.6995): 82%|βββββββββ | 1026/1250 [24:51<04:22, 1.17s/it]
Training 1/1 epoch (loss 2.5772): 82%|βββββββββ | 1026/1250 [24:53<04:22, 1.17s/it]
Training 1/1 epoch (loss 2.5772): 82%|βββββββββ | 1027/1250 [24:53<05:04, 1.36s/it]
Training 1/1 epoch (loss 2.5734): 82%|βββββββββ | 1027/1250 [24:54<05:04, 1.36s/it]
Training 1/1 epoch (loss 2.5734): 82%|βββββββββ | 1028/1250 [24:54<04:50, 1.31s/it]
Training 1/1 epoch (loss 2.6741): 82%|βββββββββ | 1028/1250 [24:56<04:50, 1.31s/it]
Training 1/1 epoch (loss 2.6741): 82%|βββββββββ | 1029/1250 [24:56<04:52, 1.32s/it]
Training 1/1 epoch (loss 2.7708): 82%|βββββββββ | 1029/1250 [24:57<04:52, 1.32s/it]
Training 1/1 epoch (loss 2.7708): 82%|βββββββββ | 1030/1250 [24:57<05:10, 1.41s/it]
Training 1/1 epoch (loss 2.6838): 82%|βββββββββ | 1030/1250 [24:59<05:10, 1.41s/it]
Training 1/1 epoch (loss 2.6838): 82%|βββββββββ | 1031/1250 [24:59<05:14, 1.44s/it]
Training 1/1 epoch (loss 2.7128): 82%|βββββββββ | 1031/1250 [25:01<05:14, 1.44s/it]
Training 1/1 epoch (loss 2.7128): 83%|βββββββββ | 1032/1250 [25:01<05:41, 1.57s/it]
Training 1/1 epoch (loss 2.5628): 83%|βββββββββ | 1032/1250 [25:02<05:41, 1.57s/it]
Training 1/1 epoch (loss 2.5628): 83%|βββββββββ | 1033/1250 [25:02<05:36, 1.55s/it]
Training 1/1 epoch (loss 2.5156): 83%|βββββββββ | 1033/1250 [25:03<05:36, 1.55s/it]
Training 1/1 epoch (loss 2.5156): 83%|βββββββββ | 1034/1250 [25:03<04:36, 1.28s/it]
Training 1/1 epoch (loss 2.4728): 83%|βββββββββ | 1034/1250 [25:05<04:36, 1.28s/it]
Training 1/1 epoch (loss 2.4728): 83%|βββββββββ | 1035/1250 [25:05<05:10, 1.44s/it]
Training 1/1 epoch (loss 2.5655): 83%|βββββββββ | 1035/1250 [25:07<05:10, 1.44s/it]
Training 1/1 epoch (loss 2.5655): 83%|βββββββββ | 1036/1250 [25:07<05:57, 1.67s/it]
Training 1/1 epoch (loss 2.7530): 83%|βββββββββ | 1036/1250 [25:07<05:57, 1.67s/it]
Training 1/1 epoch (loss 2.7530): 83%|βββββββββ | 1037/1250 [25:07<04:40, 1.32s/it]
Training 1/1 epoch (loss 2.5229): 83%|βββββββββ | 1037/1250 [25:09<04:40, 1.32s/it]
Training 1/1 epoch (loss 2.5229): 83%|βββββββββ | 1038/1250 [25:09<04:46, 1.35s/it]
Training 1/1 epoch (loss 2.4233): 83%|βββββββββ | 1038/1250 [25:10<04:46, 1.35s/it]
Training 1/1 epoch (loss 2.4233): 83%|βββββββββ | 1039/1250 [25:10<04:53, 1.39s/it]
Training 1/1 epoch (loss 2.5469): 83%|βββββββββ | 1039/1250 [25:11<04:53, 1.39s/it]
Training 1/1 epoch (loss 2.5469): 83%|βββββββββ | 1040/1250 [25:11<04:18, 1.23s/it]
Training 1/1 epoch (loss 2.3931): 83%|βββββββββ | 1040/1250 [25:12<04:18, 1.23s/it]
Training 1/1 epoch (loss 2.3931): 83%|βββββββββ | 1041/1250 [25:12<04:06, 1.18s/it]
Training 1/1 epoch (loss 2.8230): 83%|βββββββββ | 1041/1250 [25:13<04:06, 1.18s/it]
Training 1/1 epoch (loss 2.8230): 83%|βββββββββ | 1042/1250 [25:13<03:48, 1.10s/it]
Training 1/1 epoch (loss 2.6817): 83%|βββββββββ | 1042/1250 [25:15<03:48, 1.10s/it]
Training 1/1 epoch (loss 2.6817): 83%|βββββββββ | 1043/1250 [25:15<04:28, 1.30s/it]
Training 1/1 epoch (loss 2.6795): 83%|βββββββββ | 1043/1250 [25:17<04:28, 1.30s/it]
Training 1/1 epoch (loss 2.6795): 84%|βββββββββ | 1044/1250 [25:17<05:24, 1.57s/it]
Training 1/1 epoch (loss 2.7682): 84%|βββββββββ | 1044/1250 [25:18<05:24, 1.57s/it]
Training 1/1 epoch (loss 2.7682): 84%|βββββββββ | 1045/1250 [25:18<04:58, 1.45s/it]
Training 1/1 epoch (loss 2.7179): 84%|βββββββββ | 1045/1250 [25:19<04:58, 1.45s/it]
Training 1/1 epoch (loss 2.7179): 84%|βββββββββ | 1046/1250 [25:19<04:51, 1.43s/it]
Training 1/1 epoch (loss 2.7061): 84%|βββββββββ | 1046/1250 [25:21<04:51, 1.43s/it]
Training 1/1 epoch (loss 2.7061): 84%|βββββββββ | 1047/1250 [25:21<04:48, 1.42s/it]
Training 1/1 epoch (loss 2.7012): 84%|βββββββββ | 1047/1250 [25:22<04:48, 1.42s/it]
Training 1/1 epoch (loss 2.7012): 84%|βββββββββ | 1048/1250 [25:22<04:05, 1.22s/it]
Training 1/1 epoch (loss 2.5651): 84%|βββββββββ | 1048/1250 [25:24<04:05, 1.22s/it]
Training 1/1 epoch (loss 2.5651): 84%|βββββββββ | 1049/1250 [25:24<05:18, 1.58s/it]
Training 1/1 epoch (loss 2.8253): 84%|βββββββββ | 1049/1250 [25:25<05:18, 1.58s/it]
Training 1/1 epoch (loss 2.8253): 84%|βββββββββ | 1050/1250 [25:25<04:40, 1.40s/it]
Training 1/1 epoch (loss 2.6614): 84%|βββββββββ | 1050/1250 [25:26<04:40, 1.40s/it]
Training 1/1 epoch (loss 2.6614): 84%|βββββββββ | 1051/1250 [25:26<04:31, 1.36s/it]
Training 1/1 epoch (loss 2.6541): 84%|βββββββββ | 1051/1250 [25:28<04:31, 1.36s/it]
Training 1/1 epoch (loss 2.6541): 84%|βββββββββ | 1052/1250 [25:28<04:33, 1.38s/it]
Training 1/1 epoch (loss 2.5489): 84%|βββββββββ | 1052/1250 [25:29<04:33, 1.38s/it]
Training 1/1 epoch (loss 2.5489): 84%|βββββββββ | 1053/1250 [25:29<04:16, 1.30s/it]
Training 1/1 epoch (loss 2.5425): 84%|βββββββββ | 1053/1250 [25:31<04:16, 1.30s/it]
Training 1/1 epoch (loss 2.5425): 84%|βββββββββ | 1054/1250 [25:31<04:50, 1.48s/it]
Training 1/1 epoch (loss 2.4793): 84%|βββββββββ | 1054/1250 [25:33<04:50, 1.48s/it]
Training 1/1 epoch (loss 2.4793): 84%|βββββββββ | 1055/1250 [25:33<05:25, 1.67s/it]
Training 1/1 epoch (loss 2.5937): 84%|βββββββββ | 1055/1250 [25:34<05:25, 1.67s/it]
Training 1/1 epoch (loss 2.5937): 84%|βββββββββ | 1056/1250 [25:34<04:27, 1.38s/it]
Training 1/1 epoch (loss 2.7447): 84%|βββββββββ | 1056/1250 [25:36<04:27, 1.38s/it]
Training 1/1 epoch (loss 2.7447): 85%|βββββββββ | 1057/1250 [25:36<05:07, 1.59s/it]
Training 1/1 epoch (loss 2.6959): 85%|βββββββββ | 1057/1250 [25:37<05:07, 1.59s/it]
Training 1/1 epoch (loss 2.6959): 85%|βββββββββ | 1058/1250 [25:37<05:14, 1.64s/it]
Training 1/1 epoch (loss 2.6989): 85%|βββββββββ | 1058/1250 [25:39<05:14, 1.64s/it]
Training 1/1 epoch (loss 2.6989): 85%|βββββββββ | 1059/1250 [25:39<05:03, 1.59s/it]
Training 1/1 epoch (loss 2.7725): 85%|βββββββββ | 1059/1250 [25:41<05:03, 1.59s/it]
Training 1/1 epoch (loss 2.7725): 85%|βββββββββ | 1060/1250 [25:41<05:44, 1.81s/it]
Training 1/1 epoch (loss 2.5080): 85%|βββββββββ | 1060/1250 [25:42<05:44, 1.81s/it]
Training 1/1 epoch (loss 2.5080): 85%|βββββββββ | 1061/1250 [25:42<05:04, 1.61s/it]
Training 1/1 epoch (loss 2.8726): 85%|βββββββββ | 1061/1250 [25:44<05:04, 1.61s/it]
Training 1/1 epoch (loss 2.8726): 85%|βββββββββ | 1062/1250 [25:44<05:26, 1.74s/it]
Training 1/1 epoch (loss 2.7843): 85%|βββββββββ | 1062/1250 [25:46<05:26, 1.74s/it]
Training 1/1 epoch (loss 2.7843): 85%|βββββββββ | 1063/1250 [25:46<05:30, 1.77s/it]
Training 1/1 epoch (loss 2.7130): 85%|βββββββββ | 1063/1250 [25:47<05:30, 1.77s/it]
Training 1/1 epoch (loss 2.7130): 85%|βββββββββ | 1064/1250 [25:47<04:33, 1.47s/it]
Training 1/1 epoch (loss 2.6148): 85%|βββββββββ | 1064/1250 [25:48<04:33, 1.47s/it]
Training 1/1 epoch (loss 2.6148): 85%|βββββββββ | 1065/1250 [25:48<04:25, 1.44s/it]
Training 1/1 epoch (loss 2.8298): 85%|βββββββββ | 1065/1250 [25:50<04:25, 1.44s/it]
Training 1/1 epoch (loss 2.8298): 85%|βββββββββ | 1066/1250 [25:50<04:41, 1.53s/it]
Training 1/1 epoch (loss 2.6109): 85%|βββββββββ | 1066/1250 [25:51<04:41, 1.53s/it]
Training 1/1 epoch (loss 2.6109): 85%|βββββββββ | 1067/1250 [25:51<04:05, 1.34s/it]
Training 1/1 epoch (loss 2.5924): 85%|βββββββββ | 1067/1250 [25:52<04:05, 1.34s/it]
Training 1/1 epoch (loss 2.5924): 85%|βββββββββ | 1068/1250 [25:52<04:12, 1.39s/it]
Training 1/1 epoch (loss 2.5707): 85%|βββββββββ | 1068/1250 [25:54<04:12, 1.39s/it]
Training 1/1 epoch (loss 2.5707): 86%|βββββββββ | 1069/1250 [25:54<04:19, 1.43s/it]
Training 1/1 epoch (loss 2.6304): 86%|βββββββββ | 1069/1250 [25:55<04:19, 1.43s/it]
Training 1/1 epoch (loss 2.6304): 86%|βββββββββ | 1070/1250 [25:55<03:47, 1.26s/it]
Training 1/1 epoch (loss 2.7430): 86%|βββββββββ | 1070/1250 [25:57<03:47, 1.26s/it]
Training 1/1 epoch (loss 2.7430): 86%|βββββββββ | 1071/1250 [25:57<04:37, 1.55s/it]
Training 1/1 epoch (loss 2.6921): 86%|βββββββββ | 1071/1250 [25:58<04:37, 1.55s/it]
Training 1/1 epoch (loss 2.6921): 86%|βββββββββ | 1072/1250 [25:58<04:22, 1.47s/it]
Training 1/1 epoch (loss 2.6504): 86%|βββββββββ | 1072/1250 [26:01<04:22, 1.47s/it]
Training 1/1 epoch (loss 2.6504): 86%|βββββββββ | 1073/1250 [26:01<04:53, 1.66s/it]
Training 1/1 epoch (loss 2.7132): 86%|βββββββββ | 1073/1250 [26:03<04:53, 1.66s/it]
Training 1/1 epoch (loss 2.7132): 86%|βββββββββ | 1074/1250 [26:03<05:13, 1.78s/it]
Training 1/1 epoch (loss 2.7130): 86%|βββββββββ | 1074/1250 [26:03<05:13, 1.78s/it]
Training 1/1 epoch (loss 2.7130): 86%|βββββββββ | 1075/1250 [26:03<04:11, 1.44s/it]
Training 1/1 epoch (loss 2.7215): 86%|βββββββββ | 1075/1250 [26:05<04:11, 1.44s/it]
Training 1/1 epoch (loss 2.7215): 86%|βββββββββ | 1076/1250 [26:05<04:02, 1.40s/it]
Training 1/1 epoch (loss 2.6724): 86%|βββββββββ | 1076/1250 [26:06<04:02, 1.40s/it]
Training 1/1 epoch (loss 2.6724): 86%|βββββββββ | 1077/1250 [26:06<04:17, 1.49s/it]
Training 1/1 epoch (loss 2.6625): 86%|βββββββββ | 1077/1250 [26:07<04:17, 1.49s/it]
Training 1/1 epoch (loss 2.6625): 86%|βββββββββ | 1078/1250 [26:07<03:44, 1.30s/it]
Training 1/1 epoch (loss 2.6775): 86%|βββββββββ | 1078/1250 [26:08<03:44, 1.30s/it]
Training 1/1 epoch (loss 2.6775): 86%|βββββββββ | 1079/1250 [26:08<03:43, 1.31s/it]
Training 1/1 epoch (loss 2.8457): 86%|βββββββββ | 1079/1250 [26:09<03:43, 1.31s/it]
Training 1/1 epoch (loss 2.8457): 86%|βββββββββ | 1080/1250 [26:09<03:19, 1.17s/it]
Training 1/1 epoch (loss 2.7796): 86%|βββββββββ | 1080/1250 [26:11<03:19, 1.17s/it]
Training 1/1 epoch (loss 2.7796): 86%|βββββββββ | 1081/1250 [26:11<03:37, 1.29s/it]
Training 1/1 epoch (loss 2.4062): 86%|βββββββββ | 1081/1250 [26:13<03:37, 1.29s/it]
Training 1/1 epoch (loss 2.4062): 87%|βββββββββ | 1082/1250 [26:13<04:33, 1.63s/it]
Training 1/1 epoch (loss 2.7158): 87%|βββββββββ | 1082/1250 [26:14<04:33, 1.63s/it]
Training 1/1 epoch (loss 2.7158): 87%|βββββββββ | 1083/1250 [26:14<03:35, 1.29s/it]
Training 1/1 epoch (loss 2.8221): 87%|βββββββββ | 1083/1250 [26:15<03:35, 1.29s/it]
Training 1/1 epoch (loss 2.8221): 87%|βββββββββ | 1084/1250 [26:15<03:44, 1.35s/it]
Training 1/1 epoch (loss 2.5497): 87%|βββββββββ | 1084/1250 [26:18<03:44, 1.35s/it]
Training 1/1 epoch (loss 2.5497): 87%|βββββββββ | 1085/1250 [26:18<04:35, 1.67s/it]
Training 1/1 epoch (loss 2.6318): 87%|βββββββββ | 1085/1250 [26:18<04:35, 1.67s/it]
Training 1/1 epoch (loss 2.6318): 87%|βββββββββ | 1086/1250 [26:18<03:52, 1.42s/it]
Training 1/1 epoch (loss 2.7906): 87%|βββββββββ | 1086/1250 [26:20<03:52, 1.42s/it]
Training 1/1 epoch (loss 2.7906): 87%|βββββββββ | 1087/1250 [26:20<03:39, 1.35s/it]
Training 1/1 epoch (loss 2.7508): 87%|βββββββββ | 1087/1250 [26:21<03:39, 1.35s/it]
Training 1/1 epoch (loss 2.7508): 87%|βββββββββ | 1088/1250 [26:21<03:55, 1.45s/it]
Training 1/1 epoch (loss 2.5576): 87%|βββββββββ | 1088/1250 [26:22<03:55, 1.45s/it]
Training 1/1 epoch (loss 2.5576): 87%|βββββββββ | 1089/1250 [26:22<03:21, 1.25s/it]
Training 1/1 epoch (loss 2.6864): 87%|βββββββββ | 1089/1250 [26:24<03:21, 1.25s/it]
Training 1/1 epoch (loss 2.6864): 87%|βββββββββ | 1090/1250 [26:24<04:02, 1.51s/it]
Training 1/1 epoch (loss 2.7392): 87%|βββββββββ | 1090/1250 [26:25<04:02, 1.51s/it]
Training 1/1 epoch (loss 2.7392): 87%|βββββββββ | 1091/1250 [26:25<03:30, 1.33s/it]
Training 1/1 epoch (loss 2.5244): 87%|βββββββββ | 1091/1250 [26:27<03:30, 1.33s/it]
Training 1/1 epoch (loss 2.5244): 87%|βββββββββ | 1092/1250 [26:27<03:51, 1.47s/it]
Training 1/1 epoch (loss 2.5833): 87%|βββββββββ | 1092/1250 [26:29<03:51, 1.47s/it]
Training 1/1 epoch (loss 2.5833): 87%|βββββββββ | 1093/1250 [26:29<04:17, 1.64s/it]
Training 1/1 epoch (loss 2.5912): 87%|βββββββββ | 1093/1250 [26:30<04:17, 1.64s/it]
Training 1/1 epoch (loss 2.5912): 88%|βββββββββ | 1094/1250 [26:30<03:45, 1.44s/it]
Training 1/1 epoch (loss 2.7292): 88%|βββββββββ | 1094/1250 [26:31<03:45, 1.44s/it]
Training 1/1 epoch (loss 2.7292): 88%|βββββββββ | 1095/1250 [26:31<03:26, 1.33s/it]
Training 1/1 epoch (loss 2.5461): 88%|βββββββββ | 1095/1250 [26:33<03:26, 1.33s/it]
Training 1/1 epoch (loss 2.5461): 88%|βββββββββ | 1096/1250 [26:33<03:43, 1.45s/it]
Training 1/1 epoch (loss 2.8497): 88%|βββββββββ | 1096/1250 [26:33<03:43, 1.45s/it]
Training 1/1 epoch (loss 2.8497): 88%|βββββββββ | 1097/1250 [26:33<03:06, 1.22s/it]
Training 1/1 epoch (loss 2.7775): 88%|βββββββββ | 1097/1250 [26:36<03:06, 1.22s/it]
Training 1/1 epoch (loss 2.7775): 88%|βββββββββ | 1098/1250 [26:36<03:43, 1.47s/it]
Training 1/1 epoch (loss 2.7747): 88%|βββββββββ | 1098/1250 [26:37<03:43, 1.47s/it]
Training 1/1 epoch (loss 2.7747): 88%|βββββββββ | 1099/1250 [26:37<03:42, 1.47s/it]
Training 1/1 epoch (loss 2.7281): 88%|βββββββββ | 1099/1250 [26:38<03:42, 1.47s/it]
Training 1/1 epoch (loss 2.7281): 88%|βββββββββ | 1100/1250 [26:38<03:03, 1.22s/it]
Training 1/1 epoch (loss 2.5379): 88%|βββββββββ | 1100/1250 [26:40<03:03, 1.22s/it]
Training 1/1 epoch (loss 2.5379): 88%|βββββββββ | 1101/1250 [26:40<03:32, 1.43s/it]
Training 1/1 epoch (loss 2.7337): 88%|βββββββββ | 1101/1250 [26:42<03:32, 1.43s/it]
Training 1/1 epoch (loss 2.7337): 88%|βββββββββ | 1102/1250 [26:42<04:15, 1.73s/it]
Training 1/1 epoch (loss 2.6951): 88%|βββββββββ | 1102/1250 [26:43<04:15, 1.73s/it]
Training 1/1 epoch (loss 2.6951): 88%|βββββββββ | 1103/1250 [26:43<03:40, 1.50s/it]
Training 1/1 epoch (loss 2.5121): 88%|βββββββββ | 1103/1250 [26:45<03:40, 1.50s/it]
Training 1/1 epoch (loss 2.5121): 88%|βββββββββ | 1104/1250 [26:45<04:02, 1.66s/it]
Training 1/1 epoch (loss 2.5815): 88%|βββββββββ | 1104/1250 [26:46<04:02, 1.66s/it]
Training 1/1 epoch (loss 2.5815): 88%|βββββββββ | 1105/1250 [26:46<03:38, 1.51s/it]
Training 1/1 epoch (loss 2.5979): 88%|βββββββββ | 1105/1250 [26:48<03:38, 1.51s/it]
Training 1/1 epoch (loss 2.5979): 88%|βββββββββ | 1106/1250 [26:48<04:13, 1.76s/it]
Training 1/1 epoch (loss 2.5353): 88%|βββββββββ | 1106/1250 [26:51<04:13, 1.76s/it]
Training 1/1 epoch (loss 2.5353): 89%|βββββββββ | 1107/1250 [26:51<04:28, 1.87s/it]
Training 1/1 epoch (loss 2.7094): 89%|βββββββββ | 1107/1250 [26:51<04:28, 1.87s/it]
Training 1/1 epoch (loss 2.7094): 89%|βββββββββ | 1108/1250 [26:51<03:43, 1.57s/it]
Training 1/1 epoch (loss 2.7405): 89%|βββββββββ | 1108/1250 [26:53<03:43, 1.57s/it]
Training 1/1 epoch (loss 2.7405): 89%|βββββββββ | 1109/1250 [26:53<03:40, 1.56s/it]
Training 1/1 epoch (loss 2.8213): 89%|βββββββββ | 1109/1250 [26:55<03:40, 1.56s/it]
Training 1/1 epoch (loss 2.8213): 89%|βββββββββ | 1110/1250 [26:55<03:42, 1.59s/it]
Training 1/1 epoch (loss 2.7945): 89%|βββββββββ | 1110/1250 [26:56<03:42, 1.59s/it]
Training 1/1 epoch (loss 2.7945): 89%|βββββββββ | 1111/1250 [26:56<03:14, 1.40s/it]
Training 1/1 epoch (loss 2.6218): 89%|βββββββββ | 1111/1250 [26:58<03:14, 1.40s/it]
Training 1/1 epoch (loss 2.6218): 89%|βββββββββ | 1112/1250 [26:58<04:06, 1.79s/it]
Training 1/1 epoch (loss 2.7283): 89%|βββββββββ | 1112/1250 [26:59<04:06, 1.79s/it]
Training 1/1 epoch (loss 2.7283): 89%|βββββββββ | 1113/1250 [26:59<03:23, 1.49s/it]
Training 1/1 epoch (loss 2.6416): 89%|βββββββββ | 1113/1250 [27:01<03:23, 1.49s/it]
Training 1/1 epoch (loss 2.6416): 89%|βββββββββ | 1114/1250 [27:01<03:39, 1.62s/it]
Training 1/1 epoch (loss 2.5509): 89%|βββββββββ | 1114/1250 [27:02<03:39, 1.62s/it]
Training 1/1 epoch (loss 2.5509): 89%|βββββββββ | 1115/1250 [27:02<03:32, 1.58s/it]
Training 1/1 epoch (loss 2.6072): 89%|βββββββββ | 1115/1250 [27:03<03:32, 1.58s/it]
Training 1/1 epoch (loss 2.6072): 89%|βββββββββ | 1116/1250 [27:03<02:58, 1.33s/it]
Training 1/1 epoch (loss 2.7354): 89%|βββββββββ | 1116/1250 [27:06<02:58, 1.33s/it]
Training 1/1 epoch (loss 2.7354): 89%|βββββββββ | 1117/1250 [27:06<03:41, 1.66s/it]
Training 1/1 epoch (loss 2.4824): 89%|βββββββββ | 1117/1250 [27:08<03:41, 1.66s/it]
Training 1/1 epoch (loss 2.4824): 89%|βββββββββ | 1118/1250 [27:08<03:50, 1.75s/it]
Training 1/1 epoch (loss 2.5918): 89%|βββββββββ | 1118/1250 [27:09<03:50, 1.75s/it]
Training 1/1 epoch (loss 2.5918): 90%|βββββββββ | 1119/1250 [27:09<03:38, 1.67s/it]
Training 1/1 epoch (loss 2.4652): 90%|βββββββββ | 1119/1250 [27:11<03:38, 1.67s/it]
Training 1/1 epoch (loss 2.4652): 90%|βββββββββ | 1120/1250 [27:11<03:36, 1.67s/it]
Training 1/1 epoch (loss 2.6761): 90%|βββββββββ | 1120/1250 [27:12<03:36, 1.67s/it]
Training 1/1 epoch (loss 2.6761): 90%|βββββββββ | 1121/1250 [27:12<03:17, 1.53s/it]
Training 1/1 epoch (loss 2.6499): 90%|βββββββββ | 1121/1250 [27:14<03:17, 1.53s/it]
Training 1/1 epoch (loss 2.6499): 90%|βββββββββ | 1122/1250 [27:14<03:50, 1.80s/it]
Training 1/1 epoch (loss 2.6893): 90%|βββββββββ | 1122/1250 [27:16<03:50, 1.80s/it]
Training 1/1 epoch (loss 2.6893): 90%|βββββββββ | 1123/1250 [27:16<03:37, 1.71s/it]
Training 1/1 epoch (loss 2.6078): 90%|βββββββββ | 1123/1250 [27:17<03:37, 1.71s/it]
Training 1/1 epoch (loss 2.6078): 90%|βββββββββ | 1124/1250 [27:17<03:21, 1.60s/it]
Training 1/1 epoch (loss 2.7834): 90%|βββββββββ | 1124/1250 [27:19<03:21, 1.60s/it]
Training 1/1 epoch (loss 2.7834): 90%|βββββββββ | 1125/1250 [27:19<03:16, 1.57s/it]
Training 1/1 epoch (loss 2.7411): 90%|βββββββββ | 1125/1250 [27:20<03:16, 1.57s/it]
Training 1/1 epoch (loss 2.7411): 90%|βββββββββ | 1126/1250 [27:20<03:04, 1.49s/it]
Training 1/1 epoch (loss 2.5772): 90%|βββββββββ | 1126/1250 [27:21<03:04, 1.49s/it]
Training 1/1 epoch (loss 2.5772): 90%|βββββββββ | 1127/1250 [27:21<02:56, 1.43s/it]
Training 1/1 epoch (loss 2.7157): 90%|βββββββββ | 1127/1250 [27:23<02:56, 1.43s/it]
Training 1/1 epoch (loss 2.7157): 90%|βββββββββ | 1128/1250 [27:23<03:14, 1.60s/it]
Training 1/1 epoch (loss 2.6344): 90%|βββββββββ | 1128/1250 [27:24<03:14, 1.60s/it]
Training 1/1 epoch (loss 2.6344): 90%|βββββββββ | 1129/1250 [27:24<02:41, 1.33s/it]
Training 1/1 epoch (loss 2.6729): 90%|βββββββββ | 1129/1250 [27:26<02:41, 1.33s/it]
Training 1/1 epoch (loss 2.6729): 90%|βββββββββ | 1130/1250 [27:26<02:45, 1.38s/it]
Training 1/1 epoch (loss 2.4997): 90%|βββββββββ | 1130/1250 [27:27<02:45, 1.38s/it]
Training 1/1 epoch (loss 2.4997): 90%|βββββββββ | 1131/1250 [27:27<02:46, 1.40s/it]
Training 1/1 epoch (loss 2.6076): 90%|βββββββββ | 1131/1250 [27:28<02:46, 1.40s/it]
Training 1/1 epoch (loss 2.6076): 91%|βββββββββ | 1132/1250 [27:28<02:26, 1.24s/it]
Training 1/1 epoch (loss 2.5657): 91%|βββββββββ | 1132/1250 [27:29<02:26, 1.24s/it]
Training 1/1 epoch (loss 2.5657): 91%|βββββββββ | 1133/1250 [27:29<02:37, 1.34s/it]
Training 1/1 epoch (loss 2.6656): 91%|βββββββββ | 1133/1250 [27:30<02:37, 1.34s/it]
Training 1/1 epoch (loss 2.6656): 91%|βββββββββ | 1134/1250 [27:30<02:22, 1.23s/it]
Training 1/1 epoch (loss 2.6797): 91%|βββββββββ | 1134/1250 [27:32<02:22, 1.23s/it]
Training 1/1 epoch (loss 2.6797): 91%|βββββββββ | 1135/1250 [27:32<02:45, 1.44s/it]
Training 1/1 epoch (loss 2.5835): 91%|βββββββββ | 1135/1250 [27:34<02:45, 1.44s/it]
Training 1/1 epoch (loss 2.5835): 91%|βββββββββ | 1136/1250 [27:34<02:58, 1.56s/it]
Training 1/1 epoch (loss 2.6732): 91%|βββββββββ | 1136/1250 [27:35<02:58, 1.56s/it]
Training 1/1 epoch (loss 2.6732): 91%|βββββββββ | 1137/1250 [27:35<02:39, 1.41s/it]
Training 1/1 epoch (loss 2.6627): 91%|βββββββββ | 1137/1250 [27:38<02:39, 1.41s/it]
Training 1/1 epoch (loss 2.6627): 91%|βββββββββ | 1138/1250 [27:38<03:13, 1.73s/it]
Training 1/1 epoch (loss 2.4516): 91%|βββββββββ | 1138/1250 [27:39<03:13, 1.73s/it]
Training 1/1 epoch (loss 2.4516): 91%|βββββββββ | 1139/1250 [27:39<03:02, 1.64s/it]
Training 1/1 epoch (loss 2.8666): 91%|βββββββββ | 1139/1250 [27:40<03:02, 1.64s/it]
Training 1/1 epoch (loss 2.8666): 91%|βββββββββ | 1140/1250 [27:40<02:40, 1.45s/it]
Training 1/1 epoch (loss 2.5266): 91%|βββββββββ | 1140/1250 [27:42<02:40, 1.45s/it]
Training 1/1 epoch (loss 2.5266): 91%|ββββββββββ| 1141/1250 [27:42<02:51, 1.58s/it]
Training 1/1 epoch (loss 2.6130): 91%|ββββββββββ| 1141/1250 [27:43<02:51, 1.58s/it]
Training 1/1 epoch (loss 2.6130): 91%|ββββββββββ| 1142/1250 [27:43<02:30, 1.39s/it]
Training 1/1 epoch (loss 2.6837): 91%|ββββββββββ| 1142/1250 [27:44<02:30, 1.39s/it]
Training 1/1 epoch (loss 2.6837): 91%|ββββββββββ| 1143/1250 [27:44<02:29, 1.40s/it]
Training 1/1 epoch (loss 2.5330): 91%|ββββββββββ| 1143/1250 [27:47<02:29, 1.40s/it]
Training 1/1 epoch (loss 2.5330): 92%|ββββββββββ| 1144/1250 [27:47<03:02, 1.72s/it]
Training 1/1 epoch (loss 2.6329): 92%|ββββββββββ| 1144/1250 [27:48<03:02, 1.72s/it]
Training 1/1 epoch (loss 2.6329): 92%|ββββββββββ| 1145/1250 [27:48<02:31, 1.45s/it]
Training 1/1 epoch (loss 2.6721): 92%|ββββββββββ| 1145/1250 [27:49<02:31, 1.45s/it]
Training 1/1 epoch (loss 2.6721): 92%|ββββββββββ| 1146/1250 [27:49<02:40, 1.54s/it]
Training 1/1 epoch (loss 2.5186): 92%|ββββββββββ| 1146/1250 [27:51<02:40, 1.54s/it]
Training 1/1 epoch (loss 2.5186): 92%|ββββββββββ| 1147/1250 [27:51<02:40, 1.56s/it]
Training 1/1 epoch (loss 2.7204): 92%|ββββββββββ| 1147/1250 [27:52<02:40, 1.56s/it]
Training 1/1 epoch (loss 2.7204): 92%|ββββββββββ| 1148/1250 [27:52<02:23, 1.41s/it]
Training 1/1 epoch (loss 2.5175): 92%|ββββββββββ| 1148/1250 [27:54<02:23, 1.41s/it]
Training 1/1 epoch (loss 2.5175): 92%|ββββββββββ| 1149/1250 [27:54<02:46, 1.65s/it]
Training 1/1 epoch (loss 2.4896): 92%|ββββββββββ| 1149/1250 [27:56<02:46, 1.65s/it]
Training 1/1 epoch (loss 2.4896): 92%|ββββββββββ| 1150/1250 [27:56<02:32, 1.53s/it]
Training 1/1 epoch (loss 2.8449): 92%|ββββββββββ| 1150/1250 [27:57<02:32, 1.53s/it]
Training 1/1 epoch (loss 2.8449): 92%|ββββββββββ| 1151/1250 [27:57<02:24, 1.46s/it]
Training 1/1 epoch (loss 2.6639): 92%|ββββββββββ| 1151/1250 [28:00<02:24, 1.46s/it]
Training 1/1 epoch (loss 2.6639): 92%|ββββββββββ| 1152/1250 [28:00<03:00, 1.84s/it]
Training 1/1 epoch (loss 2.6488): 92%|ββββββββββ| 1152/1250 [28:01<03:00, 1.84s/it]
Training 1/1 epoch (loss 2.6488): 92%|ββββββββββ| 1153/1250 [28:01<02:40, 1.66s/it]
Training 1/1 epoch (loss 2.5971): 92%|ββββββββββ| 1153/1250 [28:02<02:40, 1.66s/it]
Training 1/1 epoch (loss 2.5971): 92%|ββββββββββ| 1154/1250 [28:02<02:37, 1.64s/it]
Training 1/1 epoch (loss 2.6224): 92%|ββββββββββ| 1154/1250 [28:04<02:37, 1.64s/it]
Training 1/1 epoch (loss 2.6224): 92%|ββββββββββ| 1155/1250 [28:04<02:45, 1.75s/it]
Training 1/1 epoch (loss 2.6335): 92%|ββββββββββ| 1155/1250 [28:05<02:45, 1.75s/it]
Training 1/1 epoch (loss 2.6335): 92%|ββββββββββ| 1156/1250 [28:05<02:21, 1.51s/it]
Training 1/1 epoch (loss 2.6171): 92%|ββββββββββ| 1156/1250 [28:07<02:21, 1.51s/it]
Training 1/1 epoch (loss 2.6171): 93%|ββββββββββ| 1157/1250 [28:07<02:20, 1.51s/it]
Training 1/1 epoch (loss 2.6556): 93%|ββββββββββ| 1157/1250 [28:08<02:20, 1.51s/it]
Training 1/1 epoch (loss 2.6556): 93%|ββββββββββ| 1158/1250 [28:08<02:17, 1.50s/it]
Training 1/1 epoch (loss 2.5998): 93%|ββββββββββ| 1158/1250 [28:10<02:17, 1.50s/it]
Training 1/1 epoch (loss 2.5998): 93%|ββββββββββ| 1159/1250 [28:10<02:13, 1.47s/it]
Training 1/1 epoch (loss 2.5895): 93%|ββββββββββ| 1159/1250 [28:12<02:13, 1.47s/it]
Training 1/1 epoch (loss 2.5895): 93%|ββββββββββ| 1160/1250 [28:12<02:22, 1.58s/it]
Training 1/1 epoch (loss 2.4923): 93%|ββββββββββ| 1160/1250 [28:13<02:22, 1.58s/it]
Training 1/1 epoch (loss 2.4923): 93%|ββββββββββ| 1161/1250 [28:13<02:06, 1.43s/it]
Training 1/1 epoch (loss 2.5896): 93%|ββββββββββ| 1161/1250 [28:14<02:06, 1.43s/it]
Training 1/1 epoch (loss 2.5896): 93%|ββββββββββ| 1162/1250 [28:14<02:03, 1.41s/it]
Training 1/1 epoch (loss 2.5098): 93%|ββββββββββ| 1162/1250 [28:15<02:03, 1.41s/it]
Training 1/1 epoch (loss 2.5098): 93%|ββββββββββ| 1163/1250 [28:15<01:57, 1.35s/it]
Training 1/1 epoch (loss 2.4169): 93%|ββββββββββ| 1163/1250 [28:16<01:57, 1.35s/it]
Training 1/1 epoch (loss 2.4169): 93%|ββββββββββ| 1164/1250 [28:16<01:41, 1.18s/it]
Training 1/1 epoch (loss 2.6400): 93%|ββββββββββ| 1164/1250 [28:17<01:41, 1.18s/it]
Training 1/1 epoch (loss 2.6400): 93%|ββββββββββ| 1165/1250 [28:17<01:40, 1.18s/it]
Training 1/1 epoch (loss 2.7224): 93%|ββββββββββ| 1165/1250 [28:19<01:40, 1.18s/it]
Training 1/1 epoch (loss 2.7224): 93%|ββββββββββ| 1166/1250 [28:19<01:55, 1.37s/it]
Training 1/1 epoch (loss 2.8258): 93%|ββββββββββ| 1166/1250 [28:20<01:55, 1.37s/it]
Training 1/1 epoch (loss 2.8258): 93%|ββββββββββ| 1167/1250 [28:20<01:52, 1.35s/it]
Training 1/1 epoch (loss 2.7537): 93%|ββββββββββ| 1167/1250 [28:22<01:52, 1.35s/it]
Training 1/1 epoch (loss 2.7537): 93%|ββββββββββ| 1168/1250 [28:22<01:46, 1.30s/it]
Training 1/1 epoch (loss 2.4462): 93%|ββββββββββ| 1168/1250 [28:23<01:46, 1.30s/it]
Training 1/1 epoch (loss 2.4462): 94%|ββββββββββ| 1169/1250 [28:23<01:38, 1.22s/it]
Training 1/1 epoch (loss 2.5994): 94%|ββββββββββ| 1169/1250 [28:25<01:38, 1.22s/it]
Training 1/1 epoch (loss 2.5994): 94%|ββββββββββ| 1170/1250 [28:25<02:06, 1.58s/it]
Training 1/1 epoch (loss 2.6530): 94%|ββββββββββ| 1170/1250 [28:27<02:06, 1.58s/it]
Training 1/1 epoch (loss 2.6530): 94%|ββββββββββ| 1171/1250 [28:27<02:11, 1.66s/it]
Training 1/1 epoch (loss 2.7850): 94%|ββββββββββ| 1171/1250 [28:28<02:11, 1.66s/it]
Training 1/1 epoch (loss 2.7850): 94%|ββββββββββ| 1172/1250 [28:28<01:49, 1.40s/it]
Training 1/1 epoch (loss 2.7988): 94%|ββββββββββ| 1172/1250 [28:29<01:49, 1.40s/it]
Training 1/1 epoch (loss 2.7988): 94%|ββββββββββ| 1173/1250 [28:29<01:49, 1.42s/it]
Training 1/1 epoch (loss 2.7309): 94%|ββββββββββ| 1173/1250 [28:31<01:49, 1.42s/it]
Training 1/1 epoch (loss 2.7309): 94%|ββββββββββ| 1174/1250 [28:31<01:49, 1.44s/it]
Training 1/1 epoch (loss 2.7070): 94%|ββββββββββ| 1174/1250 [28:32<01:49, 1.44s/it]
Training 1/1 epoch (loss 2.7070): 94%|ββββββββββ| 1175/1250 [28:32<01:46, 1.42s/it]
Training 1/1 epoch (loss 2.6675): 94%|ββββββββββ| 1175/1250 [28:34<01:46, 1.42s/it]
Training 1/1 epoch (loss 2.6675): 94%|ββββββββββ| 1176/1250 [28:34<01:52, 1.51s/it]
Training 1/1 epoch (loss 2.7301): 94%|ββββββββββ| 1176/1250 [28:35<01:52, 1.51s/it]
Training 1/1 epoch (loss 2.7301): 94%|ββββββββββ| 1177/1250 [28:35<01:51, 1.53s/it]
Training 1/1 epoch (loss 2.7230): 94%|ββββββββββ| 1177/1250 [28:37<01:51, 1.53s/it]
Training 1/1 epoch (loss 2.7230): 94%|ββββββββββ| 1178/1250 [28:37<01:47, 1.49s/it]
Training 1/1 epoch (loss 2.5987): 94%|ββββββββββ| 1178/1250 [28:38<01:47, 1.49s/it]
Training 1/1 epoch (loss 2.5987): 94%|ββββββββββ| 1179/1250 [28:38<01:35, 1.35s/it]
Training 1/1 epoch (loss 2.7532): 94%|ββββββββββ| 1179/1250 [28:39<01:35, 1.35s/it]
Training 1/1 epoch (loss 2.7532): 94%|ββββββββββ| 1180/1250 [28:39<01:32, 1.32s/it]
Training 1/1 epoch (loss 2.7098): 94%|ββββββββββ| 1180/1250 [28:41<01:32, 1.32s/it]
Training 1/1 epoch (loss 2.7098): 94%|ββββββββββ| 1181/1250 [28:41<01:43, 1.50s/it]
Training 1/1 epoch (loss 2.6863): 94%|ββββββββββ| 1181/1250 [28:42<01:43, 1.50s/it]
Training 1/1 epoch (loss 2.6863): 95%|ββββββββββ| 1182/1250 [28:42<01:28, 1.31s/it]
Training 1/1 epoch (loss 2.6751): 95%|ββββββββββ| 1182/1250 [28:44<01:28, 1.31s/it]
Training 1/1 epoch (loss 2.6751): 95%|ββββββββββ| 1183/1250 [28:44<01:41, 1.52s/it]
Training 1/1 epoch (loss 2.5807): 95%|ββββββββββ| 1183/1250 [28:46<01:41, 1.52s/it]
Training 1/1 epoch (loss 2.5807): 95%|ββββββββββ| 1184/1250 [28:46<01:48, 1.64s/it]
Training 1/1 epoch (loss 2.6895): 95%|ββββββββββ| 1184/1250 [28:47<01:48, 1.64s/it]
Training 1/1 epoch (loss 2.6895): 95%|ββββββββββ| 1185/1250 [28:47<01:33, 1.44s/it]
Training 1/1 epoch (loss 2.6914): 95%|ββββββββββ| 1185/1250 [28:48<01:33, 1.44s/it]
Training 1/1 epoch (loss 2.6914): 95%|ββββββββββ| 1186/1250 [28:48<01:35, 1.48s/it]
Training 1/1 epoch (loss 2.8358): 95%|ββββββββββ| 1186/1250 [28:49<01:35, 1.48s/it]
Training 1/1 epoch (loss 2.8358): 95%|ββββββββββ| 1187/1250 [28:49<01:27, 1.38s/it]
Training 1/1 epoch (loss 2.4470): 95%|ββββββββββ| 1187/1250 [28:51<01:27, 1.38s/it]
Training 1/1 epoch (loss 2.4470): 95%|ββββββββββ| 1188/1250 [28:51<01:22, 1.33s/it]
Training 1/1 epoch (loss 2.5376): 95%|ββββββββββ| 1188/1250 [28:52<01:22, 1.33s/it]
Training 1/1 epoch (loss 2.5376): 95%|ββββββββββ| 1189/1250 [28:52<01:21, 1.33s/it]
Training 1/1 epoch (loss 2.8963): 95%|ββββββββββ| 1189/1250 [28:53<01:21, 1.33s/it]
Training 1/1 epoch (loss 2.8963): 95%|ββββββββββ| 1190/1250 [28:53<01:17, 1.30s/it]
Training 1/1 epoch (loss 2.8114): 95%|ββββββββββ| 1190/1250 [28:54<01:17, 1.30s/it]
Training 1/1 epoch (loss 2.8114): 95%|ββββββββββ| 1191/1250 [28:54<01:17, 1.31s/it]
Training 1/1 epoch (loss 2.6716): 95%|ββββββββββ| 1191/1250 [28:56<01:17, 1.31s/it]
Training 1/1 epoch (loss 2.6716): 95%|ββββββββββ| 1192/1250 [28:56<01:19, 1.38s/it]
Training 1/1 epoch (loss 2.6818): 95%|ββββββββββ| 1192/1250 [28:57<01:19, 1.38s/it]
Training 1/1 epoch (loss 2.6818): 95%|ββββββββββ| 1193/1250 [28:57<01:09, 1.22s/it]
Training 1/1 epoch (loss 2.6838): 95%|ββββββββββ| 1193/1250 [28:58<01:09, 1.22s/it]
Training 1/1 epoch (loss 2.6838): 96%|ββββββββββ| 1194/1250 [28:58<01:07, 1.20s/it]
Training 1/1 epoch (loss 2.7715): 96%|ββββββββββ| 1194/1250 [28:59<01:07, 1.20s/it]
Training 1/1 epoch (loss 2.7715): 96%|ββββββββββ| 1195/1250 [28:59<01:10, 1.29s/it]
Training 1/1 epoch (loss 2.9073): 96%|ββββββββββ| 1195/1250 [29:00<01:10, 1.29s/it]
Training 1/1 epoch (loss 2.9073): 96%|ββββββββββ| 1196/1250 [29:00<01:03, 1.18s/it]
Training 1/1 epoch (loss 2.7287): 96%|ββββββββββ| 1196/1250 [29:02<01:03, 1.18s/it]
Training 1/1 epoch (loss 2.7287): 96%|ββββββββββ| 1197/1250 [29:02<01:04, 1.22s/it]
Training 1/1 epoch (loss 2.5943): 96%|ββββββββββ| 1197/1250 [29:03<01:04, 1.22s/it]
Training 1/1 epoch (loss 2.5943): 96%|ββββββββββ| 1198/1250 [29:03<01:08, 1.32s/it]
Training 1/1 epoch (loss 2.6009): 96%|ββββββββββ| 1198/1250 [29:04<01:08, 1.32s/it]
Training 1/1 epoch (loss 2.6009): 96%|ββββββββββ| 1199/1250 [29:04<01:02, 1.22s/it]
Training 1/1 epoch (loss 2.5766): 96%|ββββββββββ| 1199/1250 [29:06<01:02, 1.22s/it]
Training 1/1 epoch (loss 2.5766): 96%|ββββββββββ| 1200/1250 [29:06<01:15, 1.51s/it]
Training 1/1 epoch (loss 2.5523): 96%|ββββββββββ| 1200/1250 [29:08<01:15, 1.51s/it]
Training 1/1 epoch (loss 2.5523): 96%|ββββββββββ| 1201/1250 [29:08<01:07, 1.38s/it]
Training 1/1 epoch (loss 2.4942): 96%|ββββββββββ| 1201/1250 [29:09<01:07, 1.38s/it]
Training 1/1 epoch (loss 2.4942): 96%|ββββββββββ| 1202/1250 [29:09<01:01, 1.28s/it]
Training 1/1 epoch (loss 2.6885): 96%|█████████▌| 1202/1250 [29:11<01:01, 1.28s/it]
Training 1/1 epoch (loss 2.6885): 96%|█████████▌| 1203/1250 [29:11<01:12, 1.54s/it]
Training 1/1 epoch (loss 2.5476): 96%|█████████▌| 1203/1250 [29:12<01:12, 1.54s/it]
Training 1/1 epoch (loss 2.5476): 96%|█████████▋| 1204/1250 [29:12<01:07, 1.47s/it]
Training 1/1 epoch (loss 2.9282): 96%|█████████▋| 1204/1250 [29:14<01:07, 1.47s/it]
Training 1/1 epoch (loss 2.9282): 96%|█████████▋| 1205/1250 [29:14<01:08, 1.53s/it]
Training 1/1 epoch (loss 2.5893): 96%|█████████▋| 1205/1250 [29:15<01:08, 1.53s/it]
Training 1/1 epoch (loss 2.5893): 96%|█████████▋| 1206/1250 [29:15<00:58, 1.34s/it]
Training 1/1 epoch (loss 2.6265): 96%|█████████▋| 1206/1250 [29:16<00:58, 1.34s/it]
Training 1/1 epoch (loss 2.6265): 97%|█████████▋| 1207/1250 [29:16<01:02, 1.45s/it]
Training 1/1 epoch (loss 2.6339): 97%|█████████▋| 1207/1250 [29:18<01:02, 1.45s/it]
Training 1/1 epoch (loss 2.6339): 97%|█████████▋| 1208/1250 [29:18<01:06, 1.59s/it]
Training 1/1 epoch (loss 2.6649): 97%|█████████▋| 1208/1250 [29:19<01:06, 1.59s/it]
Training 1/1 epoch (loss 2.6649): 97%|█████████▋| 1209/1250 [29:19<00:58, 1.42s/it]
Training 1/1 epoch (loss 2.7912): 97%|█████████▋| 1209/1250 [29:22<00:58, 1.42s/it]
Training 1/1 epoch (loss 2.7912): 97%|█████████▋| 1210/1250 [29:22<01:08, 1.72s/it]
Training 1/1 epoch (loss 2.6918): 97%|█████████▋| 1210/1250 [29:23<01:08, 1.72s/it]
Training 1/1 epoch (loss 2.6918): 97%|█████████▋| 1211/1250 [29:23<01:03, 1.62s/it]
Training 1/1 epoch (loss 2.6323): 97%|█████████▋| 1211/1250 [29:25<01:03, 1.62s/it]
Training 1/1 epoch (loss 2.6323): 97%|█████████▋| 1212/1250 [29:25<01:01, 1.62s/it]
Training 1/1 epoch (loss 2.7369): 97%|█████████▋| 1212/1250 [29:27<01:01, 1.62s/it]
Training 1/1 epoch (loss 2.7369): 97%|█████████▋| 1213/1250 [29:27<01:09, 1.89s/it]
Training 1/1 epoch (loss 2.5094): 97%|█████████▋| 1213/1250 [29:28<01:09, 1.89s/it]
Training 1/1 epoch (loss 2.5094): 97%|█████████▋| 1214/1250 [29:28<00:57, 1.61s/it]
Training 1/1 epoch (loss 2.6175): 97%|█████████▋| 1214/1250 [29:30<00:57, 1.61s/it]
Training 1/1 epoch (loss 2.6175): 97%|█████████▋| 1215/1250 [29:30<00:56, 1.62s/it]
Training 1/1 epoch (loss 2.9312): 97%|█████████▋| 1215/1250 [29:32<00:56, 1.62s/it]
Training 1/1 epoch (loss 2.9312): 97%|█████████▋| 1216/1250 [29:32<01:04, 1.89s/it]
Training 1/1 epoch (loss 2.5154): 97%|█████████▋| 1216/1250 [29:34<01:04, 1.89s/it]
Training 1/1 epoch (loss 2.5154): 97%|█████████▋| 1217/1250 [29:34<00:56, 1.70s/it]
Training 1/1 epoch (loss 2.5841): 97%|█████████▋| 1217/1250 [29:36<00:56, 1.70s/it]
Training 1/1 epoch (loss 2.5841): 97%|█████████▋| 1218/1250 [29:36<00:58, 1.82s/it]
Training 1/1 epoch (loss 2.7106): 97%|█████████▋| 1218/1250 [29:37<00:58, 1.82s/it]
Training 1/1 epoch (loss 2.7106): 98%|█████████▊| 1219/1250 [29:37<00:54, 1.75s/it]
Training 1/1 epoch (loss 2.5974): 98%|█████████▊| 1219/1250 [29:38<00:54, 1.75s/it]
Training 1/1 epoch (loss 2.5974): 98%|█████████▊| 1220/1250 [29:38<00:46, 1.54s/it]
Training 1/1 epoch (loss 2.5307): 98%|█████████▊| 1220/1250 [29:40<00:46, 1.54s/it]
Training 1/1 epoch (loss 2.5307): 98%|█████████▊| 1221/1250 [29:40<00:47, 1.65s/it]
Training 1/1 epoch (loss 2.4781): 98%|█████████▊| 1221/1250 [29:41<00:47, 1.65s/it]
Training 1/1 epoch (loss 2.4781): 98%|█████████▊| 1222/1250 [29:41<00:37, 1.35s/it]
Training 1/1 epoch (loss 2.5372): 98%|█████████▊| 1222/1250 [29:42<00:37, 1.35s/it]
Training 1/1 epoch (loss 2.5372): 98%|█████████▊| 1223/1250 [29:42<00:38, 1.42s/it]
Training 1/1 epoch (loss 2.7001): 98%|█████████▊| 1223/1250 [29:45<00:38, 1.42s/it]
Training 1/1 epoch (loss 2.7001): 98%|█████████▊| 1224/1250 [29:45<00:43, 1.67s/it]
Training 1/1 epoch (loss 2.6748): 98%|█████████▊| 1224/1250 [29:46<00:43, 1.67s/it]
Training 1/1 epoch (loss 2.6748): 98%|█████████▊| 1225/1250 [29:46<00:36, 1.46s/it]
Training 1/1 epoch (loss 2.7613): 98%|█████████▊| 1225/1250 [29:48<00:36, 1.46s/it]
Training 1/1 epoch (loss 2.7613): 98%|█████████▊| 1226/1250 [29:48<00:40, 1.71s/it]
Training 1/1 epoch (loss 2.5920): 98%|█████████▊| 1226/1250 [29:49<00:40, 1.71s/it]
Training 1/1 epoch (loss 2.5920): 98%|█████████▊| 1227/1250 [29:49<00:35, 1.55s/it]
Training 1/1 epoch (loss 2.7750): 98%|█████████▊| 1227/1250 [29:50<00:35, 1.55s/it]
Training 1/1 epoch (loss 2.7750): 98%|█████████▊| 1228/1250 [29:50<00:29, 1.35s/it]
Training 1/1 epoch (loss 2.4490): 98%|█████████▊| 1228/1250 [29:52<00:29, 1.35s/it]
Training 1/1 epoch (loss 2.4490): 98%|█████████▊| 1229/1250 [29:52<00:32, 1.53s/it]
Training 1/1 epoch (loss 2.7231): 98%|█████████▊| 1229/1250 [29:53<00:32, 1.53s/it]
Training 1/1 epoch (loss 2.7231): 98%|█████████▊| 1230/1250 [29:53<00:29, 1.45s/it]
Training 1/1 epoch (loss 2.5655): 98%|█████████▊| 1230/1250 [29:55<00:29, 1.45s/it]
Training 1/1 epoch (loss 2.5655): 98%|█████████▊| 1231/1250 [29:55<00:32, 1.69s/it]
Training 1/1 epoch (loss 2.7202): 98%|█████████▊| 1231/1250 [29:58<00:32, 1.69s/it]
Training 1/1 epoch (loss 2.7202): 99%|█████████▊| 1232/1250 [29:58<00:33, 1.85s/it]
Training 1/1 epoch (loss 2.7348): 99%|█████████▊| 1232/1250 [29:59<00:33, 1.85s/it]
Training 1/1 epoch (loss 2.7348): 99%|█████████▊| 1233/1250 [29:59<00:28, 1.68s/it]
Training 1/1 epoch (loss 2.7303): 99%|█████████▊| 1233/1250 [30:01<00:28, 1.68s/it]
Training 1/1 epoch (loss 2.7303): 99%|█████████▊| 1234/1250 [30:01<00:26, 1.65s/it]
Training 1/1 epoch (loss 2.6274): 99%|█████████▊| 1234/1250 [30:02<00:26, 1.65s/it]
Training 1/1 epoch (loss 2.6274): 99%|█████████▉| 1235/1250 [30:02<00:23, 1.59s/it]
Training 1/1 epoch (loss 2.5260): 99%|█████████▉| 1235/1250 [30:03<00:23, 1.59s/it]
Training 1/1 epoch (loss 2.5260): 99%|█████████▉| 1236/1250 [30:03<00:20, 1.47s/it]
Training 1/1 epoch (loss 2.5806): 99%|█████████▉| 1236/1250 [30:05<00:20, 1.47s/it]
Training 1/1 epoch (loss 2.5806): 99%|█████████▉| 1237/1250 [30:05<00:19, 1.49s/it]
Training 1/1 epoch (loss 2.6502): 99%|█████████▉| 1237/1250 [30:06<00:19, 1.49s/it]
Training 1/1 epoch (loss 2.6502): 99%|█████████▉| 1238/1250 [30:06<00:16, 1.34s/it]
Training 1/1 epoch (loss 2.7734): 99%|█████████▉| 1238/1250 [30:08<00:16, 1.34s/it]
Training 1/1 epoch (loss 2.7734): 99%|█████████▉| 1239/1250 [30:08<00:17, 1.57s/it]
Training 1/1 epoch (loss 2.6343): 99%|█████████▉| 1239/1250 [30:10<00:17, 1.57s/it]
Training 1/1 epoch (loss 2.6343): 99%|█████████▉| 1240/1250 [30:10<00:16, 1.61s/it]
Training 1/1 epoch (loss 2.7026): 99%|█████████▉| 1240/1250 [30:10<00:16, 1.61s/it]
Training 1/1 epoch (loss 2.7026): 99%|█████████▉| 1241/1250 [30:10<00:12, 1.36s/it]
Training 1/1 epoch (loss 2.6986): 99%|█████████▉| 1241/1250 [30:12<00:12, 1.36s/it]
Training 1/1 epoch (loss 2.6986): 99%|█████████▉| 1242/1250 [30:12<00:12, 1.52s/it]
Training 1/1 epoch (loss 2.8446): 99%|█████████▉| 1242/1250 [30:14<00:12, 1.52s/it]
Training 1/1 epoch (loss 2.8446): 99%|█████████▉| 1243/1250 [30:14<00:10, 1.48s/it]
Training 1/1 epoch (loss 2.7785): 99%|█████████▉| 1243/1250 [30:14<00:10, 1.48s/it]
Training 1/1 epoch (loss 2.7785): 100%|█████████▉| 1244/1250 [30:14<00:07, 1.21s/it]
Training 1/1 epoch (loss 2.8996): 100%|█████████▉| 1244/1250 [30:16<00:07, 1.21s/it]
Training 1/1 epoch (loss 2.8996): 100%|█████████▉| 1245/1250 [30:16<00:06, 1.26s/it]
Training 1/1 epoch (loss 2.5351): 100%|█████████▉| 1245/1250 [30:17<00:06, 1.26s/it]
Training 1/1 epoch (loss 2.5351): 100%|█████████▉| 1246/1250 [30:17<00:05, 1.37s/it]
Training 1/1 epoch (loss 2.8519): 100%|█████████▉| 1246/1250 [30:18<00:05, 1.37s/it]
Training 1/1 epoch (loss 2.8519): 100%|█████████▉| 1247/1250 [30:18<00:04, 1.35s/it]
Training 1/1 epoch (loss 2.7077): 100%|█████████▉| 1247/1250 [30:20<00:04, 1.35s/it]
Training 1/1 epoch (loss 2.7077): 100%|█████████▉| 1248/1250 [30:20<00:02, 1.47s/it]
Training 1/1 epoch (loss 2.8051): 100%|█████████▉| 1248/1250 [30:21<00:02, 1.47s/it]
Training 1/1 epoch (loss 2.8051): 100%|█████████▉| 1249/1250 [30:21<00:01, 1.28s/it]
Training 1/1 epoch (loss 2.4727): 100%|█████████▉| 1249/1250 [30:23<00:01, 1.28s/it]
Training 1/1 epoch (loss 2.4727): 100%|██████████| 1250/1250 [30:23<00:00, 1.63s/it]
Training 1/1 epoch (loss 2.4727): 100%|██████████| 1250/1250 [30:23<00:00, 1.46s/it] |