Training 1/1 epoch (loss 2.6895): 0%| | 0/1250 [00:08<?, ?it/s]
Training 1/1 epoch (loss 2.6895): 0%| | 1/1250 [00:08<2:56:03, 8.46s/it]
Training 1/1 epoch (loss 2.7344): 0%| | 1/1250 [00:10<2:56:03, 8.46s/it]
Training 1/1 epoch (loss 2.7344): 0%| | 2/1250 [00:10<1:39:30, 4.78s/it]
Training 1/1 epoch (loss 2.6357): 0%| | 2/1250 [00:12<1:39:30, 4.78s/it]
Training 1/1 epoch (loss 2.6357): 0%| | 3/1250 [00:12<1:11:50, 3.46s/it]
Training 1/1 epoch (loss 2.9972): 0%| | 3/1250 [00:13<1:11:50, 3.46s/it]
Training 1/1 epoch (loss 2.9972): 0%| | 4/1250 [00:13<54:33, 2.63s/it]
Training 1/1 epoch (loss 2.7498): 0%| | 4/1250 [00:14<54:33, 2.63s/it]
Training 1/1 epoch (loss 2.7498): 0%| | 5/1250 [00:14<42:12, 2.03s/it]
Training 1/1 epoch (loss 2.9447): 0%| | 5/1250 [00:16<42:12, 2.03s/it]
Training 1/1 epoch (loss 2.9447): 0%| | 6/1250 [00:16<41:11, 1.99s/it]
Training 1/1 epoch (loss 2.7219): 0%| | 6/1250 [00:18<41:11, 1.99s/it]
Training 1/1 epoch (loss 2.7219): 1%| | 7/1250 [00:18<37:49, 1.83s/it]
Training 1/1 epoch (loss 2.6385): 1%| | 7/1250 [00:18<37:49, 1.83s/it]
Training 1/1 epoch (loss 2.6385): 1%| | 8/1250 [00:18<30:25, 1.47s/it]
Training 1/1 epoch (loss 2.7001): 1%| | 8/1250 [00:21<30:25, 1.47s/it]
Training 1/1 epoch (loss 2.7001): 1%| | 9/1250 [00:21<36:40, 1.77s/it]
Training 1/1 epoch (loss 2.4852): 1%| | 9/1250 [00:22<36:40, 1.77s/it]
Training 1/1 epoch (loss 2.4852): 1%| | 10/1250 [00:22<31:07, 1.51s/it]
Training 1/1 epoch (loss 2.7661): 1%| | 10/1250 [00:23<31:07, 1.51s/it]
Training 1/1 epoch (loss 2.7661): 1%| | 11/1250 [00:23<28:09, 1.36s/it]
Training 1/1 epoch (loss 2.7108): 1%| | 11/1250 [00:24<28:09, 1.36s/it]
Training 1/1 epoch (loss 2.7108): 1%| | 12/1250 [00:24<28:51, 1.40s/it]
Training 1/1 epoch (loss 2.6189): 1%| | 12/1250 [00:25<28:51, 1.40s/it]
Training 1/1 epoch (loss 2.6189): 1%| | 13/1250 [00:25<26:25, 1.28s/it]
Training 1/1 epoch (loss 2.4735): 1%| | 13/1250 [00:27<26:25, 1.28s/it]
Training 1/1 epoch (loss 2.4735): 1%| | 14/1250 [00:27<27:29, 1.33s/it]
Training 1/1 epoch (loss 2.7584): 1%| | 14/1250 [00:29<27:29, 1.33s/it]
Training 1/1 epoch (loss 2.7584): 1%| | 15/1250 [00:29<30:27, 1.48s/it]
Training 1/1 epoch (loss 2.6672): 1%| | 15/1250 [00:29<30:27, 1.48s/it]
Training 1/1 epoch (loss 2.6672): 1%|β | 16/1250 [00:29<25:37, 1.25s/it]
Training 1/1 epoch (loss 2.7078): 1%|β | 16/1250 [00:30<25:37, 1.25s/it]
Training 1/1 epoch (loss 2.7078): 1%|β | 17/1250 [00:30<23:16, 1.13s/it]
Training 1/1 epoch (loss 2.7807): 1%|β | 17/1250 [00:32<23:16, 1.13s/it]
Training 1/1 epoch (loss 2.7807): 1%|β | 18/1250 [00:32<26:22, 1.28s/it]
Training 1/1 epoch (loss 2.7497): 1%|β | 18/1250 [00:32<26:22, 1.28s/it]
Training 1/1 epoch (loss 2.7497): 2%|β | 19/1250 [00:32<22:18, 1.09s/it]
Training 1/1 epoch (loss 2.7083): 2%|β | 19/1250 [00:34<22:18, 1.09s/it]
Training 1/1 epoch (loss 2.7083): 2%|β | 20/1250 [00:34<27:02, 1.32s/it]
Training 1/1 epoch (loss 2.6306): 2%|β | 20/1250 [00:35<27:02, 1.32s/it]
Training 1/1 epoch (loss 2.6306): 2%|β | 21/1250 [00:35<25:33, 1.25s/it]
Training 1/1 epoch (loss 2.8995): 2%|β | 21/1250 [00:36<25:33, 1.25s/it]
Training 1/1 epoch (loss 2.8995): 2%|β | 22/1250 [00:36<22:33, 1.10s/it]
Training 1/1 epoch (loss 2.7049): 2%|β | 22/1250 [00:37<22:33, 1.10s/it]
Training 1/1 epoch (loss 2.7049): 2%|β | 23/1250 [00:37<22:54, 1.12s/it]
Training 1/1 epoch (loss 2.4003): 2%|β | 23/1250 [00:39<22:54, 1.12s/it]
Training 1/1 epoch (loss 2.4003): 2%|β | 24/1250 [00:39<27:25, 1.34s/it]
Training 1/1 epoch (loss 2.5756): 2%|β | 24/1250 [00:40<27:25, 1.34s/it]
Training 1/1 epoch (loss 2.5756): 2%|β | 25/1250 [00:40<23:53, 1.17s/it]
Training 1/1 epoch (loss 2.5142): 2%|β | 25/1250 [00:41<23:53, 1.17s/it]
Training 1/1 epoch (loss 2.5142): 2%|β | 26/1250 [00:41<25:27, 1.25s/it]
Training 1/1 epoch (loss 2.7769): 2%|β | 26/1250 [00:43<25:27, 1.25s/it]
Training 1/1 epoch (loss 2.7769): 2%|β | 27/1250 [00:43<28:32, 1.40s/it]
Training 1/1 epoch (loss 2.6898): 2%|β | 27/1250 [00:44<28:32, 1.40s/it]
Training 1/1 epoch (loss 2.6898): 2%|β | 28/1250 [00:44<27:37, 1.36s/it]
Training 1/1 epoch (loss 2.7262): 2%|β | 28/1250 [00:46<27:37, 1.36s/it]
Training 1/1 epoch (loss 2.7262): 2%|β | 29/1250 [00:46<30:02, 1.48s/it]
Training 1/1 epoch (loss 2.7273): 2%|β | 29/1250 [00:47<30:02, 1.48s/it]
Training 1/1 epoch (loss 2.7273): 2%|β | 30/1250 [00:47<26:47, 1.32s/it]
Training 1/1 epoch (loss 2.8782): 2%|β | 30/1250 [00:48<26:47, 1.32s/it]
Training 1/1 epoch (loss 2.8782): 2%|β | 31/1250 [00:48<25:31, 1.26s/it]
Training 1/1 epoch (loss 2.7278): 2%|β | 31/1250 [00:50<25:31, 1.26s/it]
Training 1/1 epoch (loss 2.7278): 3%|β | 32/1250 [00:50<30:05, 1.48s/it]
Training 1/1 epoch (loss 2.7458): 3%|β | 32/1250 [00:52<30:05, 1.48s/it]
Training 1/1 epoch (loss 2.7458): 3%|β | 33/1250 [00:52<28:45, 1.42s/it]
Training 1/1 epoch (loss 2.5771): 3%|β | 33/1250 [00:54<28:45, 1.42s/it]
Training 1/1 epoch (loss 2.5771): 3%|β | 34/1250 [00:54<33:50, 1.67s/it]
Training 1/1 epoch (loss 2.6981): 3%|β | 34/1250 [00:56<33:50, 1.67s/it]
Training 1/1 epoch (loss 2.6981): 3%|β | 35/1250 [00:56<38:57, 1.92s/it]
Training 1/1 epoch (loss 2.6769): 3%|β | 35/1250 [00:57<38:57, 1.92s/it]
Training 1/1 epoch (loss 2.6769): 3%|β | 36/1250 [00:57<29:58, 1.48s/it]
Training 1/1 epoch (loss 2.5012): 3%|β | 36/1250 [00:59<29:58, 1.48s/it]
Training 1/1 epoch (loss 2.5012): 3%|β | 37/1250 [00:59<32:40, 1.62s/it]
Training 1/1 epoch (loss 2.4744): 3%|β | 37/1250 [01:00<32:40, 1.62s/it]
Training 1/1 epoch (loss 2.4744): 3%|β | 38/1250 [01:00<33:00, 1.63s/it]
Training 1/1 epoch (loss 2.8530): 3%|β | 38/1250 [01:01<33:00, 1.63s/it]
Training 1/1 epoch (loss 2.8530): 3%|β | 39/1250 [01:01<26:29, 1.31s/it]
Training 1/1 epoch (loss 2.5973): 3%|β | 39/1250 [01:04<26:29, 1.31s/it]
Training 1/1 epoch (loss 2.5973): 3%|β | 40/1250 [01:04<34:33, 1.71s/it]
Training 1/1 epoch (loss 2.6672): 3%|β | 40/1250 [01:06<34:33, 1.71s/it]
Training 1/1 epoch (loss 2.6672): 3%|β | 41/1250 [01:06<39:08, 1.94s/it]
Training 1/1 epoch (loss 2.5157): 3%|β | 41/1250 [01:06<39:08, 1.94s/it]
Training 1/1 epoch (loss 2.5157): 3%|β | 42/1250 [01:06<29:54, 1.49s/it]
Training 1/1 epoch (loss 2.6115): 3%|β | 42/1250 [01:08<29:54, 1.49s/it]
Training 1/1 epoch (loss 2.6115): 3%|β | 43/1250 [01:08<30:22, 1.51s/it]
Training 1/1 epoch (loss 2.6442): 3%|β | 43/1250 [01:10<30:22, 1.51s/it]
Training 1/1 epoch (loss 2.6442): 4%|β | 44/1250 [01:10<35:52, 1.78s/it]
Training 1/1 epoch (loss 2.6178): 4%|β | 44/1250 [01:11<35:52, 1.78s/it]
Training 1/1 epoch (loss 2.6178): 4%|β | 45/1250 [01:11<27:52, 1.39s/it]
Training 1/1 epoch (loss 2.4632): 4%|β | 45/1250 [01:13<27:52, 1.39s/it]
Training 1/1 epoch (loss 2.4632): 4%|β | 46/1250 [01:13<32:05, 1.60s/it]
Training 1/1 epoch (loss 2.6651): 4%|β | 46/1250 [01:15<32:05, 1.60s/it]
Training 1/1 epoch (loss 2.6651): 4%|β | 47/1250 [01:15<35:32, 1.77s/it]
Training 1/1 epoch (loss 2.7233): 4%|β | 47/1250 [01:16<35:32, 1.77s/it]
Training 1/1 epoch (loss 2.7233): 4%|β | 48/1250 [01:16<28:05, 1.40s/it]
Training 1/1 epoch (loss 2.5598): 4%|β | 48/1250 [01:18<28:05, 1.40s/it]
Training 1/1 epoch (loss 2.5598): 4%|β | 49/1250 [01:18<30:43, 1.54s/it]
Training 1/1 epoch (loss 2.6908): 4%|β | 49/1250 [01:19<30:43, 1.54s/it]
Training 1/1 epoch (loss 2.6908): 4%|β | 50/1250 [01:19<31:22, 1.57s/it]
Training 1/1 epoch (loss 2.6063): 4%|β | 50/1250 [01:20<31:22, 1.57s/it]
Training 1/1 epoch (loss 2.6063): 4%|β | 51/1250 [01:20<24:43, 1.24s/it]
Training 1/1 epoch (loss 2.6327): 4%|β | 51/1250 [01:22<24:43, 1.24s/it]
Training 1/1 epoch (loss 2.6327): 4%|β | 52/1250 [01:22<32:22, 1.62s/it]
Training 1/1 epoch (loss 2.6822): 4%|β | 52/1250 [01:23<32:22, 1.62s/it]
Training 1/1 epoch (loss 2.6822): 4%|β | 53/1250 [01:23<29:05, 1.46s/it]
Training 1/1 epoch (loss 2.4893): 4%|β | 53/1250 [01:24<29:05, 1.46s/it]
Training 1/1 epoch (loss 2.4893): 4%|β | 54/1250 [01:24<23:39, 1.19s/it]
Training 1/1 epoch (loss 2.5229): 4%|β | 54/1250 [01:26<23:39, 1.19s/it]
Training 1/1 epoch (loss 2.5229): 4%|β | 55/1250 [01:26<28:09, 1.41s/it]
Training 1/1 epoch (loss 2.5454): 4%|β | 55/1250 [01:28<28:09, 1.41s/it]
Training 1/1 epoch (loss 2.5454): 4%|β | 56/1250 [01:28<32:29, 1.63s/it]
Training 1/1 epoch (loss 2.5369): 4%|β | 56/1250 [01:29<32:29, 1.63s/it]
Training 1/1 epoch (loss 2.5369): 5%|β | 57/1250 [01:29<28:53, 1.45s/it]
Training 1/1 epoch (loss 2.3972): 5%|β | 57/1250 [01:31<28:53, 1.45s/it]
Training 1/1 epoch (loss 2.3972): 5%|β | 58/1250 [01:31<34:33, 1.74s/it]
Training 1/1 epoch (loss 2.4651): 5%|β | 58/1250 [01:33<34:33, 1.74s/it]
Training 1/1 epoch (loss 2.4651): 5%|β | 59/1250 [01:33<31:38, 1.59s/it]
Training 1/1 epoch (loss 2.6384): 5%|β | 59/1250 [01:34<31:38, 1.59s/it]
Training 1/1 epoch (loss 2.6384): 5%|β | 60/1250 [01:34<28:26, 1.43s/it]
Training 1/1 epoch (loss 2.4940): 5%|β | 60/1250 [01:35<28:26, 1.43s/it]
Training 1/1 epoch (loss 2.4940): 5%|β | 61/1250 [01:35<30:48, 1.55s/it]
Training 1/1 epoch (loss 2.6199): 5%|β | 61/1250 [01:37<30:48, 1.55s/it]
Training 1/1 epoch (loss 2.6199): 5%|β | 62/1250 [01:37<29:39, 1.50s/it]
Training 1/1 epoch (loss 2.7417): 5%|β | 62/1250 [01:39<29:39, 1.50s/it]
Training 1/1 epoch (loss 2.7417): 5%|β | 63/1250 [01:39<35:30, 1.79s/it]
Training 1/1 epoch (loss 2.8153): 5%|β | 63/1250 [01:41<35:30, 1.79s/it]
Training 1/1 epoch (loss 2.8153): 5%|β | 64/1250 [01:41<35:50, 1.81s/it]
Training 1/1 epoch (loss 2.6814): 5%|β | 64/1250 [01:42<35:50, 1.81s/it]
Training 1/1 epoch (loss 2.6814): 5%|β | 65/1250 [01:42<27:49, 1.41s/it]
Training 1/1 epoch (loss 2.5319): 5%|β | 65/1250 [01:43<27:49, 1.41s/it]
Training 1/1 epoch (loss 2.5319): 5%|β | 66/1250 [01:43<28:27, 1.44s/it]
Training 1/1 epoch (loss 2.5994): 5%|β | 66/1250 [01:45<28:27, 1.44s/it]
Training 1/1 epoch (loss 2.5994): 5%|β | 67/1250 [01:45<28:59, 1.47s/it]
Training 1/1 epoch (loss 2.6963): 5%|β | 67/1250 [01:45<28:59, 1.47s/it]
Training 1/1 epoch (loss 2.6963): 5%|β | 68/1250 [01:45<23:37, 1.20s/it]
Training 1/1 epoch (loss 2.5470): 5%|β | 68/1250 [01:47<23:37, 1.20s/it]
Training 1/1 epoch (loss 2.5470): 6%|β | 69/1250 [01:47<24:11, 1.23s/it]
Training 1/1 epoch (loss 2.6826): 6%|β | 69/1250 [01:49<24:11, 1.23s/it]
Training 1/1 epoch (loss 2.6826): 6%|β | 70/1250 [01:49<28:58, 1.47s/it]
Training 1/1 epoch (loss 2.5648): 6%|β | 70/1250 [01:49<28:58, 1.47s/it]
Training 1/1 epoch (loss 2.5648): 6%|β | 71/1250 [01:49<24:45, 1.26s/it]
Training 1/1 epoch (loss 2.5215): 6%|β | 71/1250 [01:52<24:45, 1.26s/it]
Training 1/1 epoch (loss 2.5215): 6%|β | 72/1250 [01:52<32:32, 1.66s/it]
Training 1/1 epoch (loss 2.7345): 6%|β | 72/1250 [01:54<32:32, 1.66s/it]
Training 1/1 epoch (loss 2.7345): 6%|β | 73/1250 [01:54<31:49, 1.62s/it]
Training 1/1 epoch (loss 2.6594): 6%|β | 73/1250 [01:55<31:49, 1.62s/it]
Training 1/1 epoch (loss 2.6594): 6%|β | 74/1250 [01:55<31:55, 1.63s/it]
Training 1/1 epoch (loss 2.5403): 6%|β | 74/1250 [01:57<31:55, 1.63s/it]
Training 1/1 epoch (loss 2.5403): 6%|β | 75/1250 [01:57<31:40, 1.62s/it]
Training 1/1 epoch (loss 2.6318): 6%|β | 75/1250 [01:58<31:40, 1.62s/it]
Training 1/1 epoch (loss 2.6318): 6%|β | 76/1250 [01:58<28:49, 1.47s/it]
Training 1/1 epoch (loss 2.5235): 6%|β | 76/1250 [01:59<28:49, 1.47s/it]
Training 1/1 epoch (loss 2.5235): 6%|β | 77/1250 [01:59<26:41, 1.37s/it]
Training 1/1 epoch (loss 2.7579): 6%|β | 77/1250 [02:01<26:41, 1.37s/it]
Training 1/1 epoch (loss 2.7579): 6%|β | 78/1250 [02:01<29:48, 1.53s/it]
Training 1/1 epoch (loss 2.5993): 6%|β | 78/1250 [02:02<29:48, 1.53s/it]
Training 1/1 epoch (loss 2.5993): 6%|β | 79/1250 [02:02<27:51, 1.43s/it]
Training 1/1 epoch (loss 2.5968): 6%|β | 79/1250 [02:04<27:51, 1.43s/it]
Training 1/1 epoch (loss 2.5968): 6%|β | 80/1250 [02:04<28:26, 1.46s/it]
Training 1/1 epoch (loss 2.6585): 6%|β | 80/1250 [02:05<28:26, 1.46s/it]
Training 1/1 epoch (loss 2.6585): 6%|β | 81/1250 [02:05<27:35, 1.42s/it]
Training 1/1 epoch (loss 2.7149): 6%|β | 81/1250 [02:07<27:35, 1.42s/it]
Training 1/1 epoch (loss 2.7149): 7%|β | 82/1250 [02:07<31:25, 1.61s/it]
Training 1/1 epoch (loss 2.7487): 7%|β | 82/1250 [02:08<31:25, 1.61s/it]
Training 1/1 epoch (loss 2.7487): 7%|β | 83/1250 [02:08<27:18, 1.40s/it]
Training 1/1 epoch (loss 2.5938): 7%|β | 83/1250 [02:09<27:18, 1.40s/it]
Training 1/1 epoch (loss 2.5938): 7%|β | 84/1250 [02:09<25:39, 1.32s/it]
Training 1/1 epoch (loss 2.4955): 7%|β | 84/1250 [02:11<25:39, 1.32s/it]
Training 1/1 epoch (loss 2.4955): 7%|β | 85/1250 [02:11<28:40, 1.48s/it]
Training 1/1 epoch (loss 2.5313): 7%|β | 85/1250 [02:13<28:40, 1.48s/it]
Training 1/1 epoch (loss 2.5313): 7%|β | 86/1250 [02:13<30:53, 1.59s/it]
Training 1/1 epoch (loss 2.7306): 7%|β | 86/1250 [02:13<30:53, 1.59s/it]
Training 1/1 epoch (loss 2.7306): 7%|β | 87/1250 [02:13<24:41, 1.27s/it]
Training 1/1 epoch (loss 2.5057): 7%|β | 87/1250 [02:15<24:41, 1.27s/it]
Training 1/1 epoch (loss 2.5057): 7%|β | 88/1250 [02:15<29:39, 1.53s/it]
Training 1/1 epoch (loss 2.6186): 7%|β | 88/1250 [02:17<29:39, 1.53s/it]
Training 1/1 epoch (loss 2.6186): 7%|β | 89/1250 [02:17<28:49, 1.49s/it]
Training 1/1 epoch (loss 2.8304): 7%|β | 89/1250 [02:17<28:49, 1.49s/it]
Training 1/1 epoch (loss 2.8304): 7%|β | 90/1250 [02:17<23:24, 1.21s/it]
Training 1/1 epoch (loss 2.4773): 7%|β | 90/1250 [02:18<23:24, 1.21s/it]
Training 1/1 epoch (loss 2.4773): 7%|β | 91/1250 [02:18<22:30, 1.17s/it]
Training 1/1 epoch (loss 2.5533): 7%|β | 91/1250 [02:20<22:30, 1.17s/it]
Training 1/1 epoch (loss 2.5533): 7%|β | 92/1250 [02:20<26:51, 1.39s/it]
Training 1/1 epoch (loss 2.5505): 7%|β | 92/1250 [02:21<26:51, 1.39s/it]
Training 1/1 epoch (loss 2.5505): 7%|β | 93/1250 [02:21<23:17, 1.21s/it]
Training 1/1 epoch (loss 2.5337): 7%|β | 93/1250 [02:23<23:17, 1.21s/it]
Training 1/1 epoch (loss 2.5337): 8%|β | 94/1250 [02:23<29:44, 1.54s/it]
Training 1/1 epoch (loss 2.5630): 8%|β | 94/1250 [02:26<29:44, 1.54s/it]
Training 1/1 epoch (loss 2.5630): 8%|β | 95/1250 [02:26<34:49, 1.81s/it]
Training 1/1 epoch (loss 2.6254): 8%|β | 95/1250 [02:27<34:49, 1.81s/it]
Training 1/1 epoch (loss 2.6254): 8%|β | 96/1250 [02:27<28:32, 1.48s/it]
Training 1/1 epoch (loss 2.5093): 8%|β | 96/1250 [02:28<28:32, 1.48s/it]
Training 1/1 epoch (loss 2.5093): 8%|β | 97/1250 [02:28<29:03, 1.51s/it]
Training 1/1 epoch (loss 2.5464): 8%|β | 97/1250 [02:29<29:03, 1.51s/it]
Training 1/1 epoch (loss 2.5464): 8%|β | 98/1250 [02:29<26:06, 1.36s/it]
Training 1/1 epoch (loss 2.6155): 8%|β | 98/1250 [02:30<26:06, 1.36s/it]
Training 1/1 epoch (loss 2.6155): 8%|β | 99/1250 [02:30<23:25, 1.22s/it]
Training 1/1 epoch (loss 2.8116): 8%|β | 99/1250 [02:32<23:25, 1.22s/it]
Training 1/1 epoch (loss 2.8116): 8%|β | 100/1250 [02:32<25:55, 1.35s/it]
Training 1/1 epoch (loss 2.6058): 8%|β | 100/1250 [02:33<25:55, 1.35s/it]
Training 1/1 epoch (loss 2.6058): 8%|β | 101/1250 [02:33<24:56, 1.30s/it]
Training 1/1 epoch (loss 2.5238): 8%|β | 101/1250 [02:34<24:56, 1.30s/it]
Training 1/1 epoch (loss 2.5238): 8%|β | 102/1250 [02:34<25:49, 1.35s/it]
Training 1/1 epoch (loss 2.5948): 8%|β | 102/1250 [02:36<25:49, 1.35s/it]
Training 1/1 epoch (loss 2.5948): 8%|β | 103/1250 [02:36<27:30, 1.44s/it]
Training 1/1 epoch (loss 2.7258): 8%|β | 103/1250 [02:37<27:30, 1.44s/it]
Training 1/1 epoch (loss 2.7258): 8%|β | 104/1250 [02:37<25:39, 1.34s/it]
Training 1/1 epoch (loss 2.5715): 8%|β | 104/1250 [02:39<25:39, 1.34s/it]
Training 1/1 epoch (loss 2.5715): 8%|β | 105/1250 [02:39<30:08, 1.58s/it]
Training 1/1 epoch (loss 2.5886): 8%|β | 105/1250 [02:41<30:08, 1.58s/it]
Training 1/1 epoch (loss 2.5886): 8%|β | 106/1250 [02:41<31:24, 1.65s/it]
Training 1/1 epoch (loss 2.5757): 8%|β | 106/1250 [02:42<31:24, 1.65s/it]
Training 1/1 epoch (loss 2.5757): 9%|β | 107/1250 [02:42<24:57, 1.31s/it]
Training 1/1 epoch (loss 2.7386): 9%|β | 107/1250 [02:44<24:57, 1.31s/it]
Training 1/1 epoch (loss 2.7386): 9%|β | 108/1250 [02:44<31:18, 1.65s/it]
Training 1/1 epoch (loss 2.6917): 9%|β | 108/1250 [02:46<31:18, 1.65s/it]
Training 1/1 epoch (loss 2.6917): 9%|β | 109/1250 [02:46<31:23, 1.65s/it]
Training 1/1 epoch (loss 2.6011): 9%|β | 109/1250 [02:46<31:23, 1.65s/it]
Training 1/1 epoch (loss 2.6011): 9%|β | 110/1250 [02:46<25:31, 1.34s/it]
Training 1/1 epoch (loss 2.6944): 9%|β | 110/1250 [02:49<25:31, 1.34s/it]
Training 1/1 epoch (loss 2.6944): 9%|β | 111/1250 [02:49<30:28, 1.61s/it]
Training 1/1 epoch (loss 2.5051): 9%|β | 111/1250 [02:51<30:28, 1.61s/it]
Training 1/1 epoch (loss 2.5051): 9%|β | 112/1250 [02:51<32:35, 1.72s/it]
Training 1/1 epoch (loss 2.4703): 9%|β | 112/1250 [02:52<32:35, 1.72s/it]
Training 1/1 epoch (loss 2.4703): 9%|β | 113/1250 [02:52<31:40, 1.67s/it]
Training 1/1 epoch (loss 2.4521): 9%|β | 113/1250 [02:54<31:40, 1.67s/it]
Training 1/1 epoch (loss 2.4521): 9%|β | 114/1250 [02:54<30:36, 1.62s/it]
Training 1/1 epoch (loss 2.5111): 9%|β | 114/1250 [02:55<30:36, 1.62s/it]
Training 1/1 epoch (loss 2.5111): 9%|β | 115/1250 [02:55<28:16, 1.50s/it]
Training 1/1 epoch (loss 2.5967): 9%|β | 115/1250 [02:56<28:16, 1.50s/it]
Training 1/1 epoch (loss 2.5967): 9%|β | 116/1250 [02:56<25:41, 1.36s/it]
Training 1/1 epoch (loss 2.5230): 9%|β | 116/1250 [02:57<25:41, 1.36s/it]
Training 1/1 epoch (loss 2.5230): 9%|β | 117/1250 [02:57<26:51, 1.42s/it]
Training 1/1 epoch (loss 2.5070): 9%|β | 117/1250 [02:58<26:51, 1.42s/it]
Training 1/1 epoch (loss 2.5070): 9%|β | 118/1250 [02:58<22:59, 1.22s/it]
Training 1/1 epoch (loss 2.5612): 9%|β | 118/1250 [03:00<22:59, 1.22s/it]
Training 1/1 epoch (loss 2.5612): 10%|β | 119/1250 [03:00<24:41, 1.31s/it]
Training 1/1 epoch (loss 2.7372): 10%|β | 119/1250 [03:02<24:41, 1.31s/it]
Training 1/1 epoch (loss 2.7372): 10%|β | 120/1250 [03:02<29:32, 1.57s/it]
Training 1/1 epoch (loss 2.6908): 10%|β | 120/1250 [03:02<29:32, 1.57s/it]
Training 1/1 epoch (loss 2.6908): 10%|β | 121/1250 [03:02<24:08, 1.28s/it]
Training 1/1 epoch (loss 2.4961): 10%|β | 121/1250 [03:04<24:08, 1.28s/it]
Training 1/1 epoch (loss 2.4961): 10%|β | 122/1250 [03:04<24:24, 1.30s/it]
Training 1/1 epoch (loss 2.6315): 10%|β | 122/1250 [03:05<24:24, 1.30s/it]
Training 1/1 epoch (loss 2.6315): 10%|β | 123/1250 [03:05<25:33, 1.36s/it]
Training 1/1 epoch (loss 2.4585): 10%|β | 123/1250 [03:06<25:33, 1.36s/it]
Training 1/1 epoch (loss 2.4585): 10%|β | 124/1250 [03:06<23:31, 1.25s/it]
Training 1/1 epoch (loss 2.4177): 10%|β | 124/1250 [03:08<23:31, 1.25s/it]
Training 1/1 epoch (loss 2.4177): 10%|β | 125/1250 [03:08<26:32, 1.42s/it]
Training 1/1 epoch (loss 2.6298): 10%|β | 125/1250 [03:10<26:32, 1.42s/it]
Training 1/1 epoch (loss 2.6298): 10%|β | 126/1250 [03:10<28:41, 1.53s/it]
Training 1/1 epoch (loss 2.4514): 10%|β | 126/1250 [03:11<28:41, 1.53s/it]
Training 1/1 epoch (loss 2.4514): 10%|β | 127/1250 [03:11<27:17, 1.46s/it]
Training 1/1 epoch (loss 2.6429): 10%|β | 127/1250 [03:12<27:17, 1.46s/it]
Training 1/1 epoch (loss 2.6429): 10%|β | 128/1250 [03:12<21:52, 1.17s/it]
Training 1/1 epoch (loss 2.5333): 10%|β | 128/1250 [03:13<21:52, 1.17s/it]
Training 1/1 epoch (loss 2.5333): 10%|β | 129/1250 [03:13<20:59, 1.12s/it]
Training 1/1 epoch (loss 2.5523): 10%|β | 129/1250 [03:14<20:59, 1.12s/it]
Training 1/1 epoch (loss 2.5523): 10%|β | 130/1250 [03:14<21:52, 1.17s/it]
Training 1/1 epoch (loss 2.7132): 10%|β | 130/1250 [03:16<21:52, 1.17s/it]
Training 1/1 epoch (loss 2.7132): 10%|β | 131/1250 [03:16<24:43, 1.33s/it]
Training 1/1 epoch (loss 2.7097): 10%|β | 131/1250 [03:17<24:43, 1.33s/it]
Training 1/1 epoch (loss 2.7097): 11%|β | 132/1250 [03:17<24:16, 1.30s/it]
Training 1/1 epoch (loss 2.6387): 11%|β | 132/1250 [03:18<24:16, 1.30s/it]
Training 1/1 epoch (loss 2.6387): 11%|β | 133/1250 [03:18<25:38, 1.38s/it]
Training 1/1 epoch (loss 2.7275): 11%|β | 133/1250 [03:20<25:38, 1.38s/it]
Training 1/1 epoch (loss 2.7275): 11%|β | 134/1250 [03:20<25:21, 1.36s/it]
Training 1/1 epoch (loss 2.6526): 11%|β | 134/1250 [03:21<25:21, 1.36s/it]
Training 1/1 epoch (loss 2.6526): 11%|β | 135/1250 [03:21<23:01, 1.24s/it]
Training 1/1 epoch (loss 2.6057): 11%|β | 135/1250 [03:22<23:01, 1.24s/it]
Training 1/1 epoch (loss 2.6057): 11%|β | 136/1250 [03:22<23:36, 1.27s/it]
Training 1/1 epoch (loss 2.5167): 11%|β | 136/1250 [03:24<23:36, 1.27s/it]
Training 1/1 epoch (loss 2.5167): 11%|β | 137/1250 [03:24<28:44, 1.55s/it]
Training 1/1 epoch (loss 2.4851): 11%|β | 137/1250 [03:25<28:44, 1.55s/it]
Training 1/1 epoch (loss 2.4851): 11%|β | 138/1250 [03:25<23:53, 1.29s/it]
Training 1/1 epoch (loss 2.7420): 11%|β | 138/1250 [03:27<23:53, 1.29s/it]
Training 1/1 epoch (loss 2.7420): 11%|β | 139/1250 [03:27<30:15, 1.63s/it]
Training 1/1 epoch (loss 2.6645): 11%|β | 139/1250 [03:30<30:15, 1.63s/it]
Training 1/1 epoch (loss 2.6645): 11%|β | 140/1250 [03:30<34:25, 1.86s/it]
Training 1/1 epoch (loss 2.5382): 11%|β | 140/1250 [03:30<34:25, 1.86s/it]
Training 1/1 epoch (loss 2.5382): 11%|ββ | 141/1250 [03:30<26:26, 1.43s/it]
Training 1/1 epoch (loss 2.6845): 11%|ββ | 141/1250 [03:33<26:26, 1.43s/it]
Training 1/1 epoch (loss 2.6845): 11%|ββ | 142/1250 [03:33<31:15, 1.69s/it]
Training 1/1 epoch (loss 2.6086): 11%|ββ | 142/1250 [03:34<31:15, 1.69s/it]
Training 1/1 epoch (loss 2.6086): 11%|ββ | 143/1250 [03:34<30:05, 1.63s/it]
Training 1/1 epoch (loss 2.7442): 11%|ββ | 143/1250 [03:35<30:05, 1.63s/it]
Training 1/1 epoch (loss 2.7442): 12%|ββ | 144/1250 [03:35<26:14, 1.42s/it]
Training 1/1 epoch (loss 2.6195): 12%|ββ | 144/1250 [03:37<26:14, 1.42s/it]
Training 1/1 epoch (loss 2.6195): 12%|ββ | 145/1250 [03:37<27:33, 1.50s/it]
Training 1/1 epoch (loss 2.7169): 12%|ββ | 145/1250 [03:38<27:33, 1.50s/it]
Training 1/1 epoch (loss 2.7169): 12%|ββ | 146/1250 [03:38<25:18, 1.38s/it]
Training 1/1 epoch (loss 2.5402): 12%|ββ | 146/1250 [03:39<25:18, 1.38s/it]
Training 1/1 epoch (loss 2.5402): 12%|ββ | 147/1250 [03:39<24:28, 1.33s/it]
Training 1/1 epoch (loss 2.6608): 12%|ββ | 147/1250 [03:41<24:28, 1.33s/it]
Training 1/1 epoch (loss 2.6608): 12%|ββ | 148/1250 [03:41<27:34, 1.50s/it]
Training 1/1 epoch (loss 2.5834): 12%|ββ | 148/1250 [03:42<27:34, 1.50s/it]
Training 1/1 epoch (loss 2.5834): 12%|ββ | 149/1250 [03:42<25:25, 1.39s/it]
Training 1/1 epoch (loss 2.8507): 12%|ββ | 149/1250 [03:43<25:25, 1.39s/it]
Training 1/1 epoch (loss 2.8507): 12%|ββ | 150/1250 [03:43<21:57, 1.20s/it]
Training 1/1 epoch (loss 2.6130): 12%|ββ | 150/1250 [03:45<21:57, 1.20s/it]
Training 1/1 epoch (loss 2.6130): 12%|ββ | 151/1250 [03:45<26:07, 1.43s/it]
Training 1/1 epoch (loss 2.6731): 12%|ββ | 151/1250 [03:46<26:07, 1.43s/it]
Training 1/1 epoch (loss 2.6731): 12%|ββ | 152/1250 [03:46<25:29, 1.39s/it]
Training 1/1 epoch (loss 2.8082): 12%|ββ | 152/1250 [03:47<25:29, 1.39s/it]
Training 1/1 epoch (loss 2.8082): 12%|ββ | 153/1250 [03:47<25:32, 1.40s/it]
Training 1/1 epoch (loss 2.7212): 12%|ββ | 153/1250 [03:49<25:32, 1.40s/it]
Training 1/1 epoch (loss 2.7212): 12%|ββ | 154/1250 [03:49<24:53, 1.36s/it]
Training 1/1 epoch (loss 2.6458): 12%|ββ | 154/1250 [03:50<24:53, 1.36s/it]
Training 1/1 epoch (loss 2.6458): 12%|ββ | 155/1250 [03:50<22:42, 1.24s/it]
Training 1/1 epoch (loss 2.7682): 12%|ββ | 155/1250 [03:51<22:42, 1.24s/it]
Training 1/1 epoch (loss 2.7682): 12%|ββ | 156/1250 [03:51<24:57, 1.37s/it]
Training 1/1 epoch (loss 2.7527): 12%|ββ | 156/1250 [03:54<24:57, 1.37s/it]
Training 1/1 epoch (loss 2.7527): 13%|ββ | 157/1250 [03:54<29:55, 1.64s/it]
Training 1/1 epoch (loss 2.7489): 13%|ββ | 157/1250 [03:55<29:55, 1.64s/it]
Training 1/1 epoch (loss 2.7489): 13%|ββ | 158/1250 [03:55<26:58, 1.48s/it]
Training 1/1 epoch (loss 2.8730): 13%|ββ | 158/1250 [03:57<26:58, 1.48s/it]
Training 1/1 epoch (loss 2.8730): 13%|ββ | 159/1250 [03:57<29:42, 1.63s/it]
Training 1/1 epoch (loss 2.6000): 13%|ββ | 159/1250 [03:59<29:42, 1.63s/it]
Training 1/1 epoch (loss 2.6000): 13%|ββ | 160/1250 [03:59<31:38, 1.74s/it]
Training 1/1 epoch (loss 2.6196): 13%|ββ | 160/1250 [03:59<31:38, 1.74s/it]
Training 1/1 epoch (loss 2.6196): 13%|ββ | 161/1250 [03:59<26:15, 1.45s/it]
Training 1/1 epoch (loss 2.5049): 13%|ββ | 161/1250 [04:02<26:15, 1.45s/it]
Training 1/1 epoch (loss 2.5049): 13%|ββ | 162/1250 [04:02<30:15, 1.67s/it]
Training 1/1 epoch (loss 2.6809): 13%|ββ | 162/1250 [04:04<30:15, 1.67s/it]
Training 1/1 epoch (loss 2.6809): 13%|ββ | 163/1250 [04:04<31:31, 1.74s/it]
Training 1/1 epoch (loss 2.4809): 13%|ββ | 163/1250 [04:04<31:31, 1.74s/it]
Training 1/1 epoch (loss 2.4809): 13%|ββ | 164/1250 [04:04<25:02, 1.38s/it]
Training 1/1 epoch (loss 2.5805): 13%|ββ | 164/1250 [04:07<25:02, 1.38s/it]
Training 1/1 epoch (loss 2.5805): 13%|ββ | 165/1250 [04:07<30:42, 1.70s/it]
Training 1/1 epoch (loss 2.5779): 13%|ββ | 165/1250 [04:08<30:42, 1.70s/it]
Training 1/1 epoch (loss 2.5779): 13%|ββ | 166/1250 [04:08<29:51, 1.65s/it]
Training 1/1 epoch (loss 2.5566): 13%|ββ | 166/1250 [04:09<29:51, 1.65s/it]
Training 1/1 epoch (loss 2.5566): 13%|ββ | 167/1250 [04:09<23:08, 1.28s/it]
Training 1/1 epoch (loss 2.5927): 13%|ββ | 167/1250 [04:10<23:08, 1.28s/it]
Training 1/1 epoch (loss 2.5927): 13%|ββ | 168/1250 [04:10<26:33, 1.47s/it]
Training 1/1 epoch (loss 2.6260): 13%|ββ | 168/1250 [04:12<26:33, 1.47s/it]
Training 1/1 epoch (loss 2.6260): 14%|ββ | 169/1250 [04:12<28:15, 1.57s/it]
Training 1/1 epoch (loss 2.6720): 14%|ββ | 169/1250 [04:13<28:15, 1.57s/it]
Training 1/1 epoch (loss 2.6720): 14%|ββ | 170/1250 [04:13<24:43, 1.37s/it]
Training 1/1 epoch (loss 2.6212): 14%|ββ | 170/1250 [04:14<24:43, 1.37s/it]
Training 1/1 epoch (loss 2.6212): 14%|ββ | 171/1250 [04:14<24:31, 1.36s/it]
Training 1/1 epoch (loss 2.5141): 14%|ββ | 171/1250 [04:16<24:31, 1.36s/it]
Training 1/1 epoch (loss 2.5141): 14%|ββ | 172/1250 [04:16<26:19, 1.46s/it]
Training 1/1 epoch (loss 2.4364): 14%|ββ | 172/1250 [04:17<26:19, 1.46s/it]
Training 1/1 epoch (loss 2.4364): 14%|ββ | 173/1250 [04:17<23:37, 1.32s/it]
Training 1/1 epoch (loss 2.7576): 14%|ββ | 173/1250 [04:19<23:37, 1.32s/it]
Training 1/1 epoch (loss 2.7576): 14%|ββ | 174/1250 [04:19<25:37, 1.43s/it]
Training 1/1 epoch (loss 2.7910): 14%|ββ | 174/1250 [04:20<25:37, 1.43s/it]
Training 1/1 epoch (loss 2.7910): 14%|ββ | 175/1250 [04:20<23:30, 1.31s/it]
Training 1/1 epoch (loss 2.5759): 14%|ββ | 175/1250 [04:21<23:30, 1.31s/it]
Training 1/1 epoch (loss 2.5759): 14%|ββ | 176/1250 [04:21<24:35, 1.37s/it]
Training 1/1 epoch (loss 2.8340): 14%|ββ | 176/1250 [04:24<24:35, 1.37s/it]
Training 1/1 epoch (loss 2.8340): 14%|ββ | 177/1250 [04:24<29:29, 1.65s/it]
Training 1/1 epoch (loss 2.6586): 14%|ββ | 177/1250 [04:24<29:29, 1.65s/it]
Training 1/1 epoch (loss 2.6586): 14%|ββ | 178/1250 [04:24<22:38, 1.27s/it]
Training 1/1 epoch (loss 2.8715): 14%|ββ | 178/1250 [04:26<22:38, 1.27s/it]
Training 1/1 epoch (loss 2.8715): 14%|ββ | 179/1250 [04:26<27:07, 1.52s/it]
Training 1/1 epoch (loss 2.7768): 14%|ββ | 179/1250 [04:28<27:07, 1.52s/it]
Training 1/1 epoch (loss 2.7768): 14%|ββ | 180/1250 [04:28<29:22, 1.65s/it]
Training 1/1 epoch (loss 2.8022): 14%|ββ | 180/1250 [04:29<29:22, 1.65s/it]
Training 1/1 epoch (loss 2.8022): 14%|ββ | 181/1250 [04:29<23:29, 1.32s/it]
Training 1/1 epoch (loss 2.6742): 14%|ββ | 181/1250 [04:31<23:29, 1.32s/it]
Training 1/1 epoch (loss 2.6742): 15%|ββ | 182/1250 [04:31<27:49, 1.56s/it]
Training 1/1 epoch (loss 2.7101): 15%|ββ | 182/1250 [04:32<27:49, 1.56s/it]
Training 1/1 epoch (loss 2.7101): 15%|ββ | 183/1250 [04:32<27:46, 1.56s/it]
Training 1/1 epoch (loss 2.6604): 15%|ββ | 183/1250 [04:33<27:46, 1.56s/it]
Training 1/1 epoch (loss 2.6604): 15%|ββ | 184/1250 [04:33<23:09, 1.30s/it]
Training 1/1 epoch (loss 2.5734): 15%|ββ | 184/1250 [04:35<23:09, 1.30s/it]
Training 1/1 epoch (loss 2.5734): 15%|ββ | 185/1250 [04:35<23:58, 1.35s/it]
Training 1/1 epoch (loss 2.6561): 15%|ββ | 185/1250 [04:36<23:58, 1.35s/it]
Training 1/1 epoch (loss 2.6561): 15%|ββ | 186/1250 [04:36<25:26, 1.43s/it]
Training 1/1 epoch (loss 2.7225): 15%|ββ | 186/1250 [04:37<25:26, 1.43s/it]
Training 1/1 epoch (loss 2.7225): 15%|ββ | 187/1250 [04:37<20:48, 1.17s/it]
Training 1/1 epoch (loss 2.4615): 15%|ββ | 187/1250 [04:38<20:48, 1.17s/it]
Training 1/1 epoch (loss 2.4615): 15%|ββ | 188/1250 [04:38<23:44, 1.34s/it]
Training 1/1 epoch (loss 2.5255): 15%|ββ | 188/1250 [04:40<23:44, 1.34s/it]
Training 1/1 epoch (loss 2.5255): 15%|ββ | 189/1250 [04:40<24:42, 1.40s/it]
Training 1/1 epoch (loss 2.5048): 15%|ββ | 189/1250 [04:41<24:42, 1.40s/it]
Training 1/1 epoch (loss 2.5048): 15%|ββ | 190/1250 [04:41<22:26, 1.27s/it]
Training 1/1 epoch (loss 2.6060): 15%|ββ | 190/1250 [04:43<22:26, 1.27s/it]
Training 1/1 epoch (loss 2.6060): 15%|ββ | 191/1250 [04:43<24:20, 1.38s/it]
Training 1/1 epoch (loss 2.8505): 15%|ββ | 191/1250 [04:44<24:20, 1.38s/it]
Training 1/1 epoch (loss 2.8505): 15%|ββ | 192/1250 [04:44<25:16, 1.43s/it]
Training 1/1 epoch (loss 2.6298): 15%|ββ | 192/1250 [04:45<25:16, 1.43s/it]
Training 1/1 epoch (loss 2.6298): 15%|ββ | 193/1250 [04:45<23:48, 1.35s/it]
Training 1/1 epoch (loss 2.7686): 15%|ββ | 193/1250 [04:47<23:48, 1.35s/it]
Training 1/1 epoch (loss 2.7686): 16%|ββ | 194/1250 [04:47<24:44, 1.41s/it]
Training 1/1 epoch (loss 2.6273): 16%|ββ | 194/1250 [04:48<24:44, 1.41s/it]
Training 1/1 epoch (loss 2.6273): 16%|ββ | 195/1250 [04:48<23:18, 1.33s/it]
Training 1/1 epoch (loss 2.4896): 16%|ββ | 195/1250 [04:49<23:18, 1.33s/it]
Training 1/1 epoch (loss 2.4896): 16%|ββ | 196/1250 [04:49<24:01, 1.37s/it]
Training 1/1 epoch (loss 2.5446): 16%|ββ | 196/1250 [04:52<24:01, 1.37s/it]
Training 1/1 epoch (loss 2.5446): 16%|ββ | 197/1250 [04:52<28:29, 1.62s/it]
Training 1/1 epoch (loss 2.7473): 16%|ββ | 197/1250 [04:52<28:29, 1.62s/it]
Training 1/1 epoch (loss 2.7473): 16%|ββ | 198/1250 [04:52<24:08, 1.38s/it]
Training 1/1 epoch (loss 2.6832): 16%|ββ | 198/1250 [04:54<24:08, 1.38s/it]
Training 1/1 epoch (loss 2.6832): 16%|ββ | 199/1250 [04:54<25:46, 1.47s/it]
Training 1/1 epoch (loss 2.6156): 16%|ββ | 199/1250 [04:56<25:46, 1.47s/it]
Training 1/1 epoch (loss 2.6156): 16%|ββ | 200/1250 [04:56<29:48, 1.70s/it]
Training 1/1 epoch (loss 2.7322): 16%|ββ | 200/1250 [04:57<29:48, 1.70s/it]
Training 1/1 epoch (loss 2.7322): 16%|ββ | 201/1250 [04:57<24:15, 1.39s/it]
Training 1/1 epoch (loss 2.5959): 16%|ββ | 201/1250 [04:59<24:15, 1.39s/it]
Training 1/1 epoch (loss 2.5959): 16%|ββ | 202/1250 [04:59<29:09, 1.67s/it]
Training 1/1 epoch (loss 2.5177): 16%|ββ | 202/1250 [05:01<29:09, 1.67s/it]
Training 1/1 epoch (loss 2.5177): 16%|ββ | 203/1250 [05:01<28:13, 1.62s/it]
Training 1/1 epoch (loss 2.5243): 16%|ββ | 203/1250 [05:02<28:13, 1.62s/it]
Training 1/1 epoch (loss 2.5243): 16%|ββ | 204/1250 [05:02<23:27, 1.35s/it]
Training 1/1 epoch (loss 2.5084): 16%|ββ | 204/1250 [05:03<23:27, 1.35s/it]
Training 1/1 epoch (loss 2.5084): 16%|ββ | 205/1250 [05:03<25:42, 1.48s/it]
Training 1/1 epoch (loss 2.8023): 16%|ββ | 205/1250 [05:05<25:42, 1.48s/it]
Training 1/1 epoch (loss 2.8023): 16%|ββ | 206/1250 [05:05<25:00, 1.44s/it]
Training 1/1 epoch (loss 2.6461): 16%|ββ | 206/1250 [05:05<25:00, 1.44s/it]
Training 1/1 epoch (loss 2.6461): 17%|ββ | 207/1250 [05:05<21:07, 1.22s/it]
Training 1/1 epoch (loss 2.6567): 17%|ββ | 207/1250 [05:08<21:07, 1.22s/it]
Training 1/1 epoch (loss 2.6567): 17%|ββ | 208/1250 [05:08<25:43, 1.48s/it]
Training 1/1 epoch (loss 2.6318): 17%|ββ | 208/1250 [05:09<25:43, 1.48s/it]
Training 1/1 epoch (loss 2.6318): 17%|ββ | 209/1250 [05:09<24:45, 1.43s/it]
Training 1/1 epoch (loss 2.6482): 17%|ββ | 209/1250 [05:09<24:45, 1.43s/it]
Training 1/1 epoch (loss 2.6482): 17%|ββ | 210/1250 [05:09<20:31, 1.18s/it]
Training 1/1 epoch (loss 2.6710): 17%|ββ | 210/1250 [05:11<20:31, 1.18s/it]
Training 1/1 epoch (loss 2.6710): 17%|ββ | 211/1250 [05:11<22:49, 1.32s/it]
Training 1/1 epoch (loss 2.8160): 17%|ββ | 211/1250 [05:12<22:49, 1.32s/it]
Training 1/1 epoch (loss 2.8160): 17%|ββ | 212/1250 [05:12<21:05, 1.22s/it]
Training 1/1 epoch (loss 2.5953): 17%|ββ | 212/1250 [05:13<21:05, 1.22s/it]
Training 1/1 epoch (loss 2.5953): 17%|ββ | 213/1250 [05:13<21:47, 1.26s/it]
Training 1/1 epoch (loss 2.4638): 17%|ββ | 213/1250 [05:15<21:47, 1.26s/it]
Training 1/1 epoch (loss 2.4638): 17%|ββ | 214/1250 [05:15<23:17, 1.35s/it]
Training 1/1 epoch (loss 2.4792): 17%|ββ | 214/1250 [05:16<23:17, 1.35s/it]
Training 1/1 epoch (loss 2.4792): 17%|ββ | 215/1250 [05:16<21:21, 1.24s/it]
Training 1/1 epoch (loss 2.3677): 17%|ββ | 215/1250 [05:17<21:21, 1.24s/it]
Training 1/1 epoch (loss 2.3677): 17%|ββ | 216/1250 [05:17<22:45, 1.32s/it]
Training 1/1 epoch (loss 2.4528): 17%|ββ | 216/1250 [05:20<22:45, 1.32s/it]
Training 1/1 epoch (loss 2.4528): 17%|ββ | 217/1250 [05:20<28:27, 1.65s/it]
Training 1/1 epoch (loss 2.6955): 17%|ββ | 217/1250 [05:21<28:27, 1.65s/it]
Training 1/1 epoch (loss 2.6955): 17%|ββ | 218/1250 [05:21<24:35, 1.43s/it]
Training 1/1 epoch (loss 2.7063): 17%|ββ | 218/1250 [05:23<24:35, 1.43s/it]
Training 1/1 epoch (loss 2.7063): 18%|ββ | 219/1250 [05:23<26:19, 1.53s/it]
Training 1/1 epoch (loss 2.4764): 18%|ββ | 219/1250 [05:24<26:19, 1.53s/it]
Training 1/1 epoch (loss 2.4764): 18%|ββ | 220/1250 [05:24<28:18, 1.65s/it]
Training 1/1 epoch (loss 2.5192): 18%|ββ | 220/1250 [05:25<28:18, 1.65s/it]
Training 1/1 epoch (loss 2.5192): 18%|ββ | 221/1250 [05:25<22:31, 1.31s/it]
Training 1/1 epoch (loss 2.6507): 18%|ββ | 221/1250 [05:27<22:31, 1.31s/it]
Training 1/1 epoch (loss 2.6507): 18%|ββ | 222/1250 [05:27<26:50, 1.57s/it]
Training 1/1 epoch (loss 2.5295): 18%|ββ | 222/1250 [05:29<26:50, 1.57s/it]
Training 1/1 epoch (loss 2.5295): 18%|ββ | 223/1250 [05:29<27:28, 1.60s/it]
Training 1/1 epoch (loss 2.5683): 18%|ββ | 223/1250 [05:30<27:28, 1.60s/it]
Training 1/1 epoch (loss 2.5683): 18%|ββ | 224/1250 [05:30<24:55, 1.46s/it]
Training 1/1 epoch (loss 2.4135): 18%|ββ | 224/1250 [05:32<24:55, 1.46s/it]
Training 1/1 epoch (loss 2.4135): 18%|ββ | 225/1250 [05:32<29:44, 1.74s/it]
Training 1/1 epoch (loss 2.5973): 18%|ββ | 225/1250 [05:34<29:44, 1.74s/it]
Training 1/1 epoch (loss 2.5973): 18%|ββ | 226/1250 [05:34<30:13, 1.77s/it]
Training 1/1 epoch (loss 2.7840): 18%|ββ | 226/1250 [05:35<30:13, 1.77s/it]
Training 1/1 epoch (loss 2.7840): 18%|ββ | 227/1250 [05:35<25:06, 1.47s/it]
Training 1/1 epoch (loss 2.7061): 18%|ββ | 227/1250 [05:37<25:06, 1.47s/it]
Training 1/1 epoch (loss 2.7061): 18%|ββ | 228/1250 [05:37<28:08, 1.65s/it]
Training 1/1 epoch (loss 2.6362): 18%|ββ | 228/1250 [05:39<28:08, 1.65s/it]
Training 1/1 epoch (loss 2.6362): 18%|ββ | 229/1250 [05:39<28:10, 1.66s/it]
Training 1/1 epoch (loss 2.5504): 18%|ββ | 229/1250 [05:39<28:10, 1.66s/it]
Training 1/1 epoch (loss 2.5504): 18%|ββ | 230/1250 [05:39<22:29, 1.32s/it]
Training 1/1 epoch (loss 2.7516): 18%|ββ | 230/1250 [05:41<22:29, 1.32s/it]
Training 1/1 epoch (loss 2.7516): 18%|ββ | 231/1250 [05:41<26:16, 1.55s/it]
Training 1/1 epoch (loss 2.7772): 18%|ββ | 231/1250 [05:43<26:16, 1.55s/it]
Training 1/1 epoch (loss 2.7772): 19%|ββ | 232/1250 [05:43<25:55, 1.53s/it]
Training 1/1 epoch (loss 2.6560): 19%|ββ | 232/1250 [05:45<25:55, 1.53s/it]
Training 1/1 epoch (loss 2.6560): 19%|ββ | 233/1250 [05:45<26:49, 1.58s/it]
Training 1/1 epoch (loss 2.6433): 19%|ββ | 233/1250 [05:46<26:49, 1.58s/it]
Training 1/1 epoch (loss 2.6433): 19%|ββ | 234/1250 [05:46<28:04, 1.66s/it]
Training 1/1 epoch (loss 2.6847): 19%|ββ | 234/1250 [05:47<28:04, 1.66s/it]
Training 1/1 epoch (loss 2.6847): 19%|ββ | 235/1250 [05:47<22:16, 1.32s/it]
Training 1/1 epoch (loss 2.6686): 19%|ββ | 235/1250 [05:48<22:16, 1.32s/it]
Training 1/1 epoch (loss 2.6686): 19%|ββ | 236/1250 [05:48<23:15, 1.38s/it]
Training 1/1 epoch (loss 2.4901): 19%|ββ | 236/1250 [05:50<23:15, 1.38s/it]
Training 1/1 epoch (loss 2.4901): 19%|ββ | 237/1250 [05:50<26:30, 1.57s/it]
Training 1/1 epoch (loss 2.6159): 19%|ββ | 237/1250 [05:51<26:30, 1.57s/it]
Training 1/1 epoch (loss 2.6159): 19%|ββ | 238/1250 [05:51<20:58, 1.24s/it]
Training 1/1 epoch (loss 2.5147): 19%|ββ | 238/1250 [05:52<20:58, 1.24s/it]
Training 1/1 epoch (loss 2.5147): 19%|ββ | 239/1250 [05:52<21:26, 1.27s/it]
Training 1/1 epoch (loss 2.5738): 19%|ββ | 239/1250 [05:54<21:26, 1.27s/it]
Training 1/1 epoch (loss 2.5738): 19%|ββ | 240/1250 [05:54<25:59, 1.54s/it]
Training 1/1 epoch (loss 2.4772): 19%|ββ | 240/1250 [05:55<25:59, 1.54s/it]
Training 1/1 epoch (loss 2.4772): 19%|ββ | 241/1250 [05:55<22:04, 1.31s/it]
Training 1/1 epoch (loss 2.6071): 19%|ββ | 241/1250 [05:57<22:04, 1.31s/it]
Training 1/1 epoch (loss 2.6071): 19%|ββ | 242/1250 [05:57<25:56, 1.54s/it]
Training 1/1 epoch (loss 2.4817): 19%|ββ | 242/1250 [05:59<25:56, 1.54s/it]
Training 1/1 epoch (loss 2.4817): 19%|ββ | 243/1250 [05:59<24:19, 1.45s/it]
Training 1/1 epoch (loss 2.4051): 19%|ββ | 243/1250 [05:59<24:19, 1.45s/it]
Training 1/1 epoch (loss 2.4051): 20%|ββ | 244/1250 [05:59<20:53, 1.25s/it]
Training 1/1 epoch (loss 2.5612): 20%|ββ | 244/1250 [06:02<20:53, 1.25s/it]
Training 1/1 epoch (loss 2.5612): 20%|ββ | 245/1250 [06:02<26:57, 1.61s/it]
Training 1/1 epoch (loss 2.6350): 20%|ββ | 245/1250 [06:03<26:57, 1.61s/it]
Training 1/1 epoch (loss 2.6350): 20%|ββ | 246/1250 [06:03<25:38, 1.53s/it]
Training 1/1 epoch (loss 2.5225): 20%|ββ | 246/1250 [06:06<25:38, 1.53s/it]
Training 1/1 epoch (loss 2.5225): 20%|ββ | 247/1250 [06:06<30:35, 1.83s/it]
Training 1/1 epoch (loss 2.5179): 20%|ββ | 247/1250 [06:07<30:35, 1.83s/it]
Training 1/1 epoch (loss 2.5179): 20%|ββ | 248/1250 [06:07<29:08, 1.75s/it]
Training 1/1 epoch (loss 2.7152): 20%|ββ | 248/1250 [06:08<29:08, 1.75s/it]
Training 1/1 epoch (loss 2.7152): 20%|ββ | 249/1250 [06:08<23:23, 1.40s/it]
Training 1/1 epoch (loss 2.6878): 20%|ββ | 249/1250 [06:09<23:23, 1.40s/it]
Training 1/1 epoch (loss 2.6878): 20%|ββ | 250/1250 [06:09<23:59, 1.44s/it]
Training 1/1 epoch (loss 2.5125): 20%|ββ | 250/1250 [06:11<23:59, 1.44s/it]
Training 1/1 epoch (loss 2.5125): 20%|ββ | 251/1250 [06:11<25:48, 1.55s/it]
Training 1/1 epoch (loss 2.6733): 20%|ββ | 251/1250 [06:11<25:48, 1.55s/it]
Training 1/1 epoch (loss 2.6733): 20%|ββ | 252/1250 [06:11<19:59, 1.20s/it]
Training 1/1 epoch (loss 2.5993): 20%|ββ | 252/1250 [06:14<19:59, 1.20s/it]
Training 1/1 epoch (loss 2.5993): 20%|ββ | 253/1250 [06:14<25:26, 1.53s/it]
Training 1/1 epoch (loss 2.6995): 20%|ββ | 253/1250 [06:15<25:26, 1.53s/it]
Training 1/1 epoch (loss 2.6995): 20%|ββ | 254/1250 [06:15<24:21, 1.47s/it]
Training 1/1 epoch (loss 2.6003): 20%|ββ | 254/1250 [06:16<24:21, 1.47s/it]
Training 1/1 epoch (loss 2.6003): 20%|ββ | 255/1250 [06:16<20:01, 1.21s/it]
Training 1/1 epoch (loss 2.6789): 20%|ββ | 255/1250 [06:18<20:01, 1.21s/it]
Training 1/1 epoch (loss 2.6789): 20%|ββ | 256/1250 [06:18<23:43, 1.43s/it]
Training 1/1 epoch (loss 2.6560): 20%|ββ | 256/1250 [06:19<23:43, 1.43s/it]
Training 1/1 epoch (loss 2.6560): 21%|ββ | 257/1250 [06:19<22:12, 1.34s/it]
Training 1/1 epoch (loss 2.6079): 21%|ββ | 257/1250 [06:21<22:12, 1.34s/it]
Training 1/1 epoch (loss 2.6079): 21%|ββ | 258/1250 [06:21<27:57, 1.69s/it]
Training 1/1 epoch (loss 2.5981): 21%|ββ | 258/1250 [06:23<27:57, 1.69s/it]
Training 1/1 epoch (loss 2.5981): 21%|ββ | 259/1250 [06:23<30:06, 1.82s/it]
Training 1/1 epoch (loss 2.6281): 21%|ββ | 259/1250 [06:24<30:06, 1.82s/it]
Training 1/1 epoch (loss 2.6281): 21%|ββ | 260/1250 [06:24<23:18, 1.41s/it]
Training 1/1 epoch (loss 2.5308): 21%|ββ | 260/1250 [06:26<23:18, 1.41s/it]
Training 1/1 epoch (loss 2.5308): 21%|ββ | 261/1250 [06:26<25:59, 1.58s/it]
Training 1/1 epoch (loss 2.5638): 21%|ββ | 261/1250 [06:27<25:59, 1.58s/it]
Training 1/1 epoch (loss 2.5638): 21%|ββ | 262/1250 [06:27<24:09, 1.47s/it]
Training 1/1 epoch (loss 2.5609): 21%|ββ | 262/1250 [06:28<24:09, 1.47s/it]
Training 1/1 epoch (loss 2.5609): 21%|ββ | 263/1250 [06:28<20:22, 1.24s/it]
Training 1/1 epoch (loss 2.3152): 21%|ββ | 263/1250 [06:30<20:22, 1.24s/it]
Training 1/1 epoch (loss 2.3152): 21%|ββ | 264/1250 [06:30<25:35, 1.56s/it]
Training 1/1 epoch (loss 2.6688): 21%|ββ | 264/1250 [06:33<25:35, 1.56s/it]
Training 1/1 epoch (loss 2.6688): 21%|ββ | 265/1250 [06:33<30:05, 1.83s/it]
Training 1/1 epoch (loss 2.6196): 21%|ββ | 265/1250 [06:33<30:05, 1.83s/it]
Training 1/1 epoch (loss 2.6196): 21%|βββ | 266/1250 [06:33<24:30, 1.49s/it]
Training 1/1 epoch (loss 2.6876): 21%|βββ | 266/1250 [06:36<24:30, 1.49s/it]
Training 1/1 epoch (loss 2.6876): 21%|βββ | 267/1250 [06:36<28:57, 1.77s/it]
Training 1/1 epoch (loss 2.6644): 21%|βββ | 267/1250 [06:37<28:57, 1.77s/it]
Training 1/1 epoch (loss 2.6644): 21%|βββ | 268/1250 [06:37<26:51, 1.64s/it]
Training 1/1 epoch (loss 2.6001): 21%|βββ | 268/1250 [06:38<26:51, 1.64s/it]
Training 1/1 epoch (loss 2.6001): 22%|βββ | 269/1250 [06:38<21:25, 1.31s/it]
Training 1/1 epoch (loss 2.5625): 22%|βββ | 269/1250 [06:40<21:25, 1.31s/it]
Training 1/1 epoch (loss 2.5625): 22%|βββ | 270/1250 [06:40<26:07, 1.60s/it]
Training 1/1 epoch (loss 2.4593): 22%|βββ | 270/1250 [06:42<26:07, 1.60s/it]
Training 1/1 epoch (loss 2.4593): 22%|βββ | 271/1250 [06:42<26:37, 1.63s/it]
Training 1/1 epoch (loss 2.3700): 22%|βββ | 271/1250 [06:42<26:37, 1.63s/it]
Training 1/1 epoch (loss 2.3700): 22%|βββ | 272/1250 [06:42<23:08, 1.42s/it]
Training 1/1 epoch (loss 2.5599): 22%|βββ | 272/1250 [06:45<23:08, 1.42s/it]
Training 1/1 epoch (loss 2.5599): 22%|βββ | 273/1250 [06:45<26:44, 1.64s/it]
Training 1/1 epoch (loss 2.6491): 22%|βββ | 273/1250 [06:46<26:44, 1.64s/it]
Training 1/1 epoch (loss 2.6491): 22%|βββ | 274/1250 [06:46<24:37, 1.51s/it]
Training 1/1 epoch (loss 2.6812): 22%|βββ | 274/1250 [06:47<24:37, 1.51s/it]
Training 1/1 epoch (loss 2.6812): 22%|βββ | 275/1250 [06:47<21:38, 1.33s/it]
Training 1/1 epoch (loss 2.6862): 22%|βββ | 275/1250 [06:48<21:38, 1.33s/it]
Training 1/1 epoch (loss 2.6862): 22%|βββ | 276/1250 [06:48<22:27, 1.38s/it]
Training 1/1 epoch (loss 2.5880): 22%|βββ | 276/1250 [06:49<22:27, 1.38s/it]
Training 1/1 epoch (loss 2.5880): 22%|βββ | 277/1250 [06:49<19:43, 1.22s/it]
Training 1/1 epoch (loss 2.7165): 22%|βββ | 277/1250 [06:51<19:43, 1.22s/it]
Training 1/1 epoch (loss 2.7165): 22%|βββ | 278/1250 [06:51<22:40, 1.40s/it]
Training 1/1 epoch (loss 2.7984): 22%|βββ | 278/1250 [06:53<22:40, 1.40s/it]
Training 1/1 epoch (loss 2.7984): 22%|βββ | 279/1250 [06:53<26:01, 1.61s/it]
Training 1/1 epoch (loss 2.5445): 22%|βββ | 279/1250 [06:53<26:01, 1.61s/it]
Training 1/1 epoch (loss 2.5445): 22%|βββ | 280/1250 [06:53<20:34, 1.27s/it]
Training 1/1 epoch (loss 2.7172): 22%|βββ | 280/1250 [06:55<20:34, 1.27s/it]
Training 1/1 epoch (loss 2.7172): 22%|βββ | 281/1250 [06:55<22:12, 1.38s/it]
Training 1/1 epoch (loss 2.5838): 22%|βββ | 281/1250 [06:57<22:12, 1.38s/it]
Training 1/1 epoch (loss 2.5838): 23%|βββ | 282/1250 [06:57<24:52, 1.54s/it]
Training 1/1 epoch (loss 2.5956): 23%|βββ | 282/1250 [06:58<24:52, 1.54s/it]
Training 1/1 epoch (loss 2.5956): 23%|βββ | 283/1250 [06:58<20:31, 1.27s/it]
Training 1/1 epoch (loss 2.7378): 23%|βββ | 283/1250 [06:59<20:31, 1.27s/it]
Training 1/1 epoch (loss 2.7378): 23%|βββ | 284/1250 [06:59<22:29, 1.40s/it]
Training 1/1 epoch (loss 2.6273): 23%|βββ | 284/1250 [07:01<22:29, 1.40s/it]
Training 1/1 epoch (loss 2.6273): 23%|βββ | 285/1250 [07:01<23:03, 1.43s/it]
Training 1/1 epoch (loss 2.4468): 23%|βββ | 285/1250 [07:02<23:03, 1.43s/it]
Training 1/1 epoch (loss 2.4468): 23%|βββ | 286/1250 [07:02<21:48, 1.36s/it]
Training 1/1 epoch (loss 2.5782): 23%|βββ | 286/1250 [07:04<21:48, 1.36s/it]
Training 1/1 epoch (loss 2.5782): 23%|βββ | 287/1250 [07:04<22:53, 1.43s/it]
Training 1/1 epoch (loss 2.6086): 23%|βββ | 287/1250 [07:05<22:53, 1.43s/it]
Training 1/1 epoch (loss 2.6086): 23%|βββ | 288/1250 [07:05<23:59, 1.50s/it]
Training 1/1 epoch (loss 2.5192): 23%|βββ | 288/1250 [07:07<23:59, 1.50s/it]
Training 1/1 epoch (loss 2.5192): 23%|βββ | 289/1250 [07:07<23:46, 1.48s/it]
Training 1/1 epoch (loss 2.4951): 23%|βββ | 289/1250 [07:09<23:46, 1.48s/it]
Training 1/1 epoch (loss 2.4951): 23%|βββ | 290/1250 [07:09<28:13, 1.76s/it]
Training 1/1 epoch (loss 2.5562): 23%|βββ | 290/1250 [07:10<28:13, 1.76s/it]
Training 1/1 epoch (loss 2.5562): 23%|βββ | 291/1250 [07:10<24:00, 1.50s/it]
Training 1/1 epoch (loss 2.7603): 23%|βββ | 291/1250 [07:12<24:00, 1.50s/it]
Training 1/1 epoch (loss 2.7603): 23%|βββ | 292/1250 [07:12<25:48, 1.62s/it]
Training 1/1 epoch (loss 2.7137): 23%|βββ | 292/1250 [07:13<25:48, 1.62s/it]
Training 1/1 epoch (loss 2.7137): 23%|βββ | 293/1250 [07:13<24:19, 1.53s/it]
Training 1/1 epoch (loss 2.5779): 23%|βββ | 293/1250 [07:14<24:19, 1.53s/it]
Training 1/1 epoch (loss 2.5779): 24%|βββ | 294/1250 [07:14<19:34, 1.23s/it]
Training 1/1 epoch (loss 2.5600): 24%|βββ | 294/1250 [07:16<19:34, 1.23s/it]
Training 1/1 epoch (loss 2.5600): 24%|βββ | 295/1250 [07:16<22:56, 1.44s/it]
Training 1/1 epoch (loss 2.6078): 24%|βββ | 295/1250 [07:18<22:56, 1.44s/it]
Training 1/1 epoch (loss 2.6078): 24%|βββ | 296/1250 [07:18<25:20, 1.59s/it]
Training 1/1 epoch (loss 2.5779): 24%|βββ | 296/1250 [07:19<25:20, 1.59s/it]
Training 1/1 epoch (loss 2.5779): 24%|βββ | 297/1250 [07:19<24:04, 1.52s/it]
Training 1/1 epoch (loss 2.6552): 24%|βββ | 297/1250 [07:21<24:04, 1.52s/it]
Training 1/1 epoch (loss 2.6552): 24%|βββ | 298/1250 [07:21<24:55, 1.57s/it]
Training 1/1 epoch (loss 2.5353): 24%|βββ | 298/1250 [07:22<24:55, 1.57s/it]
Training 1/1 epoch (loss 2.5353): 24%|βββ | 299/1250 [07:22<21:09, 1.33s/it]
Training 1/1 epoch (loss 2.6761): 24%|βββ | 299/1250 [07:23<21:09, 1.33s/it]
Training 1/1 epoch (loss 2.6761): 24%|βββ | 300/1250 [07:23<21:01, 1.33s/it]
Training 1/1 epoch (loss 2.5656): 24%|βββ | 300/1250 [07:24<21:01, 1.33s/it]
Training 1/1 epoch (loss 2.5656): 24%|βββ | 301/1250 [07:24<21:38, 1.37s/it]
Training 1/1 epoch (loss 2.4238): 24%|βββ | 301/1250 [07:25<21:38, 1.37s/it]
Training 1/1 epoch (loss 2.4238): 24%|βββ | 302/1250 [07:25<17:27, 1.11s/it]
Training 1/1 epoch (loss 2.3293): 24%|βββ | 302/1250 [07:27<17:27, 1.11s/it]
Training 1/1 epoch (loss 2.3293): 24%|βββ | 303/1250 [07:27<23:31, 1.49s/it]
Training 1/1 epoch (loss 2.7249): 24%|βββ | 303/1250 [07:29<23:31, 1.49s/it]
Training 1/1 epoch (loss 2.7249): 24%|βββ | 304/1250 [07:29<27:20, 1.73s/it]
Training 1/1 epoch (loss 2.7654): 24%|βββ | 304/1250 [07:31<27:20, 1.73s/it]
Training 1/1 epoch (loss 2.7654): 24%|βββ | 305/1250 [07:31<24:43, 1.57s/it]
Training 1/1 epoch (loss 2.7577): 24%|βββ | 305/1250 [07:33<24:43, 1.57s/it]
Training 1/1 epoch (loss 2.7577): 24%|βββ | 306/1250 [07:33<29:09, 1.85s/it]
Training 1/1 epoch (loss 2.6189): 24%|βββ | 306/1250 [07:35<29:09, 1.85s/it]
Training 1/1 epoch (loss 2.6189): 25%|βββ | 307/1250 [07:35<27:41, 1.76s/it]
Training 1/1 epoch (loss 2.5382): 25%|βββ | 307/1250 [07:36<27:41, 1.76s/it]
Training 1/1 epoch (loss 2.5382): 25%|βββ | 308/1250 [07:36<26:11, 1.67s/it]
Training 1/1 epoch (loss 2.5121): 25%|βββ | 308/1250 [07:38<26:11, 1.67s/it]
Training 1/1 epoch (loss 2.5121): 25%|βββ | 309/1250 [07:38<26:33, 1.69s/it]
Training 1/1 epoch (loss 2.5264): 25%|βββ | 309/1250 [07:39<26:33, 1.69s/it]
Training 1/1 epoch (loss 2.5264): 25%|βββ | 310/1250 [07:39<22:56, 1.46s/it]
Training 1/1 epoch (loss 2.6667): 25%|βββ | 310/1250 [07:41<22:56, 1.46s/it]
Training 1/1 epoch (loss 2.6667): 25%|βββ | 311/1250 [07:41<24:36, 1.57s/it]
Training 1/1 epoch (loss 2.4802): 25%|βββ | 311/1250 [07:43<24:36, 1.57s/it]
Training 1/1 epoch (loss 2.4802): 25%|βββ | 312/1250 [07:43<27:37, 1.77s/it]
Training 1/1 epoch (loss 2.3413): 25%|βββ | 312/1250 [07:44<27:37, 1.77s/it]
Training 1/1 epoch (loss 2.3413): 25%|βββ | 313/1250 [07:44<22:13, 1.42s/it]
Training 1/1 epoch (loss 2.5338): 25%|βββ | 313/1250 [07:45<22:13, 1.42s/it]
Training 1/1 epoch (loss 2.5338): 25%|βββ | 314/1250 [07:45<22:20, 1.43s/it]
Training 1/1 epoch (loss 2.6837): 25%|βββ | 314/1250 [07:46<22:20, 1.43s/it]
Training 1/1 epoch (loss 2.6837): 25%|βββ | 315/1250 [07:46<20:56, 1.34s/it]
Training 1/1 epoch (loss 2.5422): 25%|βββ | 315/1250 [07:47<20:56, 1.34s/it]
Training 1/1 epoch (loss 2.5422): 25%|βββ | 316/1250 [07:47<17:21, 1.12s/it]
Training 1/1 epoch (loss 2.6326): 25%|βββ | 316/1250 [07:49<17:21, 1.12s/it]
Training 1/1 epoch (loss 2.6326): 25%|βββ | 317/1250 [07:49<21:11, 1.36s/it]
Training 1/1 epoch (loss 2.7725): 25%|βββ | 317/1250 [07:50<21:11, 1.36s/it]
Training 1/1 epoch (loss 2.7725): 25%|βββ | 318/1250 [07:50<22:23, 1.44s/it]
Training 1/1 epoch (loss 2.4994): 25%|βββ | 318/1250 [07:51<22:23, 1.44s/it]
Training 1/1 epoch (loss 2.4994): 26%|βββ | 319/1250 [07:51<19:00, 1.23s/it]
Training 1/1 epoch (loss 2.6697): 26%|βββ | 319/1250 [07:53<19:00, 1.23s/it]
Training 1/1 epoch (loss 2.6697): 26%|βββ | 320/1250 [07:53<24:49, 1.60s/it]
Training 1/1 epoch (loss 2.5395): 26%|βββ | 320/1250 [07:55<24:49, 1.60s/it]
Training 1/1 epoch (loss 2.5395): 26%|βββ | 321/1250 [07:55<23:37, 1.53s/it]
Training 1/1 epoch (loss 2.6640): 26%|βββ | 321/1250 [07:56<23:37, 1.53s/it]
Training 1/1 epoch (loss 2.6640): 26%|βββ | 322/1250 [07:56<21:38, 1.40s/it]
Training 1/1 epoch (loss 2.4870): 26%|βββ | 322/1250 [07:58<21:38, 1.40s/it]
Training 1/1 epoch (loss 2.4870): 26%|βββ | 323/1250 [07:58<24:05, 1.56s/it]
Training 1/1 epoch (loss 2.7707): 26%|βββ | 323/1250 [07:59<24:05, 1.56s/it]
Training 1/1 epoch (loss 2.7707): 26%|βββ | 324/1250 [07:59<20:04, 1.30s/it]
Training 1/1 epoch (loss 2.3809): 26%|βββ | 324/1250 [08:00<20:04, 1.30s/it]
Training 1/1 epoch (loss 2.3809): 26%|βββ | 325/1250 [08:00<20:13, 1.31s/it]
Training 1/1 epoch (loss 2.6545): 26%|βββ | 325/1250 [08:01<20:13, 1.31s/it]
Training 1/1 epoch (loss 2.6545): 26%|βββ | 326/1250 [08:01<20:10, 1.31s/it]
Training 1/1 epoch (loss 2.5101): 26%|βββ | 326/1250 [08:02<20:10, 1.31s/it]
Training 1/1 epoch (loss 2.5101): 26%|βββ | 327/1250 [08:02<17:25, 1.13s/it]
Training 1/1 epoch (loss 2.3084): 26%|βββ | 327/1250 [08:04<17:25, 1.13s/it]
Training 1/1 epoch (loss 2.3084): 26%|βββ | 328/1250 [08:04<21:04, 1.37s/it]
Training 1/1 epoch (loss 2.8567): 26%|βββ | 328/1250 [08:05<21:04, 1.37s/it]
Training 1/1 epoch (loss 2.8567): 26%|βββ | 329/1250 [08:05<22:22, 1.46s/it]
Training 1/1 epoch (loss 2.4954): 26%|βββ | 329/1250 [08:06<22:22, 1.46s/it]
Training 1/1 epoch (loss 2.4954): 26%|βββ | 330/1250 [08:06<19:41, 1.28s/it]
Training 1/1 epoch (loss 2.5785): 26%|βββ | 330/1250 [08:08<19:41, 1.28s/it]
Training 1/1 epoch (loss 2.5785): 26%|βββ | 331/1250 [08:08<19:02, 1.24s/it]
Training 1/1 epoch (loss 2.7224): 26%|βββ | 331/1250 [08:09<19:02, 1.24s/it]
Training 1/1 epoch (loss 2.7224): 27%|βββ | 332/1250 [08:09<18:13, 1.19s/it]
Training 1/1 epoch (loss 2.5375): 27%|βββ | 332/1250 [08:10<18:13, 1.19s/it]
Training 1/1 epoch (loss 2.5375): 27%|βββ | 333/1250 [08:10<19:48, 1.30s/it]
Training 1/1 epoch (loss 2.7098): 27%|βββ | 333/1250 [08:11<19:48, 1.30s/it]
Training 1/1 epoch (loss 2.7098): 27%|βββ | 334/1250 [08:11<20:07, 1.32s/it]
Training 1/1 epoch (loss 2.5793): 27%|βββ | 334/1250 [08:12<20:07, 1.32s/it]
Training 1/1 epoch (loss 2.5793): 27%|βββ | 335/1250 [08:12<16:49, 1.10s/it]
Training 1/1 epoch (loss 2.7352): 27%|βββ | 335/1250 [08:15<16:49, 1.10s/it]
Training 1/1 epoch (loss 2.7352): 27%|βββ | 336/1250 [08:15<24:28, 1.61s/it]
Training 1/1 epoch (loss 2.9033): 27%|βββ | 336/1250 [08:17<24:28, 1.61s/it]
Training 1/1 epoch (loss 2.9033): 27%|βββ | 337/1250 [08:17<26:59, 1.77s/it]
Training 1/1 epoch (loss 2.6548): 27%|βββ | 337/1250 [08:18<26:59, 1.77s/it]
Training 1/1 epoch (loss 2.6548): 27%|βββ | 338/1250 [08:18<21:12, 1.40s/it]
Training 1/1 epoch (loss 2.6761): 27%|βββ | 338/1250 [08:19<21:12, 1.40s/it]
Training 1/1 epoch (loss 2.6761): 27%|βββ | 339/1250 [08:19<23:39, 1.56s/it]
Training 1/1 epoch (loss 2.6127): 27%|βββ | 339/1250 [08:22<23:39, 1.56s/it]
Training 1/1 epoch (loss 2.6127): 27%|βββ | 340/1250 [08:22<27:18, 1.80s/it]
Training 1/1 epoch (loss 2.4530): 27%|βββ | 340/1250 [08:23<27:18, 1.80s/it]
Training 1/1 epoch (loss 2.4530): 27%|βββ | 341/1250 [08:23<22:23, 1.48s/it]
Training 1/1 epoch (loss 2.5442): 27%|βββ | 341/1250 [08:24<22:23, 1.48s/it]
Training 1/1 epoch (loss 2.5442): 27%|βββ | 342/1250 [08:24<21:29, 1.42s/it]
Training 1/1 epoch (loss 2.4876): 27%|βββ | 342/1250 [08:25<21:29, 1.42s/it]
Training 1/1 epoch (loss 2.4876): 27%|βββ | 343/1250 [08:25<21:10, 1.40s/it]
Training 1/1 epoch (loss 2.6410): 27%|βββ | 343/1250 [08:26<21:10, 1.40s/it]
Training 1/1 epoch (loss 2.6410): 28%|βββ | 344/1250 [08:26<17:27, 1.16s/it]
Training 1/1 epoch (loss 2.5860): 28%|βββ | 344/1250 [08:28<17:27, 1.16s/it]
Training 1/1 epoch (loss 2.5860): 28%|βββ | 345/1250 [08:28<22:28, 1.49s/it]
Training 1/1 epoch (loss 2.5541): 28%|βββ | 345/1250 [08:30<22:28, 1.49s/it]
Training 1/1 epoch (loss 2.5541): 28%|βββ | 346/1250 [08:30<22:55, 1.52s/it]
Training 1/1 epoch (loss 2.6330): 28%|βββ | 346/1250 [08:30<22:55, 1.52s/it]
Training 1/1 epoch (loss 2.6330): 28%|βββ | 347/1250 [08:30<19:15, 1.28s/it]
Training 1/1 epoch (loss 2.7124): 28%|βββ | 347/1250 [08:32<19:15, 1.28s/it]
Training 1/1 epoch (loss 2.7124): 28%|βββ | 348/1250 [08:32<20:38, 1.37s/it]
Training 1/1 epoch (loss 2.5615): 28%|βββ | 348/1250 [08:34<20:38, 1.37s/it]
Training 1/1 epoch (loss 2.5615): 28%|βββ | 349/1250 [08:34<24:11, 1.61s/it]
Training 1/1 epoch (loss 2.6647): 28%|βββ | 349/1250 [08:35<24:11, 1.61s/it]
Training 1/1 epoch (loss 2.6647): 28%|βββ | 350/1250 [08:35<20:26, 1.36s/it]
Training 1/1 epoch (loss 2.5803): 28%|βββ | 350/1250 [08:37<20:26, 1.36s/it]
Training 1/1 epoch (loss 2.5803): 28%|βββ | 351/1250 [08:37<22:13, 1.48s/it]
Training 1/1 epoch (loss 2.7046): 28%|βββ | 351/1250 [08:38<22:13, 1.48s/it]
Training 1/1 epoch (loss 2.7046): 28%|βββ | 352/1250 [08:38<21:52, 1.46s/it]
Training 1/1 epoch (loss 2.7857): 28%|βββ | 352/1250 [08:40<21:52, 1.46s/it]
Training 1/1 epoch (loss 2.7857): 28%|βββ | 353/1250 [08:40<22:34, 1.51s/it]
Training 1/1 epoch (loss 2.6357): 28%|βββ | 353/1250 [08:42<22:34, 1.51s/it]
Training 1/1 epoch (loss 2.6357): 28%|βββ | 354/1250 [08:42<24:15, 1.62s/it]
Training 1/1 epoch (loss 2.4520): 28%|βββ | 354/1250 [08:43<24:15, 1.62s/it]
Training 1/1 epoch (loss 2.4520): 28%|βββ | 355/1250 [08:43<21:38, 1.45s/it]
Training 1/1 epoch (loss 2.5976): 28%|βββ | 355/1250 [08:44<21:38, 1.45s/it]
Training 1/1 epoch (loss 2.5976): 28%|βββ | 356/1250 [08:44<21:54, 1.47s/it]
Training 1/1 epoch (loss 2.7423): 28%|βββ | 356/1250 [08:46<21:54, 1.47s/it]
Training 1/1 epoch (loss 2.7423): 29%|βββ | 357/1250 [08:46<22:32, 1.52s/it]
Training 1/1 epoch (loss 2.7117): 29%|βββ | 357/1250 [08:46<22:32, 1.52s/it]
Training 1/1 epoch (loss 2.7117): 29%|βββ | 358/1250 [08:46<17:53, 1.20s/it]
Training 1/1 epoch (loss 2.6029): 29%|βββ | 358/1250 [08:48<17:53, 1.20s/it]
Training 1/1 epoch (loss 2.6029): 29%|βββ | 359/1250 [08:48<20:32, 1.38s/it]
Training 1/1 epoch (loss 2.4152): 29%|βββ | 359/1250 [08:50<20:32, 1.38s/it]
Training 1/1 epoch (loss 2.4152): 29%|βββ | 360/1250 [08:50<20:50, 1.40s/it]
Training 1/1 epoch (loss 2.5098): 29%|βββ | 360/1250 [08:50<20:50, 1.40s/it]
Training 1/1 epoch (loss 2.5098): 29%|βββ | 361/1250 [08:50<17:53, 1.21s/it]
Training 1/1 epoch (loss 2.6053): 29%|βββ | 361/1250 [08:52<17:53, 1.21s/it]
Training 1/1 epoch (loss 2.6053): 29%|βββ | 362/1250 [08:52<22:10, 1.50s/it]
Training 1/1 epoch (loss 2.6451): 29%|βββ | 362/1250 [08:54<22:10, 1.50s/it]
Training 1/1 epoch (loss 2.6451): 29%|βββ | 363/1250 [08:54<21:41, 1.47s/it]
Training 1/1 epoch (loss 2.7097): 29%|βββ | 363/1250 [08:55<21:41, 1.47s/it]
Training 1/1 epoch (loss 2.7097): 29%|βββ | 364/1250 [08:55<18:39, 1.26s/it]
Training 1/1 epoch (loss 2.5016): 29%|βββ | 364/1250 [08:57<18:39, 1.26s/it]
Training 1/1 epoch (loss 2.5016): 29%|βββ | 365/1250 [08:57<22:17, 1.51s/it]
Training 1/1 epoch (loss 2.3725): 29%|βββ | 365/1250 [08:58<22:17, 1.51s/it]
Training 1/1 epoch (loss 2.3725): 29%|βββ | 366/1250 [08:58<23:26, 1.59s/it]
Training 1/1 epoch (loss 2.4695): 29%|βββ | 366/1250 [08:59<23:26, 1.59s/it]
Training 1/1 epoch (loss 2.4695): 29%|βββ | 367/1250 [08:59<19:51, 1.35s/it]
Training 1/1 epoch (loss 2.5351): 29%|βββ | 367/1250 [09:01<19:51, 1.35s/it]
Training 1/1 epoch (loss 2.5351): 29%|βββ | 368/1250 [09:01<19:45, 1.34s/it]
Training 1/1 epoch (loss 2.6431): 29%|βββ | 368/1250 [09:02<19:45, 1.34s/it]
Training 1/1 epoch (loss 2.6431): 30%|βββ | 369/1250 [09:02<18:02, 1.23s/it]
Training 1/1 epoch (loss 2.7314): 30%|βββ | 369/1250 [09:04<18:02, 1.23s/it]
Training 1/1 epoch (loss 2.7314): 30%|βββ | 370/1250 [09:04<22:41, 1.55s/it]
Training 1/1 epoch (loss 2.5126): 30%|βββ | 370/1250 [09:06<22:41, 1.55s/it]
Training 1/1 epoch (loss 2.5126): 30%|βββ | 371/1250 [09:06<24:31, 1.67s/it]
Training 1/1 epoch (loss 2.5577): 30%|βββ | 371/1250 [09:07<24:31, 1.67s/it]
Training 1/1 epoch (loss 2.5577): 30%|βββ | 372/1250 [09:07<20:49, 1.42s/it]
Training 1/1 epoch (loss 2.5325): 30%|βββ | 372/1250 [09:08<20:49, 1.42s/it]
Training 1/1 epoch (loss 2.5325): 30%|βββ | 373/1250 [09:08<20:39, 1.41s/it]
Training 1/1 epoch (loss 2.3176): 30%|βββ | 373/1250 [09:11<20:39, 1.41s/it]
Training 1/1 epoch (loss 2.3176): 30%|βββ | 374/1250 [09:11<25:10, 1.72s/it]
Training 1/1 epoch (loss 2.6262): 30%|βββ | 374/1250 [09:11<25:10, 1.72s/it]
Training 1/1 epoch (loss 2.6262): 30%|βββ | 375/1250 [09:11<20:00, 1.37s/it]
Training 1/1 epoch (loss 2.7935): 30%|βββ | 375/1250 [09:14<20:00, 1.37s/it]
Training 1/1 epoch (loss 2.7935): 30%|βββ | 376/1250 [09:14<25:21, 1.74s/it]
Training 1/1 epoch (loss 2.7203): 30%|βββ | 376/1250 [09:15<25:21, 1.74s/it]
Training 1/1 epoch (loss 2.7203): 30%|βββ | 377/1250 [09:15<23:34, 1.62s/it]
Training 1/1 epoch (loss 2.5729): 30%|βββ | 377/1250 [09:16<23:34, 1.62s/it]
Training 1/1 epoch (loss 2.5729): 30%|βββ | 378/1250 [09:16<21:32, 1.48s/it]
Training 1/1 epoch (loss 2.4096): 30%|βββ | 378/1250 [09:18<21:32, 1.48s/it]
Training 1/1 epoch (loss 2.4096): 30%|βββ | 379/1250 [09:18<23:12, 1.60s/it]
Training 1/1 epoch (loss 2.6214): 30%|βββ | 379/1250 [09:19<23:12, 1.60s/it]
Training 1/1 epoch (loss 2.6214): 30%|βββ | 380/1250 [09:19<20:59, 1.45s/it]
Training 1/1 epoch (loss 2.6523): 30%|βββ | 380/1250 [09:21<20:59, 1.45s/it]
Training 1/1 epoch (loss 2.6523): 30%|βββ | 381/1250 [09:21<20:43, 1.43s/it]
Training 1/1 epoch (loss 2.7056): 30%|βββ | 381/1250 [09:22<20:43, 1.43s/it]
Training 1/1 epoch (loss 2.7056): 31%|βββ | 382/1250 [09:22<20:32, 1.42s/it]
Training 1/1 epoch (loss 2.5801): 31%|βββ | 382/1250 [09:23<20:32, 1.42s/it]
Training 1/1 epoch (loss 2.5801): 31%|βββ | 383/1250 [09:23<17:36, 1.22s/it]
Training 1/1 epoch (loss 2.4524): 31%|βββ | 383/1250 [09:25<17:36, 1.22s/it]
Training 1/1 epoch (loss 2.4524): 31%|βββ | 384/1250 [09:25<23:37, 1.64s/it]
Training 1/1 epoch (loss 2.5614): 31%|βββ | 384/1250 [09:27<23:37, 1.64s/it]
Training 1/1 epoch (loss 2.5614): 31%|βββ | 385/1250 [09:27<23:10, 1.61s/it]
Training 1/1 epoch (loss 2.4120): 31%|βββ | 385/1250 [09:28<23:10, 1.61s/it]
Training 1/1 epoch (loss 2.4120): 31%|βββ | 386/1250 [09:28<20:31, 1.43s/it]
Training 1/1 epoch (loss 2.4607): 31%|βββ | 386/1250 [09:29<20:31, 1.43s/it]
Training 1/1 epoch (loss 2.4607): 31%|βββ | 387/1250 [09:29<21:08, 1.47s/it]
Training 1/1 epoch (loss 2.5548): 31%|βββ | 387/1250 [09:32<21:08, 1.47s/it]
Training 1/1 epoch (loss 2.5548): 31%|βββ | 388/1250 [09:32<25:35, 1.78s/it]
Training 1/1 epoch (loss 2.6726): 31%|βββ | 388/1250 [09:33<25:35, 1.78s/it]
Training 1/1 epoch (loss 2.6726): 31%|βββ | 389/1250 [09:33<21:37, 1.51s/it]
Training 1/1 epoch (loss 2.5596): 31%|βββ | 389/1250 [09:34<21:37, 1.51s/it]
Training 1/1 epoch (loss 2.5596): 31%|βββ | 390/1250 [09:34<22:12, 1.55s/it]
Training 1/1 epoch (loss 2.7853): 31%|βββ | 390/1250 [09:36<22:12, 1.55s/it]
Training 1/1 epoch (loss 2.7853): 31%|ββββ | 391/1250 [09:36<22:01, 1.54s/it]
Training 1/1 epoch (loss 2.6251): 31%|ββββ | 391/1250 [09:38<22:01, 1.54s/it]
Training 1/1 epoch (loss 2.6251): 31%|ββββ | 392/1250 [09:38<25:42, 1.80s/it]
Training 1/1 epoch (loss 2.6902): 31%|ββββ | 392/1250 [09:40<25:42, 1.80s/it]
Training 1/1 epoch (loss 2.6902): 31%|ββββ | 393/1250 [09:40<23:59, 1.68s/it]
Training 1/1 epoch (loss 2.4547): 31%|ββββ | 393/1250 [09:40<23:59, 1.68s/it]
Training 1/1 epoch (loss 2.4547): 32%|ββββ | 394/1250 [09:40<18:51, 1.32s/it]
Training 1/1 epoch (loss 2.3229): 32%|ββββ | 394/1250 [09:42<18:51, 1.32s/it]
Training 1/1 epoch (loss 2.3229): 32%|ββββ | 395/1250 [09:42<20:20, 1.43s/it]
Training 1/1 epoch (loss 2.5291): 32%|ββββ | 395/1250 [09:44<20:20, 1.43s/it]
Training 1/1 epoch (loss 2.5291): 32%|ββββ | 396/1250 [09:44<24:04, 1.69s/it]
Training 1/1 epoch (loss 2.3587): 32%|ββββ | 396/1250 [09:45<24:04, 1.69s/it]
Training 1/1 epoch (loss 2.3587): 32%|ββββ | 397/1250 [09:45<20:53, 1.47s/it]
Training 1/1 epoch (loss 2.3998): 32%|ββββ | 397/1250 [09:47<20:53, 1.47s/it]
Training 1/1 epoch (loss 2.3998): 32%|ββββ | 398/1250 [09:47<23:49, 1.68s/it]
Training 1/1 epoch (loss 2.6377): 32%|ββββ | 398/1250 [09:48<23:49, 1.68s/it]
Training 1/1 epoch (loss 2.6377): 32%|ββββ | 399/1250 [09:48<20:42, 1.46s/it]
Training 1/1 epoch (loss 2.6170): 32%|ββββ | 399/1250 [09:49<20:42, 1.46s/it]
Training 1/1 epoch (loss 2.6170): 32%|ββββ | 400/1250 [09:49<19:29, 1.38s/it]
Training 1/1 epoch (loss 2.4228): 32%|ββββ | 400/1250 [09:51<19:29, 1.38s/it]
Training 1/1 epoch (loss 2.4228): 32%|ββββ | 401/1250 [09:51<18:52, 1.33s/it]
Training 1/1 epoch (loss 2.7312): 32%|ββββ | 401/1250 [09:52<18:52, 1.33s/it]
Training 1/1 epoch (loss 2.7312): 32%|ββββ | 402/1250 [09:52<18:18, 1.30s/it]
Training 1/1 epoch (loss 2.6724): 32%|ββββ | 402/1250 [09:53<18:18, 1.30s/it]
Training 1/1 epoch (loss 2.6724): 32%|ββββ | 403/1250 [09:53<16:53, 1.20s/it]
Training 1/1 epoch (loss 2.4254): 32%|ββββ | 403/1250 [09:55<16:53, 1.20s/it]
Training 1/1 epoch (loss 2.4254): 32%|ββββ | 404/1250 [09:55<19:40, 1.40s/it]
Training 1/1 epoch (loss 2.5950): 32%|ββββ | 404/1250 [09:55<19:40, 1.40s/it]
Training 1/1 epoch (loss 2.5950): 32%|ββββ | 405/1250 [09:55<16:29, 1.17s/it]
Training 1/1 epoch (loss 2.5874): 32%|ββββ | 405/1250 [09:57<16:29, 1.17s/it]
Training 1/1 epoch (loss 2.5874): 32%|ββββ | 406/1250 [09:57<18:54, 1.34s/it]
Training 1/1 epoch (loss 2.4829): 32%|ββββ | 406/1250 [09:59<18:54, 1.34s/it]
Training 1/1 epoch (loss 2.4829): 33%|ββββ | 407/1250 [09:59<22:34, 1.61s/it]
Training 1/1 epoch (loss 2.6153): 33%|ββββ | 407/1250 [10:00<22:34, 1.61s/it]
Training 1/1 epoch (loss 2.6153): 33%|ββββ | 408/1250 [10:00<19:08, 1.36s/it]
Training 1/1 epoch (loss 2.7196): 33%|ββββ | 408/1250 [10:02<19:08, 1.36s/it]
Training 1/1 epoch (loss 2.7196): 33%|ββββ | 409/1250 [10:02<19:12, 1.37s/it]
Training 1/1 epoch (loss 2.7338): 33%|ββββ | 409/1250 [10:03<19:12, 1.37s/it]
Training 1/1 epoch (loss 2.7338): 33%|ββββ | 410/1250 [10:03<19:02, 1.36s/it]
Training 1/1 epoch (loss 2.5394): 33%|ββββ | 410/1250 [10:04<19:02, 1.36s/it]
Training 1/1 epoch (loss 2.5394): 33%|ββββ | 411/1250 [10:04<16:05, 1.15s/it]
Training 1/1 epoch (loss 2.3797): 33%|ββββ | 411/1250 [10:06<16:05, 1.15s/it]
Training 1/1 epoch (loss 2.3797): 33%|ββββ | 412/1250 [10:06<20:35, 1.47s/it]
Training 1/1 epoch (loss 2.5171): 33%|ββββ | 412/1250 [10:08<20:35, 1.47s/it]
Training 1/1 epoch (loss 2.5171): 33%|ββββ | 413/1250 [10:08<24:57, 1.79s/it]
Training 1/1 epoch (loss 2.6820): 33%|ββββ | 413/1250 [10:09<24:57, 1.79s/it]
Training 1/1 epoch (loss 2.6820): 33%|ββββ | 414/1250 [10:09<19:18, 1.39s/it]
Training 1/1 epoch (loss 2.5399): 33%|ββββ | 414/1250 [10:10<19:18, 1.39s/it]
Training 1/1 epoch (loss 2.5399): 33%|ββββ | 415/1250 [10:10<20:46, 1.49s/it]
Training 1/1 epoch (loss 2.6981): 33%|ββββ | 415/1250 [10:13<20:46, 1.49s/it]
Training 1/1 epoch (loss 2.6981): 33%|ββββ | 416/1250 [10:13<23:40, 1.70s/it]
Training 1/1 epoch (loss 2.7274): 33%|ββββ | 416/1250 [10:14<23:40, 1.70s/it]
Training 1/1 epoch (loss 2.7274): 33%|ββββ | 417/1250 [10:14<21:45, 1.57s/it]
Training 1/1 epoch (loss 2.6189): 33%|ββββ | 417/1250 [10:16<21:45, 1.57s/it]
Training 1/1 epoch (loss 2.6189): 33%|ββββ | 418/1250 [10:16<22:36, 1.63s/it]
Training 1/1 epoch (loss 2.7220): 33%|ββββ | 418/1250 [10:17<22:36, 1.63s/it]
Training 1/1 epoch (loss 2.7220): 34%|ββββ | 419/1250 [10:17<19:33, 1.41s/it]
Training 1/1 epoch (loss 2.6373): 34%|ββββ | 419/1250 [10:18<19:33, 1.41s/it]
Training 1/1 epoch (loss 2.6373): 34%|ββββ | 420/1250 [10:18<20:26, 1.48s/it]
Training 1/1 epoch (loss 2.5842): 34%|ββββ | 420/1250 [10:20<20:26, 1.48s/it]
Training 1/1 epoch (loss 2.5842): 34%|ββββ | 421/1250 [10:20<22:33, 1.63s/it]
Training 1/1 epoch (loss 2.5287): 34%|ββββ | 421/1250 [10:21<22:33, 1.63s/it]
Training 1/1 epoch (loss 2.5287): 34%|ββββ | 422/1250 [10:21<18:09, 1.32s/it]
Training 1/1 epoch (loss 2.5558): 34%|ββββ | 422/1250 [10:23<18:09, 1.32s/it]
Training 1/1 epoch (loss 2.5558): 34%|ββββ | 423/1250 [10:23<22:31, 1.63s/it]
Training 1/1 epoch (loss 2.5308): 34%|ββββ | 423/1250 [10:26<22:31, 1.63s/it]
Training 1/1 epoch (loss 2.5308): 34%|ββββ | 424/1250 [10:26<25:34, 1.86s/it]
Training 1/1 epoch (loss 2.7111): 34%|ββββ | 424/1250 [10:26<25:34, 1.86s/it]
Training 1/1 epoch (loss 2.7111): 34%|ββββ | 425/1250 [10:26<21:23, 1.56s/it]
Training 1/1 epoch (loss 2.7780): 34%|ββββ | 425/1250 [10:28<21:23, 1.56s/it]
Training 1/1 epoch (loss 2.7780): 34%|ββββ | 426/1250 [10:28<22:20, 1.63s/it]
Training 1/1 epoch (loss 2.5859): 34%|ββββ | 426/1250 [10:30<22:20, 1.63s/it]
Training 1/1 epoch (loss 2.5859): 34%|ββββ | 427/1250 [10:30<21:48, 1.59s/it]
Training 1/1 epoch (loss 2.5363): 34%|ββββ | 427/1250 [10:30<21:48, 1.59s/it]
Training 1/1 epoch (loss 2.5363): 34%|ββββ | 428/1250 [10:30<18:31, 1.35s/it]
Training 1/1 epoch (loss 2.5184): 34%|ββββ | 428/1250 [10:32<18:31, 1.35s/it]
Training 1/1 epoch (loss 2.5184): 34%|ββββ | 429/1250 [10:32<20:54, 1.53s/it]
Training 1/1 epoch (loss 2.4157): 34%|ββββ | 429/1250 [10:34<20:54, 1.53s/it]
Training 1/1 epoch (loss 2.4157): 34%|ββββ | 430/1250 [10:34<20:18, 1.49s/it]
Training 1/1 epoch (loss 2.5697): 34%|ββββ | 430/1250 [10:35<20:18, 1.49s/it]
Training 1/1 epoch (loss 2.5697): 34%|ββββ | 431/1250 [10:35<19:41, 1.44s/it]
Training 1/1 epoch (loss 2.6282): 34%|ββββ | 431/1250 [10:37<19:41, 1.44s/it]
Training 1/1 epoch (loss 2.6282): 35%|ββββ | 432/1250 [10:37<22:55, 1.68s/it]
Training 1/1 epoch (loss 2.5365): 35%|ββββ | 432/1250 [10:38<22:55, 1.68s/it]
Training 1/1 epoch (loss 2.5365): 35%|ββββ | 433/1250 [10:38<18:47, 1.38s/it]
Training 1/1 epoch (loss 2.6741): 35%|ββββ | 433/1250 [10:40<18:47, 1.38s/it]
Training 1/1 epoch (loss 2.6741): 35%|ββββ | 434/1250 [10:40<20:43, 1.52s/it]
Training 1/1 epoch (loss 2.6643): 35%|ββββ | 434/1250 [10:41<20:43, 1.52s/it]
Training 1/1 epoch (loss 2.6643): 35%|ββββ | 435/1250 [10:41<18:45, 1.38s/it]
Training 1/1 epoch (loss 2.7269): 35%|ββββ | 435/1250 [10:41<18:45, 1.38s/it]
Training 1/1 epoch (loss 2.7269): 35%|ββββ | 436/1250 [10:41<15:07, 1.11s/it]
Training 1/1 epoch (loss 2.5342): 35%|ββββ | 436/1250 [10:43<15:07, 1.11s/it]
Training 1/1 epoch (loss 2.5342): 35%|ββββ | 437/1250 [10:43<18:50, 1.39s/it]
Training 1/1 epoch (loss 2.5106): 35%|ββββ | 437/1250 [10:45<18:50, 1.39s/it]
Training 1/1 epoch (loss 2.5106): 35%|ββββ | 438/1250 [10:45<19:04, 1.41s/it]
Training 1/1 epoch (loss 2.6423): 35%|ββββ | 438/1250 [10:46<19:04, 1.41s/it]
Training 1/1 epoch (loss 2.6423): 35%|ββββ | 439/1250 [10:46<16:15, 1.20s/it]
Training 1/1 epoch (loss 2.9034): 35%|ββββ | 439/1250 [10:48<16:15, 1.20s/it]
Training 1/1 epoch (loss 2.9034): 35%|ββββ | 440/1250 [10:48<19:55, 1.48s/it]
Training 1/1 epoch (loss 2.4833): 35%|ββββ | 440/1250 [10:49<19:55, 1.48s/it]
Training 1/1 epoch (loss 2.4833): 35%|ββββ | 441/1250 [10:49<19:51, 1.47s/it]
Training 1/1 epoch (loss 2.5617): 35%|ββββ | 441/1250 [10:51<19:51, 1.47s/it]
Training 1/1 epoch (loss 2.5617): 35%|ββββ | 442/1250 [10:51<21:24, 1.59s/it]
Training 1/1 epoch (loss 2.5330): 35%|ββββ | 442/1250 [10:53<21:24, 1.59s/it]
Training 1/1 epoch (loss 2.5330): 35%|ββββ | 443/1250 [10:53<22:00, 1.64s/it]
Training 1/1 epoch (loss 2.7666): 35%|ββββ | 443/1250 [10:53<22:00, 1.64s/it]
Training 1/1 epoch (loss 2.7666): 36%|ββββ | 444/1250 [10:53<17:09, 1.28s/it]
Training 1/1 epoch (loss 2.5395): 36%|ββββ | 444/1250 [10:56<17:09, 1.28s/it]
Training 1/1 epoch (loss 2.5395): 36%|ββββ | 445/1250 [10:56<21:28, 1.60s/it]
Training 1/1 epoch (loss 2.4934): 36%|ββββ | 445/1250 [10:57<21:28, 1.60s/it]
Training 1/1 epoch (loss 2.4934): 36%|ββββ | 446/1250 [10:57<21:58, 1.64s/it]
Training 1/1 epoch (loss 2.5582): 36%|ββββ | 446/1250 [10:58<21:58, 1.64s/it]
Training 1/1 epoch (loss 2.5582): 36%|ββββ | 447/1250 [10:58<18:07, 1.35s/it]
Training 1/1 epoch (loss 2.4085): 36%|ββββ | 447/1250 [11:00<18:07, 1.35s/it]
Training 1/1 epoch (loss 2.4085): 36%|ββββ | 448/1250 [11:00<20:40, 1.55s/it]
Training 1/1 epoch (loss 2.6301): 36%|ββββ | 448/1250 [11:01<20:40, 1.55s/it]
Training 1/1 epoch (loss 2.6301): 36%|ββββ | 449/1250 [11:01<18:55, 1.42s/it]
Training 1/1 epoch (loss 2.5570): 36%|ββββ | 449/1250 [11:03<18:55, 1.42s/it]
Training 1/1 epoch (loss 2.5570): 36%|ββββ | 450/1250 [11:03<20:16, 1.52s/it]
Training 1/1 epoch (loss 2.5005): 36%|ββββ | 450/1250 [11:05<20:16, 1.52s/it]
Training 1/1 epoch (loss 2.5005): 36%|ββββ | 451/1250 [11:05<24:09, 1.81s/it]
Training 1/1 epoch (loss 2.5036): 36%|ββββ | 451/1250 [11:06<24:09, 1.81s/it]
Training 1/1 epoch (loss 2.5036): 36%|ββββ | 452/1250 [11:06<20:30, 1.54s/it]
Training 1/1 epoch (loss 2.6251): 36%|ββββ | 452/1250 [11:08<20:30, 1.54s/it]
Training 1/1 epoch (loss 2.6251): 36%|ββββ | 453/1250 [11:08<22:32, 1.70s/it]
Training 1/1 epoch (loss 2.6079): 36%|ββββ | 453/1250 [11:11<22:32, 1.70s/it]
Training 1/1 epoch (loss 2.6079): 36%|ββββ | 454/1250 [11:11<25:38, 1.93s/it]
Training 1/1 epoch (loss 2.5340): 36%|ββββ | 454/1250 [11:12<25:38, 1.93s/it]
Training 1/1 epoch (loss 2.5340): 36%|ββββ | 455/1250 [11:12<20:20, 1.54s/it]
Training 1/1 epoch (loss 2.7243): 36%|ββββ | 455/1250 [11:13<20:20, 1.54s/it]
Training 1/1 epoch (loss 2.7243): 36%|ββββ | 456/1250 [11:13<22:01, 1.66s/it]
Training 1/1 epoch (loss 2.7251): 36%|ββββ | 456/1250 [11:15<22:01, 1.66s/it]
Training 1/1 epoch (loss 2.7251): 37%|ββββ | 457/1250 [11:15<22:47, 1.72s/it]
Training 1/1 epoch (loss 2.5736): 37%|ββββ | 457/1250 [11:17<22:47, 1.72s/it]
Training 1/1 epoch (loss 2.5736): 37%|ββββ | 458/1250 [11:17<21:59, 1.67s/it]
Training 1/1 epoch (loss 2.8168): 37%|ββββ | 458/1250 [11:18<21:59, 1.67s/it]
Training 1/1 epoch (loss 2.8168): 37%|ββββ | 459/1250 [11:18<19:01, 1.44s/it]
Training 1/1 epoch (loss 2.6727): 37%|ββββ | 459/1250 [11:19<19:01, 1.44s/it]
Training 1/1 epoch (loss 2.6727): 37%|ββββ | 460/1250 [11:19<17:49, 1.35s/it]
Training 1/1 epoch (loss 2.4602): 37%|ββββ | 460/1250 [11:20<17:49, 1.35s/it]
Training 1/1 epoch (loss 2.4602): 37%|ββββ | 461/1250 [11:20<17:24, 1.32s/it]
Training 1/1 epoch (loss 2.5269): 37%|ββββ | 461/1250 [11:21<17:24, 1.32s/it]
Training 1/1 epoch (loss 2.5269): 37%|ββββ | 462/1250 [11:21<16:17, 1.24s/it]
Training 1/1 epoch (loss 2.6007): 37%|ββββ | 462/1250 [11:23<16:17, 1.24s/it]
Training 1/1 epoch (loss 2.6007): 37%|ββββ | 463/1250 [11:23<16:50, 1.28s/it]
Training 1/1 epoch (loss 2.4682): 37%|ββββ | 463/1250 [11:24<16:50, 1.28s/it]
Training 1/1 epoch (loss 2.4682): 37%|ββββ | 464/1250 [11:24<16:47, 1.28s/it]
Training 1/1 epoch (loss 2.3068): 37%|ββββ | 464/1250 [11:26<16:47, 1.28s/it]
Training 1/1 epoch (loss 2.3068): 37%|ββββ | 465/1250 [11:26<20:47, 1.59s/it]
Training 1/1 epoch (loss 2.3991): 37%|ββββ | 465/1250 [11:27<20:47, 1.59s/it]
Training 1/1 epoch (loss 2.3991): 37%|ββββ | 466/1250 [11:27<17:23, 1.33s/it]
Training 1/1 epoch (loss 2.4429): 37%|ββββ | 466/1250 [11:29<17:23, 1.33s/it]
Training 1/1 epoch (loss 2.4429): 37%|ββββ | 467/1250 [11:29<19:38, 1.51s/it]
Training 1/1 epoch (loss 2.5941): 37%|ββββ | 467/1250 [11:31<19:38, 1.51s/it]
Training 1/1 epoch (loss 2.5941): 37%|ββββ | 468/1250 [11:31<20:27, 1.57s/it]
Training 1/1 epoch (loss 2.6405): 37%|ββββ | 468/1250 [11:31<20:27, 1.57s/it]
Training 1/1 epoch (loss 2.6405): 38%|ββββ | 469/1250 [11:31<16:24, 1.26s/it]
Training 1/1 epoch (loss 2.6718): 38%|ββββ | 469/1250 [11:32<16:24, 1.26s/it]
Training 1/1 epoch (loss 2.6718): 38%|ββββ | 470/1250 [11:32<16:37, 1.28s/it]
Training 1/1 epoch (loss 2.7111): 38%|ββββ | 470/1250 [11:34<16:37, 1.28s/it]
Training 1/1 epoch (loss 2.7111): 38%|ββββ | 471/1250 [11:34<18:31, 1.43s/it]
Training 1/1 epoch (loss 2.4984): 38%|ββββ | 471/1250 [11:35<18:31, 1.43s/it]
Training 1/1 epoch (loss 2.4984): 38%|ββββ | 472/1250 [11:35<17:03, 1.32s/it]
Training 1/1 epoch (loss 2.5212): 38%|ββββ | 472/1250 [11:37<17:03, 1.32s/it]
Training 1/1 epoch (loss 2.5212): 38%|ββββ | 473/1250 [11:37<18:44, 1.45s/it]
Training 1/1 epoch (loss 2.4705): 38%|ββββ | 473/1250 [11:38<18:44, 1.45s/it]
Training 1/1 epoch (loss 2.4705): 38%|ββββ | 474/1250 [11:38<17:27, 1.35s/it]
Training 1/1 epoch (loss 2.5481): 38%|ββββ | 474/1250 [11:39<17:27, 1.35s/it]
Training 1/1 epoch (loss 2.5481): 38%|ββββ | 475/1250 [11:39<14:45, 1.14s/it]
Training 1/1 epoch (loss 2.6578): 38%|ββββ | 475/1250 [11:40<14:45, 1.14s/it]
Training 1/1 epoch (loss 2.6578): 38%|ββββ | 476/1250 [11:40<16:50, 1.31s/it]
Training 1/1 epoch (loss 2.6011): 38%|ββββ | 476/1250 [11:42<16:50, 1.31s/it]
Training 1/1 epoch (loss 2.6011): 38%|ββββ | 477/1250 [11:42<18:12, 1.41s/it]
Training 1/1 epoch (loss 2.7581): 38%|ββββ | 477/1250 [11:44<18:12, 1.41s/it]
Training 1/1 epoch (loss 2.7581): 38%|ββββ | 478/1250 [11:44<18:19, 1.42s/it]
Training 1/1 epoch (loss 2.5347): 38%|ββββ | 478/1250 [11:45<18:19, 1.42s/it]
Training 1/1 epoch (loss 2.5347): 38%|ββββ | 479/1250 [11:45<19:52, 1.55s/it]
Training 1/1 epoch (loss 2.5335): 38%|ββββ | 479/1250 [11:47<19:52, 1.55s/it]
Training 1/1 epoch (loss 2.5335): 38%|ββββ | 480/1250 [11:47<18:30, 1.44s/it]
Training 1/1 epoch (loss 2.5061): 38%|ββββ | 480/1250 [11:49<18:30, 1.44s/it]
Training 1/1 epoch (loss 2.5061): 38%|ββββ | 481/1250 [11:49<21:55, 1.71s/it]
Training 1/1 epoch (loss 2.7551): 38%|ββββ | 481/1250 [11:50<21:55, 1.71s/it]
Training 1/1 epoch (loss 2.7551): 39%|ββββ | 482/1250 [11:50<20:13, 1.58s/it]
Training 1/1 epoch (loss 2.5300): 39%|ββββ | 482/1250 [11:51<20:13, 1.58s/it]
Training 1/1 epoch (loss 2.5300): 39%|ββββ | 483/1250 [11:51<16:05, 1.26s/it]
Training 1/1 epoch (loss 2.4110): 39%|ββββ | 483/1250 [11:52<16:05, 1.26s/it]
Training 1/1 epoch (loss 2.4110): 39%|ββββ | 484/1250 [11:52<16:22, 1.28s/it]
Training 1/1 epoch (loss 2.4364): 39%|ββββ | 484/1250 [11:54<16:22, 1.28s/it]
Training 1/1 epoch (loss 2.4364): 39%|ββββ | 485/1250 [11:54<17:19, 1.36s/it]
Training 1/1 epoch (loss 2.6351): 39%|ββββ | 485/1250 [11:54<17:19, 1.36s/it]
Training 1/1 epoch (loss 2.6351): 39%|ββββ | 486/1250 [11:54<14:00, 1.10s/it]
Training 1/1 epoch (loss 2.5930): 39%|ββββ | 486/1250 [11:57<14:00, 1.10s/it]
Training 1/1 epoch (loss 2.5930): 39%|ββββ | 487/1250 [11:57<19:08, 1.51s/it]
Training 1/1 epoch (loss 2.4453): 39%|ββββ | 487/1250 [11:59<19:08, 1.51s/it]
Training 1/1 epoch (loss 2.4453): 39%|ββββ | 488/1250 [11:59<21:03, 1.66s/it]
Training 1/1 epoch (loss 2.5649): 39%|ββββ | 488/1250 [11:59<21:03, 1.66s/it]
Training 1/1 epoch (loss 2.5649): 39%|ββββ | 489/1250 [11:59<17:31, 1.38s/it]
Training 1/1 epoch (loss 2.7979): 39%|ββββ | 489/1250 [12:01<17:31, 1.38s/it]
Training 1/1 epoch (loss 2.7979): 39%|ββββ | 490/1250 [12:01<17:21, 1.37s/it]
Training 1/1 epoch (loss 2.5990): 39%|ββββ | 490/1250 [12:02<17:21, 1.37s/it]
Training 1/1 epoch (loss 2.5990): 39%|ββββ | 491/1250 [12:02<17:20, 1.37s/it]
Training 1/1 epoch (loss 2.5440): 39%|ββββ | 491/1250 [12:03<17:20, 1.37s/it]
Training 1/1 epoch (loss 2.5440): 39%|ββββ | 492/1250 [12:03<16:58, 1.34s/it]
Training 1/1 epoch (loss 2.6045): 39%|ββββ | 492/1250 [12:05<16:58, 1.34s/it]
Training 1/1 epoch (loss 2.6045): 39%|ββββ | 493/1250 [12:05<19:01, 1.51s/it]
Training 1/1 epoch (loss 2.4121): 39%|ββββ | 493/1250 [12:06<19:01, 1.51s/it]
Training 1/1 epoch (loss 2.4121): 40%|ββββ | 494/1250 [12:06<15:22, 1.22s/it]
Training 1/1 epoch (loss 2.6961): 40%|ββββ | 494/1250 [12:07<15:22, 1.22s/it]
Training 1/1 epoch (loss 2.6961): 40%|ββββ | 495/1250 [12:07<15:47, 1.25s/it]
Training 1/1 epoch (loss 2.6368): 40%|ββββ | 495/1250 [12:09<15:47, 1.25s/it]
Training 1/1 epoch (loss 2.6368): 40%|ββββ | 496/1250 [12:09<16:37, 1.32s/it]
Training 1/1 epoch (loss 2.4938): 40%|ββββ | 496/1250 [12:09<16:37, 1.32s/it]
Training 1/1 epoch (loss 2.4938): 40%|ββββ | 497/1250 [12:09<14:33, 1.16s/it]
Training 1/1 epoch (loss 2.7109): 40%|ββββ | 497/1250 [12:12<14:33, 1.16s/it]
Training 1/1 epoch (loss 2.7109): 40%|ββββ | 498/1250 [12:12<19:18, 1.54s/it]
Training 1/1 epoch (loss 2.3223): 40%|ββββ | 498/1250 [12:14<19:18, 1.54s/it]
Training 1/1 epoch (loss 2.3223): 40%|ββββ | 499/1250 [12:14<22:01, 1.76s/it]
Training 1/1 epoch (loss 2.6849): 40%|ββββ | 499/1250 [12:15<22:01, 1.76s/it]
Training 1/1 epoch (loss 2.6849): 40%|ββββ | 500/1250 [12:15<17:11, 1.38s/it]
Training 1/1 epoch (loss 2.6097): 40%|ββββ | 500/1250 [12:16<17:11, 1.38s/it]
Training 1/1 epoch (loss 2.6097): 40%|ββββ | 501/1250 [12:16<17:37, 1.41s/it]
Training 1/1 epoch (loss 2.7321): 40%|ββββ | 501/1250 [12:18<17:37, 1.41s/it]
Training 1/1 epoch (loss 2.7321): 40%|ββββ | 502/1250 [12:18<20:05, 1.61s/it]
Training 1/1 epoch (loss 2.8476): 40%|ββββ | 502/1250 [12:19<20:05, 1.61s/it]
Training 1/1 epoch (loss 2.8476): 40%|ββββ | 503/1250 [12:19<15:57, 1.28s/it]
Training 1/1 epoch (loss 2.4316): 40%|ββββ | 503/1250 [12:20<15:57, 1.28s/it]
Training 1/1 epoch (loss 2.4316): 40%|ββββ | 504/1250 [12:20<17:52, 1.44s/it]
Training 1/1 epoch (loss 2.8073): 40%|ββββ | 504/1250 [12:23<17:52, 1.44s/it]
Training 1/1 epoch (loss 2.8073): 40%|ββββ | 505/1250 [12:23<21:30, 1.73s/it]
Training 1/1 epoch (loss 2.7498): 40%|ββββ | 505/1250 [12:23<21:30, 1.73s/it]
Training 1/1 epoch (loss 2.7498): 40%|ββββ | 506/1250 [12:23<16:25, 1.32s/it]
Training 1/1 epoch (loss 2.5748): 40%|ββββ | 506/1250 [12:26<16:25, 1.32s/it]
Training 1/1 epoch (loss 2.5748): 41%|ββββ | 507/1250 [12:26<20:02, 1.62s/it]
Training 1/1 epoch (loss 2.5914): 41%|ββββ | 507/1250 [12:27<20:02, 1.62s/it]
Training 1/1 epoch (loss 2.5914): 41%|ββββ | 508/1250 [12:27<20:13, 1.63s/it]
Training 1/1 epoch (loss 2.6236): 41%|ββββ | 508/1250 [12:28<20:13, 1.63s/it]
Training 1/1 epoch (loss 2.6236): 41%|ββββ | 509/1250 [12:28<16:52, 1.37s/it]
Training 1/1 epoch (loss 2.4864): 41%|ββββ | 509/1250 [12:30<16:52, 1.37s/it]
Training 1/1 epoch (loss 2.4864): 41%|ββββ | 510/1250 [12:30<19:41, 1.60s/it]
Training 1/1 epoch (loss 2.4993): 41%|ββββ | 510/1250 [12:32<19:41, 1.60s/it]
Training 1/1 epoch (loss 2.4993): 41%|ββββ | 511/1250 [12:32<19:10, 1.56s/it]
Training 1/1 epoch (loss 2.5413): 41%|ββββ | 511/1250 [12:33<19:10, 1.56s/it]
Training 1/1 epoch (loss 2.5413): 41%|ββββ | 512/1250 [12:33<18:40, 1.52s/it]
Training 1/1 epoch (loss 2.5662): 41%|ββββ | 512/1250 [12:34<18:40, 1.52s/it]
Training 1/1 epoch (loss 2.5662): 41%|ββββ | 513/1250 [12:34<17:06, 1.39s/it]
Training 1/1 epoch (loss 2.4906): 41%|ββββ | 513/1250 [12:36<17:06, 1.39s/it]
Training 1/1 epoch (loss 2.4906): 41%|ββββ | 514/1250 [12:36<17:35, 1.43s/it]
Training 1/1 epoch (loss 2.4394): 41%|ββββ | 514/1250 [12:37<17:35, 1.43s/it]
Training 1/1 epoch (loss 2.4394): 41%|ββββ | 515/1250 [12:37<15:49, 1.29s/it]
Training 1/1 epoch (loss 2.5632): 41%|ββββ | 515/1250 [12:39<15:49, 1.29s/it]
Training 1/1 epoch (loss 2.5632): 41%|βββββ | 516/1250 [12:39<18:21, 1.50s/it]
Training 1/1 epoch (loss 2.7066): 41%|βββββ | 516/1250 [12:39<18:21, 1.50s/it]
Training 1/1 epoch (loss 2.7066): 41%|βββββ | 517/1250 [12:39<16:15, 1.33s/it]
Training 1/1 epoch (loss 2.5641): 41%|βββββ | 517/1250 [12:41<16:15, 1.33s/it]
Training 1/1 epoch (loss 2.5641): 41%|βββββ | 518/1250 [12:41<18:01, 1.48s/it]
Training 1/1 epoch (loss 2.5284): 41%|βββββ | 518/1250 [12:43<18:01, 1.48s/it]
Training 1/1 epoch (loss 2.5284): 42%|βββββ | 519/1250 [12:43<19:10, 1.57s/it]
Training 1/1 epoch (loss 2.4196): 42%|βββββ | 519/1250 [12:44<19:10, 1.57s/it]
Training 1/1 epoch (loss 2.4196): 42%|βββββ | 520/1250 [12:44<17:22, 1.43s/it]
Training 1/1 epoch (loss 2.7214): 42%|βββββ | 520/1250 [12:47<17:22, 1.43s/it]
Training 1/1 epoch (loss 2.7214): 42%|βββββ | 521/1250 [12:47<20:45, 1.71s/it]
Training 1/1 epoch (loss 2.6504): 42%|βββββ | 521/1250 [12:49<20:45, 1.71s/it]
Training 1/1 epoch (loss 2.6504): 42%|βββββ | 522/1250 [12:49<22:06, 1.82s/it]
Training 1/1 epoch (loss 2.7730): 42%|βββββ | 522/1250 [12:49<22:06, 1.82s/it]
Training 1/1 epoch (loss 2.7730): 42%|βββββ | 523/1250 [12:49<17:37, 1.45s/it]
Training 1/1 epoch (loss 2.6191): 42%|βββββ | 523/1250 [12:50<17:37, 1.45s/it]
Training 1/1 epoch (loss 2.6191): 42%|βββββ | 524/1250 [12:50<15:57, 1.32s/it]
Training 1/1 epoch (loss 2.6424): 42%|βββββ | 524/1250 [12:52<15:57, 1.32s/it]
Training 1/1 epoch (loss 2.6424): 42%|βββββ | 525/1250 [12:52<17:39, 1.46s/it]
Training 1/1 epoch (loss 2.5818): 42%|βββββ | 525/1250 [12:53<17:39, 1.46s/it]
Training 1/1 epoch (loss 2.5818): 42%|βββββ | 526/1250 [12:53<14:52, 1.23s/it]
Training 1/1 epoch (loss 2.4771): 42%|βββββ | 526/1250 [12:54<14:52, 1.23s/it]
Training 1/1 epoch (loss 2.4771): 42%|βββββ | 527/1250 [12:54<16:11, 1.34s/it]
Training 1/1 epoch (loss 2.6574): 42%|βββββ | 527/1250 [12:56<16:11, 1.34s/it]
Training 1/1 epoch (loss 2.6574): 42%|βββββ | 528/1250 [12:56<19:10, 1.59s/it]
Training 1/1 epoch (loss 2.4600): 42%|βββββ | 528/1250 [12:57<19:10, 1.59s/it]
Training 1/1 epoch (loss 2.4600): 42%|βββββ | 529/1250 [12:57<16:15, 1.35s/it]
Training 1/1 epoch (loss 2.6247): 42%|βββββ | 529/1250 [12:59<16:15, 1.35s/it]
Training 1/1 epoch (loss 2.6247): 42%|βββββ | 530/1250 [12:59<15:55, 1.33s/it]
Training 1/1 epoch (loss 2.3705): 42%|βββββ | 530/1250 [13:00<15:55, 1.33s/it]
Training 1/1 epoch (loss 2.3705): 42%|βββββ | 531/1250 [13:00<16:37, 1.39s/it]
Training 1/1 epoch (loss 2.5586): 42%|βββββ | 531/1250 [13:01<16:37, 1.39s/it]
Training 1/1 epoch (loss 2.5586): 43%|βββββ | 532/1250 [13:01<16:03, 1.34s/it]
Training 1/1 epoch (loss 2.6245): 43%|βββββ | 532/1250 [13:04<16:03, 1.34s/it]
Training 1/1 epoch (loss 2.6245): 43%|βββββ | 533/1250 [13:04<19:27, 1.63s/it]
Training 1/1 epoch (loss 2.7376): 43%|βββββ | 533/1250 [13:04<19:27, 1.63s/it]
Training 1/1 epoch (loss 2.7376): 43%|βββββ | 534/1250 [13:04<16:09, 1.35s/it]
Training 1/1 epoch (loss 2.8108): 43%|βββββ | 534/1250 [13:06<16:09, 1.35s/it]
Training 1/1 epoch (loss 2.8108): 43%|βββββ | 535/1250 [13:06<15:55, 1.34s/it]
Training 1/1 epoch (loss 2.7653): 43%|βββββ | 535/1250 [13:07<15:55, 1.34s/it]
Training 1/1 epoch (loss 2.7653): 43%|βββββ | 536/1250 [13:07<16:28, 1.38s/it]
Training 1/1 epoch (loss 2.8272): 43%|βββββ | 536/1250 [13:08<16:28, 1.38s/it]
Training 1/1 epoch (loss 2.8272): 43%|βββββ | 537/1250 [13:08<13:24, 1.13s/it]
Training 1/1 epoch (loss 2.4811): 43%|βββββ | 537/1250 [13:09<13:24, 1.13s/it]
Training 1/1 epoch (loss 2.4811): 43%|βββββ | 538/1250 [13:09<14:18, 1.21s/it]
Training 1/1 epoch (loss 2.6670): 43%|βββββ | 538/1250 [13:10<14:18, 1.21s/it]
Training 1/1 epoch (loss 2.6670): 43%|βββββ | 539/1250 [13:10<15:04, 1.27s/it]
Training 1/1 epoch (loss 2.7530): 43%|βββββ | 539/1250 [13:11<15:04, 1.27s/it]
Training 1/1 epoch (loss 2.7530): 43%|βββββ | 540/1250 [13:11<12:05, 1.02s/it]
Training 1/1 epoch (loss 2.6119): 43%|βββββ | 540/1250 [13:13<12:05, 1.02s/it]
Training 1/1 epoch (loss 2.6119): 43%|βββββ | 541/1250 [13:13<15:18, 1.29s/it]
Training 1/1 epoch (loss 2.6061): 43%|βββββ | 541/1250 [13:14<15:18, 1.29s/it]
Training 1/1 epoch (loss 2.6061): 43%|βββββ | 542/1250 [13:14<15:52, 1.35s/it]
Training 1/1 epoch (loss 2.4441): 43%|βββββ | 542/1250 [13:16<15:52, 1.35s/it]
Training 1/1 epoch (loss 2.4441): 43%|βββββ | 543/1250 [13:16<15:26, 1.31s/it]
Training 1/1 epoch (loss 2.5208): 43%|βββββ | 543/1250 [13:17<15:26, 1.31s/it]
Training 1/1 epoch (loss 2.5208): 44%|βββββ | 544/1250 [13:17<16:22, 1.39s/it]
Training 1/1 epoch (loss 2.5969): 44%|βββββ | 544/1250 [13:19<16:22, 1.39s/it]
Training 1/1 epoch (loss 2.5969): 44%|βββββ | 545/1250 [13:19<18:43, 1.59s/it]
Training 1/1 epoch (loss 2.6844): 44%|βββββ | 545/1250 [13:20<18:43, 1.59s/it]
Training 1/1 epoch (loss 2.6844): 44%|βββββ | 546/1250 [13:20<15:22, 1.31s/it]
Training 1/1 epoch (loss 2.6625): 44%|βββββ | 546/1250 [13:22<15:22, 1.31s/it]
Training 1/1 epoch (loss 2.6625): 44%|βββββ | 547/1250 [13:22<17:22, 1.48s/it]
Training 1/1 epoch (loss 2.7248): 44%|βββββ | 547/1250 [13:24<17:22, 1.48s/it]
Training 1/1 epoch (loss 2.7248): 44%|βββββ | 548/1250 [13:24<20:34, 1.76s/it]
Training 1/1 epoch (loss 2.8320): 44%|βββββ | 548/1250 [13:25<20:34, 1.76s/it]
Training 1/1 epoch (loss 2.8320): 44%|βββββ | 549/1250 [13:25<15:59, 1.37s/it]
Training 1/1 epoch (loss 2.6510): 44%|βββββ | 549/1250 [13:26<15:59, 1.37s/it]
Training 1/1 epoch (loss 2.6510): 44%|βββββ | 550/1250 [13:26<16:59, 1.46s/it]
Training 1/1 epoch (loss 2.5457): 44%|βββββ | 550/1250 [13:28<16:59, 1.46s/it]
Training 1/1 epoch (loss 2.5457): 44%|βββββ | 551/1250 [13:28<18:41, 1.60s/it]
Training 1/1 epoch (loss 2.4311): 44%|βββββ | 551/1250 [13:29<18:41, 1.60s/it]
Training 1/1 epoch (loss 2.4311): 44%|βββββ | 552/1250 [13:29<15:59, 1.37s/it]
Training 1/1 epoch (loss 2.7267): 44%|βββββ | 552/1250 [13:31<15:59, 1.37s/it]
Training 1/1 epoch (loss 2.7267): 44%|βββββ | 553/1250 [13:31<16:25, 1.41s/it]
Training 1/1 epoch (loss 2.5724): 44%|βββββ | 553/1250 [13:33<16:25, 1.41s/it]
Training 1/1 epoch (loss 2.5724): 44%|βββββ | 554/1250 [13:33<18:25, 1.59s/it]
Training 1/1 epoch (loss 2.3507): 44%|βββββ | 554/1250 [13:33<18:25, 1.59s/it]
Training 1/1 epoch (loss 2.3507): 44%|βββββ | 555/1250 [13:33<14:58, 1.29s/it]
Training 1/1 epoch (loss 2.5844): 44%|βββββ | 555/1250 [13:35<14:58, 1.29s/it]
Training 1/1 epoch (loss 2.5844): 44%|βββββ | 556/1250 [13:35<15:27, 1.34s/it]
Training 1/1 epoch (loss 2.7556): 44%|βββββ | 556/1250 [13:36<15:27, 1.34s/it]
Training 1/1 epoch (loss 2.7556): 45%|βββββ | 557/1250 [13:36<16:16, 1.41s/it]
Training 1/1 epoch (loss 2.6244): 45%|βββββ | 557/1250 [13:37<16:16, 1.41s/it]
Training 1/1 epoch (loss 2.6244): 45%|βββββ | 558/1250 [13:37<13:49, 1.20s/it]
Training 1/1 epoch (loss 2.6870): 45%|βββββ | 558/1250 [13:39<13:49, 1.20s/it]
Training 1/1 epoch (loss 2.6870): 45%|βββββ | 559/1250 [13:39<18:08, 1.58s/it]
Training 1/1 epoch (loss 2.7056): 45%|βββββ | 559/1250 [13:41<18:08, 1.58s/it]
Training 1/1 epoch (loss 2.7056): 45%|βββββ | 560/1250 [13:41<18:39, 1.62s/it]
Training 1/1 epoch (loss 2.4816): 45%|βββββ | 560/1250 [13:42<18:39, 1.62s/it]
Training 1/1 epoch (loss 2.4816): 45%|βββββ | 561/1250 [13:42<16:01, 1.40s/it]
Training 1/1 epoch (loss 2.6817): 45%|βββββ | 561/1250 [13:44<16:01, 1.40s/it]
Training 1/1 epoch (loss 2.6817): 45%|βββββ | 562/1250 [13:44<17:11, 1.50s/it]
Training 1/1 epoch (loss 2.7023): 45%|βββββ | 562/1250 [13:44<17:11, 1.50s/it]
Training 1/1 epoch (loss 2.7023): 45%|βββββ | 563/1250 [13:44<14:20, 1.25s/it]
Training 1/1 epoch (loss 2.7654): 45%|βββββ | 563/1250 [13:46<14:20, 1.25s/it]
Training 1/1 epoch (loss 2.7654): 45%|βββββ | 564/1250 [13:46<14:07, 1.24s/it]
Training 1/1 epoch (loss 2.6745): 45%|βββββ | 564/1250 [13:48<14:07, 1.24s/it]
Training 1/1 epoch (loss 2.6745): 45%|βββββ | 565/1250 [13:48<16:58, 1.49s/it]
Training 1/1 epoch (loss 2.6957): 45%|βββββ | 565/1250 [13:49<16:58, 1.49s/it]
Training 1/1 epoch (loss 2.6957): 45%|βββββ | 566/1250 [13:49<16:38, 1.46s/it]
Training 1/1 epoch (loss 2.6633): 45%|βββββ | 566/1250 [13:50<16:38, 1.46s/it]
Training 1/1 epoch (loss 2.6633): 45%|βββββ | 567/1250 [13:50<14:31, 1.28s/it]
Training 1/1 epoch (loss 2.5755): 45%|βββββ | 567/1250 [13:52<14:31, 1.28s/it]
Training 1/1 epoch (loss 2.5755): 45%|βββββ | 568/1250 [13:52<18:58, 1.67s/it]
Training 1/1 epoch (loss 2.7143): 45%|βββββ | 568/1250 [13:53<18:58, 1.67s/it]
Training 1/1 epoch (loss 2.7143): 46%|βββββ | 569/1250 [13:53<16:23, 1.44s/it]
Training 1/1 epoch (loss 2.5625): 46%|βββββ | 569/1250 [13:55<16:23, 1.44s/it]
Training 1/1 epoch (loss 2.5625): 46%|βββββ | 570/1250 [13:55<16:04, 1.42s/it]
Training 1/1 epoch (loss 2.6036): 46%|βββββ | 570/1250 [13:57<16:04, 1.42s/it]
Training 1/1 epoch (loss 2.6036): 46%|βββββ | 571/1250 [13:57<17:39, 1.56s/it]
Training 1/1 epoch (loss 2.7195): 46%|βββββ | 571/1250 [13:57<17:39, 1.56s/it]
Training 1/1 epoch (loss 2.7195): 46%|βββββ | 572/1250 [13:57<13:49, 1.22s/it]
Training 1/1 epoch (loss 2.6236): 46%|βββββ | 572/1250 [13:58<13:49, 1.22s/it]
Training 1/1 epoch (loss 2.6236): 46%|βββββ | 573/1250 [13:58<14:08, 1.25s/it]
Training 1/1 epoch (loss 2.6549): 46%|βββββ | 573/1250 [14:00<14:08, 1.25s/it]
Training 1/1 epoch (loss 2.6549): 46%|βββββ | 574/1250 [14:00<15:06, 1.34s/it]
Training 1/1 epoch (loss 2.7592): 46%|βββββ | 574/1250 [14:01<15:06, 1.34s/it]
Training 1/1 epoch (loss 2.7592): 46%|βββββ | 575/1250 [14:01<12:47, 1.14s/it]
Training 1/1 epoch (loss 2.5464): 46%|βββββ | 575/1250 [14:02<12:47, 1.14s/it]
Training 1/1 epoch (loss 2.5464): 46%|βββββ | 576/1250 [14:02<14:13, 1.27s/it]
Training 1/1 epoch (loss 2.5353): 46%|βββββ | 576/1250 [14:04<14:13, 1.27s/it]
Training 1/1 epoch (loss 2.5353): 46%|βββββ | 577/1250 [14:04<15:34, 1.39s/it]
Training 1/1 epoch (loss 2.7078): 46%|βββββ | 577/1250 [14:05<15:34, 1.39s/it]
Training 1/1 epoch (loss 2.7078): 46%|βββββ | 578/1250 [14:05<13:34, 1.21s/it]
Training 1/1 epoch (loss 2.8384): 46%|βββββ | 578/1250 [14:06<13:34, 1.21s/it]
Training 1/1 epoch (loss 2.8384): 46%|βββββ | 579/1250 [14:06<15:37, 1.40s/it]
Training 1/1 epoch (loss 2.7911): 46%|βββββ | 579/1250 [14:07<15:37, 1.40s/it]
Training 1/1 epoch (loss 2.7911): 46%|βββββ | 580/1250 [14:07<13:31, 1.21s/it]
Training 1/1 epoch (loss 2.5906): 46%|βββββ | 580/1250 [14:09<13:31, 1.21s/it]
Training 1/1 epoch (loss 2.5906): 46%|βββββ | 581/1250 [14:09<15:29, 1.39s/it]
Training 1/1 epoch (loss 2.5244): 46%|βββββ | 581/1250 [14:11<15:29, 1.39s/it]
Training 1/1 epoch (loss 2.5244): 47%|βββββ | 582/1250 [14:11<16:55, 1.52s/it]
Training 1/1 epoch (loss 2.4629): 47%|βββββ | 582/1250 [14:12<16:55, 1.52s/it]
Training 1/1 epoch (loss 2.4629): 47%|βββββ | 583/1250 [14:12<14:32, 1.31s/it]
Training 1/1 epoch (loss 2.5469): 47%|βββββ | 583/1250 [14:14<14:32, 1.31s/it]
Training 1/1 epoch (loss 2.5469): 47%|βββββ | 584/1250 [14:14<16:38, 1.50s/it]
Training 1/1 epoch (loss 2.6817): 47%|βββββ | 584/1250 [14:16<16:38, 1.50s/it]
Training 1/1 epoch (loss 2.6817): 47%|βββββ | 585/1250 [14:16<18:01, 1.63s/it]
Training 1/1 epoch (loss 2.5131): 47%|βββββ | 585/1250 [14:16<18:01, 1.63s/it]
Training 1/1 epoch (loss 2.5131): 47%|βββββ | 586/1250 [14:16<14:19, 1.29s/it]
Training 1/1 epoch (loss 2.5664): 47%|βββββ | 586/1250 [14:17<14:19, 1.29s/it]
Training 1/1 epoch (loss 2.5664): 47%|βββββ | 587/1250 [14:17<14:46, 1.34s/it]
Training 1/1 epoch (loss 2.3229): 47%|βββββ | 587/1250 [14:20<14:46, 1.34s/it]
Training 1/1 epoch (loss 2.3229): 47%|βββββ | 588/1250 [14:20<18:31, 1.68s/it]
Training 1/1 epoch (loss 2.4710): 47%|βββββ | 588/1250 [14:21<18:31, 1.68s/it]
Training 1/1 epoch (loss 2.4710): 47%|βββββ | 589/1250 [14:21<15:26, 1.40s/it]
Training 1/1 epoch (loss 2.6074): 47%|βββββ | 589/1250 [14:23<15:26, 1.40s/it]
Training 1/1 epoch (loss 2.6074): 47%|βββββ | 590/1250 [14:23<17:25, 1.58s/it]
Training 1/1 epoch (loss 2.4835): 47%|βββββ | 590/1250 [14:24<17:25, 1.58s/it]
Training 1/1 epoch (loss 2.4835): 47%|βββββ | 591/1250 [14:24<15:38, 1.42s/it]
Training 1/1 epoch (loss 2.6145): 47%|βββββ | 591/1250 [14:26<15:38, 1.42s/it]
Training 1/1 epoch (loss 2.6145): 47%|βββββ | 592/1250 [14:26<16:44, 1.53s/it]
Training 1/1 epoch (loss 2.7174): 47%|βββββ | 592/1250 [14:27<16:44, 1.53s/it]
Training 1/1 epoch (loss 2.7174): 47%|βββββ | 593/1250 [14:27<16:50, 1.54s/it]
Training 1/1 epoch (loss 2.5328): 47%|βββββ | 593/1250 [14:28<16:50, 1.54s/it]
Training 1/1 epoch (loss 2.5328): 48%|βββββ | 594/1250 [14:28<13:46, 1.26s/it]
Training 1/1 epoch (loss 2.4997): 48%|βββββ | 594/1250 [14:30<13:46, 1.26s/it]
Training 1/1 epoch (loss 2.4997): 48%|βββββ | 595/1250 [14:30<16:36, 1.52s/it]
Training 1/1 epoch (loss 2.5500): 48%|βββββ | 595/1250 [14:32<16:36, 1.52s/it]
Training 1/1 epoch (loss 2.5500): 48%|βββββ | 596/1250 [14:32<18:08, 1.66s/it]
Training 1/1 epoch (loss 2.6454): 48%|βββββ | 596/1250 [14:33<18:08, 1.66s/it]
Training 1/1 epoch (loss 2.6454): 48%|βββββ | 597/1250 [14:33<15:00, 1.38s/it]
Training 1/1 epoch (loss 2.5811): 48%|βββββ | 597/1250 [14:33<15:00, 1.38s/it]
Training 1/1 epoch (loss 2.5811): 48%|βββββ | 598/1250 [14:33<13:22, 1.23s/it]
Training 1/1 epoch (loss 2.6443): 48%|βββββ | 598/1250 [14:36<13:22, 1.23s/it]
Training 1/1 epoch (loss 2.6443): 48%|βββββ | 599/1250 [14:36<16:19, 1.50s/it]
Training 1/1 epoch (loss 2.6103): 48%|βββββ | 599/1250 [14:36<16:19, 1.50s/it]
Training 1/1 epoch (loss 2.6103): 48%|βββββ | 600/1250 [14:36<13:29, 1.24s/it]
Training 1/1 epoch (loss 2.7919): 48%|βββββ | 600/1250 [14:38<13:29, 1.24s/it]
Training 1/1 epoch (loss 2.7919): 48%|βββββ | 601/1250 [14:38<15:58, 1.48s/it]
Training 1/1 epoch (loss 2.5110): 48%|βββββ | 601/1250 [14:40<15:58, 1.48s/it]
Training 1/1 epoch (loss 2.5110): 48%|βββββ | 602/1250 [14:40<16:02, 1.49s/it]
Training 1/1 epoch (loss 2.4821): 48%|βββββ | 602/1250 [14:41<16:02, 1.49s/it]
Training 1/1 epoch (loss 2.4821): 48%|βββββ | 603/1250 [14:41<16:05, 1.49s/it]
Training 1/1 epoch (loss 2.6708): 48%|βββββ | 603/1250 [14:43<16:05, 1.49s/it]
Training 1/1 epoch (loss 2.6708): 48%|βββββ | 604/1250 [14:43<16:16, 1.51s/it]
Training 1/1 epoch (loss 2.5681): 48%|βββββ | 604/1250 [14:44<16:16, 1.51s/it]
Training 1/1 epoch (loss 2.5681): 48%|βββββ | 605/1250 [14:44<15:26, 1.44s/it]
Training 1/1 epoch (loss 2.7022): 48%|βββββ | 605/1250 [14:45<15:26, 1.44s/it]
Training 1/1 epoch (loss 2.7022): 48%|βββββ | 606/1250 [14:45<15:13, 1.42s/it]
Training 1/1 epoch (loss 2.4906): 48%|βββββ | 606/1250 [14:47<15:13, 1.42s/it]
Training 1/1 epoch (loss 2.4906): 49%|βββββ | 607/1250 [14:47<16:46, 1.57s/it]
Training 1/1 epoch (loss 2.8043): 49%|βββββ | 607/1250 [14:49<16:46, 1.57s/it]
Training 1/1 epoch (loss 2.8043): 49%|βββββ | 608/1250 [14:49<15:40, 1.47s/it]
Training 1/1 epoch (loss 2.5385): 49%|βββββ | 608/1250 [14:50<15:40, 1.47s/it]
Training 1/1 epoch (loss 2.5385): 49%|βββββ | 609/1250 [14:50<16:53, 1.58s/it]
Training 1/1 epoch (loss 2.6402): 49%|βββββ | 609/1250 [14:52<16:53, 1.58s/it]
Training 1/1 epoch (loss 2.6402): 49%|βββββ | 610/1250 [14:52<16:44, 1.57s/it]
Training 1/1 epoch (loss 2.6810): 49%|βββββ | 610/1250 [14:53<16:44, 1.57s/it]
Training 1/1 epoch (loss 2.6810): 49%|βββββ | 611/1250 [14:53<13:49, 1.30s/it]
Training 1/1 epoch (loss 2.4853): 49%|βββββ | 611/1250 [14:54<13:49, 1.30s/it]
Training 1/1 epoch (loss 2.4853): 49%|βββββ | 612/1250 [14:54<14:57, 1.41s/it]
Training 1/1 epoch (loss 2.7313): 49%|βββββ | 612/1250 [14:56<14:57, 1.41s/it]
Training 1/1 epoch (loss 2.7313): 49%|βββββ | 613/1250 [14:56<14:29, 1.36s/it]
Training 1/1 epoch (loss 2.7618): 49%|βββββ | 613/1250 [14:56<14:29, 1.36s/it]
Training 1/1 epoch (loss 2.7618): 49%|βββββ | 614/1250 [14:56<11:50, 1.12s/it]
Training 1/1 epoch (loss 2.5682): 49%|βββββ | 614/1250 [14:58<11:50, 1.12s/it]
Training 1/1 epoch (loss 2.5682): 49%|βββββ | 615/1250 [14:58<13:59, 1.32s/it]
Training 1/1 epoch (loss 2.6902): 49%|βββββ | 615/1250 [15:00<13:59, 1.32s/it]
Training 1/1 epoch (loss 2.6902): 49%|βββββ | 616/1250 [15:00<16:49, 1.59s/it]
Training 1/1 epoch (loss 2.2787): 49%|βββββ | 616/1250 [15:01<16:49, 1.59s/it]
Training 1/1 epoch (loss 2.2787): 49%|βββββ | 617/1250 [15:01<14:08, 1.34s/it]
Training 1/1 epoch (loss 2.5914): 49%|βββββ | 617/1250 [15:03<14:08, 1.34s/it]
Training 1/1 epoch (loss 2.5914): 49%|βββββ | 618/1250 [15:03<15:03, 1.43s/it]
Training 1/1 epoch (loss 2.4870): 49%|βββββ | 618/1250 [15:04<15:03, 1.43s/it]
Training 1/1 epoch (loss 2.4870): 50%|βββββ | 619/1250 [15:04<14:01, 1.33s/it]
Training 1/1 epoch (loss 2.5183): 50%|βββββ | 619/1250 [15:05<14:01, 1.33s/it]
Training 1/1 epoch (loss 2.5183): 50%|βββββ | 620/1250 [15:05<14:42, 1.40s/it]
Training 1/1 epoch (loss 2.4669): 50%|βββββ | 620/1250 [15:08<14:42, 1.40s/it]
Training 1/1 epoch (loss 2.4669): 50%|βββββ | 621/1250 [15:08<17:54, 1.71s/it]
Training 1/1 epoch (loss 2.7508): 50%|βββββ | 621/1250 [15:09<17:54, 1.71s/it]
Training 1/1 epoch (loss 2.7508): 50%|βββββ | 622/1250 [15:09<15:27, 1.48s/it]
Training 1/1 epoch (loss 2.6511): 50%|βββββ | 622/1250 [15:11<15:27, 1.48s/it]
Training 1/1 epoch (loss 2.6511): 50%|βββββ | 623/1250 [15:11<18:34, 1.78s/it]
Training 1/1 epoch (loss 2.5640): 50%|βββββ | 623/1250 [15:14<18:34, 1.78s/it]
Training 1/1 epoch (loss 2.5640): 50%|βββββ | 624/1250 [15:14<21:14, 2.04s/it]
Training 1/1 epoch (loss 2.6227): 50%|βββββ | 624/1250 [15:15<21:14, 2.04s/it]
Training 1/1 epoch (loss 2.6227): 50%|βββββ | 625/1250 [15:15<18:11, 1.75s/it]
Training 1/1 epoch (loss 2.6607): 50%|βββββ | 625/1250 [15:16<18:11, 1.75s/it]
Training 1/1 epoch (loss 2.6607): 50%|βββββ | 626/1250 [15:16<17:53, 1.72s/it]
Training 1/1 epoch (loss 2.3666): 50%|βββββ | 626/1250 [15:18<17:53, 1.72s/it]
Training 1/1 epoch (loss 2.3666): 50%|βββββ | 627/1250 [15:18<17:08, 1.65s/it]
Training 1/1 epoch (loss 2.5780): 50%|βββββ | 627/1250 [15:20<17:08, 1.65s/it]
Training 1/1 epoch (loss 2.5780): 50%|βββββ | 628/1250 [15:20<17:41, 1.71s/it]
Training 1/1 epoch (loss 2.7759): 50%|βββββ | 628/1250 [15:21<17:41, 1.71s/it]
Training 1/1 epoch (loss 2.7759): 50%|βββββ | 629/1250 [15:21<16:50, 1.63s/it]
Training 1/1 epoch (loss 2.3194): 50%|βββββ | 629/1250 [15:22<16:50, 1.63s/it]
Training 1/1 epoch (loss 2.3194): 50%|βββββ | 630/1250 [15:22<13:33, 1.31s/it]
Training 1/1 epoch (loss 2.6304): 50%|βββββ | 630/1250 [15:23<13:33, 1.31s/it]
Training 1/1 epoch (loss 2.6304): 50%|βββββ | 631/1250 [15:23<13:50, 1.34s/it]
Training 1/1 epoch (loss 2.7322): 50%|βββββ | 631/1250 [15:25<13:50, 1.34s/it]
Training 1/1 epoch (loss 2.7322): 51%|βββββ | 632/1250 [15:25<15:19, 1.49s/it]
Training 1/1 epoch (loss 2.6277): 51%|βββββ | 632/1250 [15:26<15:19, 1.49s/it]
Training 1/1 epoch (loss 2.6277): 51%|βββββ | 633/1250 [15:26<13:24, 1.30s/it]
Training 1/1 epoch (loss 2.5903): 51%|βββββ | 633/1250 [15:27<13:24, 1.30s/it]
Training 1/1 epoch (loss 2.5903): 51%|βββββ | 634/1250 [15:27<13:54, 1.35s/it]
Training 1/1 epoch (loss 2.4531): 51%|βββββ | 634/1250 [15:30<13:54, 1.35s/it]
Training 1/1 epoch (loss 2.4531): 51%|βββββ | 635/1250 [15:30<17:13, 1.68s/it]
Training 1/1 epoch (loss 2.4080): 51%|βββββ | 635/1250 [15:30<17:13, 1.68s/it]
Training 1/1 epoch (loss 2.4080): 51%|βββββ | 636/1250 [15:30<14:05, 1.38s/it]
Training 1/1 epoch (loss 2.4797): 51%|βββββ | 636/1250 [15:33<14:05, 1.38s/it]
Training 1/1 epoch (loss 2.4797): 51%|βββββ | 637/1250 [15:33<17:07, 1.68s/it]
Training 1/1 epoch (loss 2.4366): 51%|βββββ | 637/1250 [15:35<17:07, 1.68s/it]
Training 1/1 epoch (loss 2.4366): 51%|βββββ | 638/1250 [15:35<17:49, 1.75s/it]
Training 1/1 epoch (loss 2.7567): 51%|βββββ | 638/1250 [15:36<17:49, 1.75s/it]
Training 1/1 epoch (loss 2.7567): 51%|βββββ | 639/1250 [15:36<15:37, 1.53s/it]
Training 1/1 epoch (loss 2.5729): 51%|βββββ | 639/1250 [15:39<15:37, 1.53s/it]
Training 1/1 epoch (loss 2.5729): 51%|βββββ | 640/1250 [15:39<19:22, 1.91s/it]
Training 1/1 epoch (loss 2.5568): 51%|βββββ | 640/1250 [15:40<19:22, 1.91s/it]
Training 1/1 epoch (loss 2.5568): 51%|ββββββ | 641/1250 [15:40<18:00, 1.77s/it]
Training 1/1 epoch (loss 2.5435): 51%|ββββββ | 641/1250 [15:42<18:00, 1.77s/it]
Training 1/1 epoch (loss 2.5435): 51%|ββββββ | 642/1250 [15:42<18:59, 1.87s/it]
Training 1/1 epoch (loss 2.6269): 51%|ββββββ | 642/1250 [15:44<18:59, 1.87s/it]
Training 1/1 epoch (loss 2.6269): 51%|ββββββ | 643/1250 [15:44<18:21, 1.81s/it]
Training 1/1 epoch (loss 2.5830): 51%|ββββββ | 643/1250 [15:45<18:21, 1.81s/it]
Training 1/1 epoch (loss 2.5830): 52%|ββββββ | 644/1250 [15:45<15:06, 1.50s/it]
Training 1/1 epoch (loss 2.4615): 52%|ββββββ | 644/1250 [15:46<15:06, 1.50s/it]
Training 1/1 epoch (loss 2.4615): 52%|ββββββ | 645/1250 [15:46<15:23, 1.53s/it]
Training 1/1 epoch (loss 2.1903): 52%|ββββββ | 645/1250 [15:47<15:23, 1.53s/it]
Training 1/1 epoch (loss 2.1903): 52%|ββββββ | 646/1250 [15:47<13:50, 1.37s/it]
Training 1/1 epoch (loss 2.5423): 52%|ββββββ | 646/1250 [15:48<13:50, 1.37s/it]
Training 1/1 epoch (loss 2.5423): 52%|ββββββ | 647/1250 [15:48<11:24, 1.14s/it]
Training 1/1 epoch (loss 2.7382): 52%|ββββββ | 647/1250 [15:50<11:24, 1.14s/it]
Training 1/1 epoch (loss 2.7382): 52%|ββββββ | 648/1250 [15:50<15:37, 1.56s/it]
Training 1/1 epoch (loss 2.4365): 52%|ββββββ | 648/1250 [15:52<15:37, 1.56s/it]
Training 1/1 epoch (loss 2.4365): 52%|ββββββ | 649/1250 [15:52<16:08, 1.61s/it]
Training 1/1 epoch (loss 2.7005): 52%|ββββββ | 649/1250 [15:53<16:08, 1.61s/it]
Training 1/1 epoch (loss 2.7005): 52%|ββββββ | 650/1250 [15:53<15:31, 1.55s/it]
Training 1/1 epoch (loss 2.6318): 52%|ββββββ | 650/1250 [15:54<15:31, 1.55s/it]
Training 1/1 epoch (loss 2.6318): 52%|ββββββ | 651/1250 [15:54<14:01, 1.40s/it]
Training 1/1 epoch (loss 2.4586): 52%|ββββββ | 651/1250 [15:56<14:01, 1.40s/it]
Training 1/1 epoch (loss 2.4586): 52%|ββββββ | 652/1250 [15:56<14:33, 1.46s/it]
Training 1/1 epoch (loss 2.4828): 52%|ββββββ | 652/1250 [15:58<14:33, 1.46s/it]
Training 1/1 epoch (loss 2.4828): 52%|ββββββ | 653/1250 [15:58<14:35, 1.47s/it]
Training 1/1 epoch (loss 2.4164): 52%|ββββββ | 653/1250 [15:59<14:35, 1.47s/it]
Training 1/1 epoch (loss 2.4164): 52%|ββββββ | 654/1250 [15:59<14:56, 1.50s/it]
Training 1/1 epoch (loss 2.5949): 52%|ββββββ | 654/1250 [16:01<14:56, 1.50s/it]
Training 1/1 epoch (loss 2.5949): 52%|ββββββ | 655/1250 [16:01<14:37, 1.47s/it]
Training 1/1 epoch (loss 2.7660): 52%|ββββββ | 655/1250 [16:02<14:37, 1.47s/it]
Training 1/1 epoch (loss 2.7660): 52%|ββββββ | 656/1250 [16:02<13:02, 1.32s/it]
Training 1/1 epoch (loss 2.7274): 52%|ββββββ | 656/1250 [16:03<13:02, 1.32s/it]
Training 1/1 epoch (loss 2.7274): 53%|ββββββ | 657/1250 [16:03<13:57, 1.41s/it]
Training 1/1 epoch (loss 2.7435): 53%|ββββββ | 657/1250 [16:04<13:57, 1.41s/it]
Training 1/1 epoch (loss 2.7435): 53%|ββββββ | 658/1250 [16:04<11:36, 1.18s/it]
Training 1/1 epoch (loss 2.4963): 53%|ββββββ | 658/1250 [16:06<11:36, 1.18s/it]
Training 1/1 epoch (loss 2.4963): 53%|ββββββ | 659/1250 [16:06<15:39, 1.59s/it]
Training 1/1 epoch (loss 2.2754): 53%|ββββββ | 659/1250 [16:09<15:39, 1.59s/it]
Training 1/1 epoch (loss 2.2754): 53%|ββββββ | 660/1250 [16:09<18:05, 1.84s/it]
Training 1/1 epoch (loss 2.4492): 53%|ββββββ | 660/1250 [16:09<18:05, 1.84s/it]
Training 1/1 epoch (loss 2.4492): 53%|ββββββ | 661/1250 [16:09<14:02, 1.43s/it]
Training 1/1 epoch (loss 2.5118): 53%|ββββββ | 661/1250 [16:12<14:02, 1.43s/it]
Training 1/1 epoch (loss 2.5118): 53%|ββββββ | 662/1250 [16:12<16:55, 1.73s/it]
Training 1/1 epoch (loss 2.6008): 53%|ββββββ | 662/1250 [16:13<16:55, 1.73s/it]
Training 1/1 epoch (loss 2.6008): 53%|ββββββ | 663/1250 [16:13<16:42, 1.71s/it]
Training 1/1 epoch (loss 2.6699): 53%|ββββββ | 663/1250 [16:14<16:42, 1.71s/it]
Training 1/1 epoch (loss 2.6699): 53%|ββββββ | 664/1250 [16:14<13:32, 1.39s/it]
Training 1/1 epoch (loss 2.6048): 53%|ββββββ | 664/1250 [16:16<13:32, 1.39s/it]
Training 1/1 epoch (loss 2.6048): 53%|ββββββ | 665/1250 [16:16<14:22, 1.47s/it]
Training 1/1 epoch (loss 2.5760): 53%|ββββββ | 665/1250 [16:18<14:22, 1.47s/it]
Training 1/1 epoch (loss 2.5760): 53%|ββββββ | 666/1250 [16:18<15:55, 1.64s/it]
Training 1/1 epoch (loss 2.7904): 53%|ββββββ | 666/1250 [16:18<15:55, 1.64s/it]
Training 1/1 epoch (loss 2.7904): 53%|ββββββ | 667/1250 [16:18<12:42, 1.31s/it]
Training 1/1 epoch (loss 2.6637): 53%|ββββββ | 667/1250 [16:21<12:42, 1.31s/it]
Training 1/1 epoch (loss 2.6637): 53%|ββββββ | 668/1250 [16:21<16:14, 1.67s/it]
Training 1/1 epoch (loss 2.6271): 53%|ββββββ | 668/1250 [16:22<16:14, 1.67s/it]
Training 1/1 epoch (loss 2.6271): 54%|ββββββ | 669/1250 [16:22<15:02, 1.55s/it]
Training 1/1 epoch (loss 2.8648): 54%|ββββββ | 669/1250 [16:23<15:02, 1.55s/it]
Training 1/1 epoch (loss 2.8648): 54%|ββββββ | 670/1250 [16:23<12:32, 1.30s/it]
Training 1/1 epoch (loss 2.8592): 54%|ββββββ | 670/1250 [16:24<12:32, 1.30s/it]
Training 1/1 epoch (loss 2.8592): 54%|ββββββ | 671/1250 [16:24<13:05, 1.36s/it]
Training 1/1 epoch (loss 2.7948): 54%|ββββββ | 671/1250 [16:26<13:05, 1.36s/it]
Training 1/1 epoch (loss 2.7948): 54%|ββββββ | 672/1250 [16:26<14:14, 1.48s/it]
Training 1/1 epoch (loss 2.4465): 54%|ββββββ | 672/1250 [16:27<14:14, 1.48s/it]
Training 1/1 epoch (loss 2.4465): 54%|ββββββ | 673/1250 [16:27<14:21, 1.49s/it]
Training 1/1 epoch (loss 2.7113): 54%|ββββββ | 673/1250 [16:30<14:21, 1.49s/it]
Training 1/1 epoch (loss 2.7113): 54%|ββββββ | 674/1250 [16:30<16:44, 1.74s/it]
Training 1/1 epoch (loss 2.3839): 54%|ββββββ | 674/1250 [16:31<16:44, 1.74s/it]
Training 1/1 epoch (loss 2.3839): 54%|ββββββ | 675/1250 [16:31<14:16, 1.49s/it]
Training 1/1 epoch (loss 2.6224): 54%|ββββββ | 675/1250 [16:33<14:16, 1.49s/it]
Training 1/1 epoch (loss 2.6224): 54%|ββββββ | 676/1250 [16:33<15:13, 1.59s/it]
Training 1/1 epoch (loss 2.4234): 54%|ββββββ | 676/1250 [16:35<15:13, 1.59s/it]
Training 1/1 epoch (loss 2.4234): 54%|ββββββ | 677/1250 [16:35<17:44, 1.86s/it]
Training 1/1 epoch (loss 2.6902): 54%|ββββββ | 677/1250 [16:36<17:44, 1.86s/it]
Training 1/1 epoch (loss 2.6902): 54%|ββββββ | 678/1250 [16:36<14:40, 1.54s/it]
Training 1/1 epoch (loss 2.6778): 54%|ββββββ | 678/1250 [16:38<14:40, 1.54s/it]
Training 1/1 epoch (loss 2.6778): 54%|ββββββ | 679/1250 [16:38<15:59, 1.68s/it]
Training 1/1 epoch (loss 2.6037): 54%|ββββββ | 679/1250 [16:39<15:59, 1.68s/it]
Training 1/1 epoch (loss 2.6037): 54%|ββββββ | 680/1250 [16:39<15:40, 1.65s/it]
Training 1/1 epoch (loss 2.4749): 54%|ββββββ | 680/1250 [16:40<15:40, 1.65s/it]
Training 1/1 epoch (loss 2.4749): 54%|ββββββ | 681/1250 [16:40<13:07, 1.38s/it]
Training 1/1 epoch (loss 2.6862): 54%|ββββββ | 681/1250 [16:42<13:07, 1.38s/it]
Training 1/1 epoch (loss 2.6862): 55%|ββββββ | 682/1250 [16:42<15:20, 1.62s/it]
Training 1/1 epoch (loss 2.7036): 55%|ββββββ | 682/1250 [16:44<15:20, 1.62s/it]
Training 1/1 epoch (loss 2.7036): 55%|ββββββ | 683/1250 [16:44<15:06, 1.60s/it]
Training 1/1 epoch (loss 2.6619): 55%|ββββββ | 683/1250 [16:45<15:06, 1.60s/it]
Training 1/1 epoch (loss 2.6619): 55%|ββββββ | 684/1250 [16:45<14:20, 1.52s/it]
Training 1/1 epoch (loss 2.3929): 55%|ββββββ | 684/1250 [16:47<14:20, 1.52s/it]
Training 1/1 epoch (loss 2.3929): 55%|ββββββ | 685/1250 [16:47<13:55, 1.48s/it]
Training 1/1 epoch (loss 2.6683): 55%|ββββββ | 685/1250 [16:48<13:55, 1.48s/it]
Training 1/1 epoch (loss 2.6683): 55%|ββββββ | 686/1250 [16:48<14:00, 1.49s/it]
Training 1/1 epoch (loss 2.4910): 55%|ββββββ | 686/1250 [16:49<14:00, 1.49s/it]
Training 1/1 epoch (loss 2.4910): 55%|ββββββ | 687/1250 [16:49<12:48, 1.37s/it]
Training 1/1 epoch (loss 2.6414): 55%|ββββββ | 687/1250 [16:51<12:48, 1.37s/it]
Training 1/1 epoch (loss 2.6414): 55%|ββββββ | 688/1250 [16:51<14:38, 1.56s/it]
Training 1/1 epoch (loss 2.6943): 55%|ββββββ | 688/1250 [16:52<14:38, 1.56s/it]
Training 1/1 epoch (loss 2.6943): 55%|ββββββ | 689/1250 [16:52<11:56, 1.28s/it]
Training 1/1 epoch (loss 2.7189): 55%|ββββββ | 689/1250 [16:54<11:56, 1.28s/it]
Training 1/1 epoch (loss 2.7189): 55%|ββββββ | 690/1250 [16:54<15:23, 1.65s/it]
Training 1/1 epoch (loss 2.5563): 55%|ββββββ | 690/1250 [16:56<15:23, 1.65s/it]
Training 1/1 epoch (loss 2.5563): 55%|ββββββ | 691/1250 [16:56<15:52, 1.70s/it]
Training 1/1 epoch (loss 2.6491): 55%|ββββββ | 691/1250 [16:57<15:52, 1.70s/it]
Training 1/1 epoch (loss 2.6491): 55%|ββββββ | 692/1250 [16:57<13:19, 1.43s/it]
Training 1/1 epoch (loss 2.6049): 55%|ββββββ | 692/1250 [16:58<13:19, 1.43s/it]
Training 1/1 epoch (loss 2.6049): 55%|ββββββ | 693/1250 [16:58<13:16, 1.43s/it]
Training 1/1 epoch (loss 2.7655): 55%|ββββββ | 693/1250 [16:59<13:16, 1.43s/it]
Training 1/1 epoch (loss 2.7655): 56%|ββββββ | 694/1250 [16:59<12:01, 1.30s/it]
Training 1/1 epoch (loss 2.5687): 56%|ββββββ | 694/1250 [17:01<12:01, 1.30s/it]
Training 1/1 epoch (loss 2.5687): 56%|ββββββ | 695/1250 [17:01<12:31, 1.35s/it]
Training 1/1 epoch (loss 2.8039): 56%|ββββββ | 695/1250 [17:03<12:31, 1.35s/it]
Training 1/1 epoch (loss 2.8039): 56%|ββββββ | 696/1250 [17:03<13:34, 1.47s/it]
Training 1/1 epoch (loss 2.6238): 56%|ββββββ | 696/1250 [17:03<13:34, 1.47s/it]
Training 1/1 epoch (loss 2.6238): 56%|ββββββ | 697/1250 [17:03<11:38, 1.26s/it]
Training 1/1 epoch (loss 2.4462): 56%|ββββββ | 697/1250 [17:04<11:38, 1.26s/it]
Training 1/1 epoch (loss 2.4462): 56%|ββββββ | 698/1250 [17:04<10:52, 1.18s/it]
Training 1/1 epoch (loss 2.7596): 56%|ββββββ | 698/1250 [17:07<10:52, 1.18s/it]
Training 1/1 epoch (loss 2.7596): 56%|ββββββ | 699/1250 [17:07<14:12, 1.55s/it]
Training 1/1 epoch (loss 2.8827): 56%|ββββββ | 699/1250 [17:07<14:12, 1.55s/it]
Training 1/1 epoch (loss 2.8827): 56%|ββββββ | 700/1250 [17:07<11:15, 1.23s/it]
Training 1/1 epoch (loss 2.7614): 56%|ββββββ | 700/1250 [17:09<11:15, 1.23s/it]
Training 1/1 epoch (loss 2.7614): 56%|ββββββ | 701/1250 [17:09<11:34, 1.27s/it]
Training 1/1 epoch (loss 2.5256): 56%|ββββββ | 701/1250 [17:10<11:34, 1.27s/it]
Training 1/1 epoch (loss 2.5256): 56%|ββββββ | 702/1250 [17:10<12:43, 1.39s/it]
Training 1/1 epoch (loss 2.6495): 56%|ββββββ | 702/1250 [17:12<12:43, 1.39s/it]
Training 1/1 epoch (loss 2.6495): 56%|ββββββ | 703/1250 [17:12<12:35, 1.38s/it]
Training 1/1 epoch (loss 2.4158): 56%|ββββββ | 703/1250 [17:14<12:35, 1.38s/it]
Training 1/1 epoch (loss 2.4158): 56%|ββββββ | 704/1250 [17:14<16:11, 1.78s/it]
Training 1/1 epoch (loss 2.6337): 56%|ββββββ | 704/1250 [17:15<16:11, 1.78s/it]
Training 1/1 epoch (loss 2.6337): 56%|ββββββ | 705/1250 [17:15<14:06, 1.55s/it]
Training 1/1 epoch (loss 2.7225): 56%|ββββββ | 705/1250 [17:18<14:06, 1.55s/it]
Training 1/1 epoch (loss 2.7225): 56%|ββββββ | 706/1250 [17:18<15:49, 1.75s/it]
Training 1/1 epoch (loss 2.6764): 56%|ββββββ | 706/1250 [17:20<15:49, 1.75s/it]
Training 1/1 epoch (loss 2.6764): 57%|ββββββ | 707/1250 [17:20<17:44, 1.96s/it]
Training 1/1 epoch (loss 2.7126): 57%|ββββββ | 707/1250 [17:21<17:44, 1.96s/it]
Training 1/1 epoch (loss 2.7126): 57%|ββββββ | 708/1250 [17:21<13:42, 1.52s/it]
Training 1/1 epoch (loss 2.7566): 57%|ββββββ | 708/1250 [17:22<13:42, 1.52s/it]
Training 1/1 epoch (loss 2.7566): 57%|ββββββ | 709/1250 [17:22<14:54, 1.65s/it]
Training 1/1 epoch (loss 2.5558): 57%|ββββββ | 709/1250 [17:25<14:54, 1.65s/it]
Training 1/1 epoch (loss 2.5558): 57%|ββββββ | 710/1250 [17:25<16:35, 1.84s/it]
Training 1/1 epoch (loss 2.6972): 57%|ββββββ | 710/1250 [17:25<16:35, 1.84s/it]
Training 1/1 epoch (loss 2.6972): 57%|ββββββ | 711/1250 [17:25<13:15, 1.48s/it]
Training 1/1 epoch (loss 2.7081): 57%|ββββββ | 711/1250 [17:28<13:15, 1.48s/it]
Training 1/1 epoch (loss 2.7081): 57%|ββββββ | 712/1250 [17:28<15:37, 1.74s/it]
Training 1/1 epoch (loss 2.4670): 57%|ββββββ | 712/1250 [17:29<15:37, 1.74s/it]
Training 1/1 epoch (loss 2.4670): 57%|ββββββ | 713/1250 [17:29<15:11, 1.70s/it]
Training 1/1 epoch (loss 2.8242): 57%|ββββββ | 713/1250 [17:31<15:11, 1.70s/it]
Training 1/1 epoch (loss 2.8242): 57%|ββββββ | 714/1250 [17:31<15:19, 1.72s/it]
Training 1/1 epoch (loss 2.7099): 57%|ββββββ | 714/1250 [17:33<15:19, 1.72s/it]
Training 1/1 epoch (loss 2.7099): 57%|ββββββ | 715/1250 [17:33<16:54, 1.90s/it]
Training 1/1 epoch (loss 2.3441): 57%|ββββββ | 715/1250 [17:34<16:54, 1.90s/it]
Training 1/1 epoch (loss 2.3441): 57%|ββββββ | 716/1250 [17:34<13:04, 1.47s/it]
Training 1/1 epoch (loss 2.6468): 57%|ββββββ | 716/1250 [17:36<13:04, 1.47s/it]
Training 1/1 epoch (loss 2.6468): 57%|ββββββ | 717/1250 [17:36<13:50, 1.56s/it]
Training 1/1 epoch (loss 2.6536): 57%|ββββββ | 717/1250 [17:38<13:50, 1.56s/it]
Training 1/1 epoch (loss 2.6536): 57%|ββββββ | 718/1250 [17:38<14:44, 1.66s/it]
Training 1/1 epoch (loss 2.6858): 57%|ββββββ | 718/1250 [17:38<14:44, 1.66s/it]
Training 1/1 epoch (loss 2.6858): 58%|ββββββ | 719/1250 [17:38<11:32, 1.30s/it]
Training 1/1 epoch (loss 2.7392): 58%|ββββββ | 719/1250 [17:41<11:32, 1.30s/it]
Training 1/1 epoch (loss 2.7392): 58%|ββββββ | 720/1250 [17:41<14:36, 1.65s/it]
Training 1/1 epoch (loss 2.5389): 58%|ββββββ | 720/1250 [17:42<14:36, 1.65s/it]
Training 1/1 epoch (loss 2.5389): 58%|ββββββ | 721/1250 [17:42<14:54, 1.69s/it]
Training 1/1 epoch (loss 2.5279): 58%|ββββββ | 721/1250 [17:43<14:54, 1.69s/it]
Training 1/1 epoch (loss 2.5279): 58%|ββββββ | 722/1250 [17:43<12:49, 1.46s/it]
Training 1/1 epoch (loss 2.4048): 58%|ββββββ | 722/1250 [17:45<12:49, 1.46s/it]
Training 1/1 epoch (loss 2.4048): 58%|ββββββ | 723/1250 [17:45<13:15, 1.51s/it]
Training 1/1 epoch (loss 2.5238): 58%|ββββββ | 723/1250 [17:46<13:15, 1.51s/it]
Training 1/1 epoch (loss 2.5238): 58%|ββββββ | 724/1250 [17:46<13:29, 1.54s/it]
Training 1/1 epoch (loss 2.8951): 58%|ββββββ | 724/1250 [17:47<13:29, 1.54s/it]
Training 1/1 epoch (loss 2.8951): 58%|ββββββ | 725/1250 [17:47<11:00, 1.26s/it]
Training 1/1 epoch (loss 2.5546): 58%|ββββββ | 725/1250 [17:49<11:00, 1.26s/it]
Training 1/1 epoch (loss 2.5546): 58%|ββββββ | 726/1250 [17:49<14:08, 1.62s/it]
Training 1/1 epoch (loss 2.4951): 58%|ββββββ | 726/1250 [17:52<14:08, 1.62s/it]
Training 1/1 epoch (loss 2.4951): 58%|ββββββ | 727/1250 [17:52<15:14, 1.75s/it]
Training 1/1 epoch (loss 2.4851): 58%|ββββββ | 727/1250 [17:52<15:14, 1.75s/it]
Training 1/1 epoch (loss 2.4851): 58%|ββββββ | 728/1250 [17:52<12:42, 1.46s/it]
Training 1/1 epoch (loss 2.6863): 58%|ββββββ | 728/1250 [17:55<12:42, 1.46s/it]
Training 1/1 epoch (loss 2.6863): 58%|ββββββ | 729/1250 [17:55<14:51, 1.71s/it]
Training 1/1 epoch (loss 2.6912): 58%|ββββββ | 729/1250 [17:56<14:51, 1.71s/it]
Training 1/1 epoch (loss 2.6912): 58%|ββββββ | 730/1250 [17:56<12:56, 1.49s/it]
Training 1/1 epoch (loss 2.6046): 58%|ββββββ | 730/1250 [17:57<12:56, 1.49s/it]
Training 1/1 epoch (loss 2.6046): 58%|ββββββ | 731/1250 [17:57<12:48, 1.48s/it]
Training 1/1 epoch (loss 2.3552): 58%|ββββββ | 731/1250 [17:59<12:48, 1.48s/it]
Training 1/1 epoch (loss 2.3552): 59%|ββββββ | 732/1250 [17:59<13:59, 1.62s/it]
Training 1/1 epoch (loss 2.4945): 59%|ββββββ | 732/1250 [18:00<13:59, 1.62s/it]
Training 1/1 epoch (loss 2.4945): 59%|ββββββ | 733/1250 [18:00<12:40, 1.47s/it]
Training 1/1 epoch (loss 2.6101): 59%|ββββββ | 733/1250 [18:02<12:40, 1.47s/it]
Training 1/1 epoch (loss 2.6101): 59%|ββββββ | 734/1250 [18:02<13:46, 1.60s/it]
Training 1/1 epoch (loss 2.5340): 59%|ββββββ | 734/1250 [18:04<13:46, 1.60s/it]
Training 1/1 epoch (loss 2.5340): 59%|ββββββ | 735/1250 [18:04<13:25, 1.56s/it]
Training 1/1 epoch (loss 2.7817): 59%|ββββββ | 735/1250 [18:04<13:25, 1.56s/it]
Training 1/1 epoch (loss 2.7817): 59%|ββββββ | 736/1250 [18:04<10:54, 1.27s/it]
Training 1/1 epoch (loss 2.4524): 59%|ββββββ | 736/1250 [18:07<10:54, 1.27s/it]
Training 1/1 epoch (loss 2.4524): 59%|ββββββ | 737/1250 [18:07<13:50, 1.62s/it]
Training 1/1 epoch (loss 2.5512): 59%|ββββββ | 737/1250 [18:08<13:50, 1.62s/it]
Training 1/1 epoch (loss 2.5512): 59%|ββββββ | 738/1250 [18:08<14:37, 1.71s/it]
Training 1/1 epoch (loss 2.6145): 59%|ββββββ | 738/1250 [18:09<14:37, 1.71s/it]
Training 1/1 epoch (loss 2.6145): 59%|ββββββ | 739/1250 [18:09<11:13, 1.32s/it]
Training 1/1 epoch (loss 2.5717): 59%|ββββββ | 739/1250 [18:11<11:13, 1.32s/it]
Training 1/1 epoch (loss 2.5717): 59%|ββββββ | 740/1250 [18:11<12:27, 1.47s/it]
Training 1/1 epoch (loss 2.5406): 59%|ββββββ | 740/1250 [18:12<12:27, 1.47s/it]
Training 1/1 epoch (loss 2.5406): 59%|ββββββ | 741/1250 [18:12<11:46, 1.39s/it]
Training 1/1 epoch (loss 2.6900): 59%|ββββββ | 741/1250 [18:13<11:46, 1.39s/it]
Training 1/1 epoch (loss 2.6900): 59%|ββββββ | 742/1250 [18:13<09:50, 1.16s/it]
Training 1/1 epoch (loss 2.6204): 59%|ββββββ | 742/1250 [18:14<09:50, 1.16s/it]
Training 1/1 epoch (loss 2.6204): 59%|ββββββ | 743/1250 [18:14<10:36, 1.25s/it]
Training 1/1 epoch (loss 2.6929): 59%|ββββββ | 743/1250 [18:17<10:36, 1.25s/it]
Training 1/1 epoch (loss 2.6929): 60%|ββββββ | 744/1250 [18:17<14:11, 1.68s/it]
Training 1/1 epoch (loss 2.5914): 60%|ββββββ | 744/1250 [18:18<14:11, 1.68s/it]
Training 1/1 epoch (loss 2.5914): 60%|ββββββ | 745/1250 [18:18<12:21, 1.47s/it]
Training 1/1 epoch (loss 2.6766): 60%|ββββββ | 745/1250 [18:20<12:21, 1.47s/it]
Training 1/1 epoch (loss 2.6766): 60%|ββββββ | 746/1250 [18:20<14:46, 1.76s/it]
Training 1/1 epoch (loss 2.4528): 60%|ββββββ | 746/1250 [18:21<14:46, 1.76s/it]
Training 1/1 epoch (loss 2.4528): 60%|ββββββ | 747/1250 [18:21<13:32, 1.61s/it]
Training 1/1 epoch (loss 2.6642): 60%|ββββββ | 747/1250 [18:22<13:32, 1.61s/it]
Training 1/1 epoch (loss 2.6642): 60%|ββββββ | 748/1250 [18:22<11:49, 1.41s/it]
Training 1/1 epoch (loss 2.4567): 60%|ββββββ | 748/1250 [18:24<11:49, 1.41s/it]
Training 1/1 epoch (loss 2.4567): 60%|ββββββ | 749/1250 [18:24<13:05, 1.57s/it]
Training 1/1 epoch (loss 2.6649): 60%|ββββββ | 749/1250 [18:26<13:05, 1.57s/it]
Training 1/1 epoch (loss 2.6649): 60%|ββββββ | 750/1250 [18:26<12:57, 1.56s/it]
Training 1/1 epoch (loss 2.4400): 60%|ββββββ | 750/1250 [18:27<12:57, 1.56s/it]
Training 1/1 epoch (loss 2.4400): 60%|ββββββ | 751/1250 [18:27<12:09, 1.46s/it]
Training 1/1 epoch (loss 2.6450): 60%|ββββββ | 751/1250 [18:29<12:09, 1.46s/it]
Training 1/1 epoch (loss 2.6450): 60%|ββββββ | 752/1250 [18:29<12:20, 1.49s/it]
Training 1/1 epoch (loss 2.7684): 60%|ββββββ | 752/1250 [18:29<12:20, 1.49s/it]
Training 1/1 epoch (loss 2.7684): 60%|ββββββ | 753/1250 [18:29<10:53, 1.31s/it]
Training 1/1 epoch (loss 2.7738): 60%|ββββββ | 753/1250 [18:31<10:53, 1.31s/it]
Training 1/1 epoch (loss 2.7738): 60%|ββββββ | 754/1250 [18:31<12:02, 1.46s/it]
Training 1/1 epoch (loss 2.5576): 60%|ββββββ | 754/1250 [18:33<12:02, 1.46s/it]
Training 1/1 epoch (loss 2.5576): 60%|ββββββ | 755/1250 [18:33<12:49, 1.56s/it]
Training 1/1 epoch (loss 2.7882): 60%|ββββββ | 755/1250 [18:34<12:49, 1.56s/it]
Training 1/1 epoch (loss 2.7882): 60%|ββββββ | 756/1250 [18:34<10:26, 1.27s/it]
Training 1/1 epoch (loss 2.5757): 60%|ββββββ | 756/1250 [18:35<10:26, 1.27s/it]
Training 1/1 epoch (loss 2.5757): 61%|ββββββ | 757/1250 [18:35<10:40, 1.30s/it]
Training 1/1 epoch (loss 2.5358): 61%|ββββββ | 757/1250 [18:37<10:40, 1.30s/it]
Training 1/1 epoch (loss 2.5358): 61%|ββββββ | 758/1250 [18:37<12:08, 1.48s/it]
Training 1/1 epoch (loss 2.6014): 61%|ββββββ | 758/1250 [18:38<12:08, 1.48s/it]
Training 1/1 epoch (loss 2.6014): 61%|ββββββ | 759/1250 [18:38<10:28, 1.28s/it]
Training 1/1 epoch (loss 2.6589): 61%|ββββββ | 759/1250 [18:40<10:28, 1.28s/it]
Training 1/1 epoch (loss 2.6589): 61%|ββββββ | 760/1250 [18:40<12:29, 1.53s/it]
Training 1/1 epoch (loss 2.6697): 61%|ββββββ | 760/1250 [18:42<12:29, 1.53s/it]
Training 1/1 epoch (loss 2.6697): 61%|ββββββ | 761/1250 [18:42<14:42, 1.80s/it]
Training 1/1 epoch (loss 2.4782): 61%|ββββββ | 761/1250 [18:43<14:42, 1.80s/it]
Training 1/1 epoch (loss 2.4782): 61%|ββββββ | 762/1250 [18:43<12:20, 1.52s/it]
Training 1/1 epoch (loss 2.4893): 61%|ββββββ | 762/1250 [18:45<12:20, 1.52s/it]
Training 1/1 epoch (loss 2.4893): 61%|ββββββ | 763/1250 [18:45<12:53, 1.59s/it]
Training 1/1 epoch (loss 2.6189): 61%|ββββββ | 763/1250 [18:46<12:53, 1.59s/it]
Training 1/1 epoch (loss 2.6189): 61%|ββββββ | 764/1250 [18:46<12:11, 1.50s/it]
Training 1/1 epoch (loss 2.6710): 61%|ββββββ | 764/1250 [18:47<12:11, 1.50s/it]
Training 1/1 epoch (loss 2.6710): 61%|ββββββ | 765/1250 [18:47<10:40, 1.32s/it]
Training 1/1 epoch (loss 2.3150): 61%|ββββββ | 765/1250 [18:48<10:40, 1.32s/it]
Training 1/1 epoch (loss 2.3150): 61%|βββββββ | 766/1250 [18:48<10:46, 1.33s/it]
Training 1/1 epoch (loss 2.4091): 61%|βββββββ | 766/1250 [18:50<10:46, 1.33s/it]
Training 1/1 epoch (loss 2.4091): 61%|βββββββ | 767/1250 [18:50<11:00, 1.37s/it]
Training 1/1 epoch (loss 2.6093): 61%|βββββββ | 767/1250 [18:51<11:00, 1.37s/it]
Training 1/1 epoch (loss 2.6093): 61%|βββββββ | 768/1250 [18:51<09:30, 1.18s/it]
Training 1/1 epoch (loss 2.7798): 61%|βββββββ | 768/1250 [18:53<09:30, 1.18s/it]
Training 1/1 epoch (loss 2.7798): 62%|βββββββ | 769/1250 [18:53<11:20, 1.41s/it]
Training 1/1 epoch (loss 2.6157): 62%|βββββββ | 769/1250 [18:53<11:20, 1.41s/it]
Training 1/1 epoch (loss 2.6157): 62%|βββββββ | 770/1250 [18:53<09:37, 1.20s/it]
Training 1/1 epoch (loss 2.6638): 62%|βββββββ | 770/1250 [18:55<09:37, 1.20s/it]
Training 1/1 epoch (loss 2.6638): 62%|βββββββ | 771/1250 [18:55<11:39, 1.46s/it]
Training 1/1 epoch (loss 2.7043): 62%|βββββββ | 771/1250 [18:58<11:39, 1.46s/it]
Training 1/1 epoch (loss 2.7043): 62%|βββββββ | 772/1250 [18:58<13:40, 1.72s/it]
Training 1/1 epoch (loss 2.5573): 62%|βββββββ | 772/1250 [18:58<13:40, 1.72s/it]
Training 1/1 epoch (loss 2.5573): 62%|βββββββ | 773/1250 [18:58<10:49, 1.36s/it]
Training 1/1 epoch (loss 2.5263): 62%|βββββββ | 773/1250 [19:00<10:49, 1.36s/it]
Training 1/1 epoch (loss 2.5263): 62%|βββββββ | 774/1250 [19:00<11:56, 1.51s/it]
Training 1/1 epoch (loss 2.6907): 62%|βββββββ | 774/1250 [19:02<11:56, 1.51s/it]
Training 1/1 epoch (loss 2.6907): 62%|βββββββ | 775/1250 [19:02<12:18, 1.55s/it]
Training 1/1 epoch (loss 2.5233): 62%|βββββββ | 775/1250 [19:03<12:18, 1.55s/it]
Training 1/1 epoch (loss 2.5233): 62%|βββββββ | 776/1250 [19:03<11:03, 1.40s/it]
Training 1/1 epoch (loss 2.6244): 62%|βββββββ | 776/1250 [19:04<11:03, 1.40s/it]
Training 1/1 epoch (loss 2.6244): 62%|βββββββ | 777/1250 [19:04<11:21, 1.44s/it]
Training 1/1 epoch (loss 2.7002): 62%|βββββββ | 777/1250 [19:05<11:21, 1.44s/it]
Training 1/1 epoch (loss 2.7002): 62%|βββββββ | 778/1250 [19:05<09:26, 1.20s/it]
Training 1/1 epoch (loss 2.5338): 62%|βββββββ | 778/1250 [19:07<09:26, 1.20s/it]
Training 1/1 epoch (loss 2.5338): 62%|βββββββ | 779/1250 [19:07<11:39, 1.49s/it]
Training 1/1 epoch (loss 2.8167): 62%|βββββββ | 779/1250 [19:09<11:39, 1.49s/it]
Training 1/1 epoch (loss 2.8167): 62%|βββββββ | 780/1250 [19:09<13:10, 1.68s/it]
Training 1/1 epoch (loss 2.4582): 62%|βββββββ | 780/1250 [19:10<13:10, 1.68s/it]
Training 1/1 epoch (loss 2.4582): 62%|βββββββ | 781/1250 [19:10<10:33, 1.35s/it]
Training 1/1 epoch (loss 2.6639): 62%|βββββββ | 781/1250 [19:12<10:33, 1.35s/it]
Training 1/1 epoch (loss 2.6639): 63%|βββββββ | 782/1250 [19:12<11:38, 1.49s/it]
Training 1/1 epoch (loss 2.6254): 63%|βββββββ | 782/1250 [19:13<11:38, 1.49s/it]
Training 1/1 epoch (loss 2.6254): 63%|βββββββ | 783/1250 [19:13<10:42, 1.38s/it]
Training 1/1 epoch (loss 2.4882): 63%|βββββββ | 783/1250 [19:14<10:42, 1.38s/it]
Training 1/1 epoch (loss 2.4882): 63%|βββββββ | 784/1250 [19:14<09:42, 1.25s/it]
Training 1/1 epoch (loss 2.7216): 63%|βββββββ | 784/1250 [19:16<09:42, 1.25s/it]
Training 1/1 epoch (loss 2.7216): 63%|βββββββ | 785/1250 [19:16<12:32, 1.62s/it]
Training 1/1 epoch (loss 2.6348): 63%|βββββββ | 785/1250 [19:17<12:32, 1.62s/it]
Training 1/1 epoch (loss 2.6348): 63%|βββββββ | 786/1250 [19:17<11:33, 1.49s/it]
Training 1/1 epoch (loss 2.6470): 63%|βββββββ | 786/1250 [19:19<11:33, 1.49s/it]
Training 1/1 epoch (loss 2.6470): 63%|βββββββ | 787/1250 [19:19<12:20, 1.60s/it]
Training 1/1 epoch (loss 2.4259): 63%|βββββββ | 787/1250 [19:20<12:20, 1.60s/it]
Training 1/1 epoch (loss 2.4259): 63%|βββββββ | 788/1250 [19:20<11:26, 1.48s/it]
Training 1/1 epoch (loss 2.4005): 63%|βββββββ | 788/1250 [19:22<11:26, 1.48s/it]
Training 1/1 epoch (loss 2.4005): 63%|βββββββ | 789/1250 [19:22<11:10, 1.46s/it]
Training 1/1 epoch (loss 2.5451): 63%|βββββββ | 789/1250 [19:24<11:10, 1.46s/it]
Training 1/1 epoch (loss 2.5451): 63%|βββββββ | 790/1250 [19:24<11:51, 1.55s/it]
Training 1/1 epoch (loss 2.5916): 63%|βββββββ | 790/1250 [19:24<11:51, 1.55s/it]
Training 1/1 epoch (loss 2.5916): 63%|βββββββ | 791/1250 [19:24<10:11, 1.33s/it]
Training 1/1 epoch (loss 2.6291): 63%|βββββββ | 791/1250 [19:26<10:11, 1.33s/it]
Training 1/1 epoch (loss 2.6291): 63%|βββββββ | 792/1250 [19:26<10:11, 1.34s/it]
Training 1/1 epoch (loss 2.4113): 63%|βββββββ | 792/1250 [19:28<10:11, 1.34s/it]
Training 1/1 epoch (loss 2.4113): 63%|βββββββ | 793/1250 [19:28<11:26, 1.50s/it]
Training 1/1 epoch (loss 2.5931): 63%|βββββββ | 793/1250 [19:28<11:26, 1.50s/it]
Training 1/1 epoch (loss 2.5931): 64%|βββββββ | 794/1250 [19:28<09:03, 1.19s/it]
Training 1/1 epoch (loss 2.7272): 64%|βββββββ | 794/1250 [19:30<09:03, 1.19s/it]
Training 1/1 epoch (loss 2.7272): 64%|βββββββ | 795/1250 [19:30<09:34, 1.26s/it]
Training 1/1 epoch (loss 2.5390): 64%|βββββββ | 795/1250 [19:31<09:34, 1.26s/it]
Training 1/1 epoch (loss 2.5390): 64%|βββββββ | 796/1250 [19:31<10:40, 1.41s/it]
Training 1/1 epoch (loss 2.4921): 64%|βββββββ | 796/1250 [19:32<10:40, 1.41s/it]
Training 1/1 epoch (loss 2.4921): 64%|βββββββ | 797/1250 [19:32<08:49, 1.17s/it]
Training 1/1 epoch (loss 2.7276): 64%|βββββββ | 797/1250 [19:34<08:49, 1.17s/it]
Training 1/1 epoch (loss 2.7276): 64%|βββββββ | 798/1250 [19:34<11:12, 1.49s/it]
Training 1/1 epoch (loss 2.6862): 64%|βββββββ | 798/1250 [19:36<11:12, 1.49s/it]
Training 1/1 epoch (loss 2.6862): 64%|βββββββ | 799/1250 [19:36<11:16, 1.50s/it]
Training 1/1 epoch (loss 2.6211): 64%|βββββββ | 799/1250 [19:37<11:16, 1.50s/it]
Training 1/1 epoch (loss 2.6211): 64%|βββββββ | 800/1250 [19:37<09:52, 1.32s/it]
Training 1/1 epoch (loss 2.5910): 64%|βββββββ | 800/1250 [19:38<09:52, 1.32s/it]
Training 1/1 epoch (loss 2.5910): 64%|βββββββ | 801/1250 [19:38<10:34, 1.41s/it]
Training 1/1 epoch (loss 2.7081): 64%|βββββββ | 801/1250 [19:39<10:34, 1.41s/it]
Training 1/1 epoch (loss 2.7081): 64%|βββββββ | 802/1250 [19:39<09:14, 1.24s/it]
Training 1/1 epoch (loss 2.6299): 64%|βββββββ | 802/1250 [19:41<09:14, 1.24s/it]
Training 1/1 epoch (loss 2.6299): 64%|βββββββ | 803/1250 [19:41<10:17, 1.38s/it]
Training 1/1 epoch (loss 2.5030): 64%|βββββββ | 803/1250 [19:43<10:17, 1.38s/it]
Training 1/1 epoch (loss 2.5030): 64%|βββββββ | 804/1250 [19:43<12:14, 1.65s/it]
Training 1/1 epoch (loss 2.6330): 64%|βββββββ | 804/1250 [19:44<12:14, 1.65s/it]
Training 1/1 epoch (loss 2.6330): 64%|βββββββ | 805/1250 [19:44<09:54, 1.34s/it]
Training 1/1 epoch (loss 2.5741): 64%|βββββββ | 805/1250 [19:46<09:54, 1.34s/it]
Training 1/1 epoch (loss 2.5741): 64%|βββββββ | 806/1250 [19:46<11:29, 1.55s/it]
Training 1/1 epoch (loss 2.6079): 64%|βββββββ | 806/1250 [19:47<11:29, 1.55s/it]
Training 1/1 epoch (loss 2.6079): 65%|βββββββ | 807/1250 [19:47<11:45, 1.59s/it]
Training 1/1 epoch (loss 2.5071): 65%|βββββββ | 807/1250 [19:48<11:45, 1.59s/it]
Training 1/1 epoch (loss 2.5071): 65%|βββββββ | 808/1250 [19:48<09:37, 1.31s/it]
Training 1/1 epoch (loss 2.5349): 65%|βββββββ | 808/1250 [19:49<09:37, 1.31s/it]
Training 1/1 epoch (loss 2.5349): 65%|βββββββ | 809/1250 [19:49<09:26, 1.28s/it]
Training 1/1 epoch (loss 2.6618): 65%|βββββββ | 809/1250 [19:51<09:26, 1.28s/it]
Training 1/1 epoch (loss 2.6618): 65%|βββββββ | 810/1250 [19:51<10:55, 1.49s/it]
Training 1/1 epoch (loss 2.5876): 65%|βββββββ | 810/1250 [19:52<10:55, 1.49s/it]
Training 1/1 epoch (loss 2.5876): 65%|βββββββ | 811/1250 [19:52<09:05, 1.24s/it]
Training 1/1 epoch (loss 2.6055): 65%|βββββββ | 811/1250 [19:54<09:05, 1.24s/it]
Training 1/1 epoch (loss 2.6055): 65%|βββββββ | 812/1250 [19:54<11:11, 1.53s/it]
Training 1/1 epoch (loss 2.5211): 65%|βββββββ | 812/1250 [19:56<11:11, 1.53s/it]
Training 1/1 epoch (loss 2.5211): 65%|βββββββ | 813/1250 [19:56<11:18, 1.55s/it]
Training 1/1 epoch (loss 2.6334): 65%|βββββββ | 813/1250 [19:56<11:18, 1.55s/it]
Training 1/1 epoch (loss 2.6334): 65%|βββββββ | 814/1250 [19:56<09:02, 1.25s/it]
Training 1/1 epoch (loss 2.6441): 65%|βββββββ | 814/1250 [19:58<09:02, 1.25s/it]
Training 1/1 epoch (loss 2.6441): 65%|βββββββ | 815/1250 [19:58<09:32, 1.31s/it]
Training 1/1 epoch (loss 2.7660): 65%|βββββββ | 815/1250 [19:59<09:32, 1.31s/it]
Training 1/1 epoch (loss 2.7660): 65%|βββββββ | 816/1250 [19:59<09:30, 1.31s/it]
Training 1/1 epoch (loss 2.6304): 65%|βββββββ | 816/1250 [20:00<09:30, 1.31s/it]
Training 1/1 epoch (loss 2.6304): 65%|βββββββ | 817/1250 [20:00<08:03, 1.12s/it]
Training 1/1 epoch (loss 2.4917): 65%|βββββββ | 817/1250 [20:01<08:03, 1.12s/it]
Training 1/1 epoch (loss 2.4917): 65%|βββββββ | 818/1250 [20:01<08:18, 1.15s/it]
Training 1/1 epoch (loss 2.7647): 65%|βββββββ | 818/1250 [20:02<08:18, 1.15s/it]
Training 1/1 epoch (loss 2.7647): 66%|βββββββ | 819/1250 [20:02<08:23, 1.17s/it]
Training 1/1 epoch (loss 2.5597): 66%|βββββββ | 819/1250 [20:04<08:23, 1.17s/it]
Training 1/1 epoch (loss 2.5597): 66%|βββββββ | 820/1250 [20:04<08:54, 1.24s/it]
Training 1/1 epoch (loss 2.4858): 66%|βββββββ | 820/1250 [20:06<08:54, 1.24s/it]
Training 1/1 epoch (loss 2.4858): 66%|βββββββ | 821/1250 [20:06<11:11, 1.57s/it]
Training 1/1 epoch (loss 2.4307): 66%|βββββββ | 821/1250 [20:07<11:11, 1.57s/it]
Training 1/1 epoch (loss 2.4307): 66%|βββββββ | 822/1250 [20:07<09:33, 1.34s/it]
Training 1/1 epoch (loss 2.4565): 66%|βββββββ | 822/1250 [20:08<09:33, 1.34s/it]
Training 1/1 epoch (loss 2.4565): 66%|βββββββ | 823/1250 [20:08<09:28, 1.33s/it]
Training 1/1 epoch (loss 2.4885): 66%|βββββββ | 823/1250 [20:10<09:28, 1.33s/it]
Training 1/1 epoch (loss 2.4885): 66%|βββββββ | 824/1250 [20:10<11:45, 1.66s/it]
Training 1/1 epoch (loss 2.5281): 66%|βββββββ | 824/1250 [20:11<11:45, 1.66s/it]
Training 1/1 epoch (loss 2.5281): 66%|βββββββ | 825/1250 [20:11<09:52, 1.39s/it]
Training 1/1 epoch (loss 2.7554): 66%|βββββββ | 825/1250 [20:13<09:52, 1.39s/it]
Training 1/1 epoch (loss 2.7554): 66%|βββββββ | 826/1250 [20:13<11:11, 1.58s/it]
Training 1/1 epoch (loss 2.6348): 66%|βββββββ | 826/1250 [20:14<11:11, 1.58s/it]
Training 1/1 epoch (loss 2.6348): 66%|βββββββ | 827/1250 [20:14<10:19, 1.46s/it]
Training 1/1 epoch (loss 2.5573): 66%|βββββββ | 827/1250 [20:16<10:19, 1.46s/it]
Training 1/1 epoch (loss 2.5573): 66%|βββββββ | 828/1250 [20:16<10:12, 1.45s/it]
Training 1/1 epoch (loss 2.5925): 66%|βββββββ | 828/1250 [20:18<10:12, 1.45s/it]
Training 1/1 epoch (loss 2.5925): 66%|βββββββ | 829/1250 [20:18<10:52, 1.55s/it]
Training 1/1 epoch (loss 2.5363): 66%|βββββββ | 829/1250 [20:18<10:52, 1.55s/it]
Training 1/1 epoch (loss 2.5363): 66%|βββββββ | 830/1250 [20:18<09:21, 1.34s/it]
Training 1/1 epoch (loss 2.5350): 66%|βββββββ | 830/1250 [20:19<09:21, 1.34s/it]
Training 1/1 epoch (loss 2.5350): 66%|βββββββ | 831/1250 [20:19<08:37, 1.24s/it]
Training 1/1 epoch (loss 2.7175): 66%|βββββββ | 831/1250 [20:22<08:37, 1.24s/it]
Training 1/1 epoch (loss 2.7175): 67%|βββββββ | 832/1250 [20:22<10:30, 1.51s/it]
Training 1/1 epoch (loss 2.5325): 67%|βββββββ | 832/1250 [20:22<10:30, 1.51s/it]
Training 1/1 epoch (loss 2.5325): 67%|βββββββ | 833/1250 [20:22<09:15, 1.33s/it]
Training 1/1 epoch (loss 2.6689): 67%|βββββββ | 833/1250 [20:24<09:15, 1.33s/it]
Training 1/1 epoch (loss 2.6689): 67%|βββββββ | 834/1250 [20:24<08:51, 1.28s/it]
Training 1/1 epoch (loss 2.7942): 67%|βββββββ | 834/1250 [20:26<08:51, 1.28s/it]
Training 1/1 epoch (loss 2.7942): 67%|βββββββ | 835/1250 [20:26<10:15, 1.48s/it]
Training 1/1 epoch (loss 2.5888): 67%|βββββββ | 835/1250 [20:26<10:15, 1.48s/it]
Training 1/1 epoch (loss 2.5888): 67%|βββββββ | 836/1250 [20:26<08:36, 1.25s/it]
Training 1/1 epoch (loss 2.5762): 67%|βββββββ | 836/1250 [20:28<08:36, 1.25s/it]
Training 1/1 epoch (loss 2.5762): 67%|βββββββ | 837/1250 [20:28<08:51, 1.29s/it]
Training 1/1 epoch (loss 2.6959): 67%|βββββββ | 837/1250 [20:29<08:51, 1.29s/it]
Training 1/1 epoch (loss 2.6959): 67%|βββββββ | 838/1250 [20:29<08:55, 1.30s/it]
Training 1/1 epoch (loss 2.6638): 67%|βββββββ | 838/1250 [20:29<08:55, 1.30s/it]
Training 1/1 epoch (loss 2.6638): 67%|βββββββ | 839/1250 [20:29<07:07, 1.04s/it]
Training 1/1 epoch (loss 2.3857): 67%|βββββββ | 839/1250 [20:31<07:07, 1.04s/it]
Training 1/1 epoch (loss 2.3857): 67%|βββββββ | 840/1250 [20:31<08:48, 1.29s/it]
Training 1/1 epoch (loss 2.5791): 67%|βββββββ | 840/1250 [20:33<08:48, 1.29s/it]
Training 1/1 epoch (loss 2.5791): 67%|βββββββ | 841/1250 [20:33<10:00, 1.47s/it]
Training 1/1 epoch (loss 2.7169): 67%|βββββββ | 841/1250 [20:34<10:00, 1.47s/it]
Training 1/1 epoch (loss 2.7169): 67%|βββββββ | 842/1250 [20:34<08:10, 1.20s/it]
Training 1/1 epoch (loss 2.5354): 67%|βββββββ | 842/1250 [20:35<08:10, 1.20s/it]
Training 1/1 epoch (loss 2.5354): 67%|βββββββ | 843/1250 [20:35<07:51, 1.16s/it]
Training 1/1 epoch (loss 2.7006): 67%|βββββββ | 843/1250 [20:37<07:51, 1.16s/it]
Training 1/1 epoch (loss 2.7006): 68%|βββββββ | 844/1250 [20:37<10:12, 1.51s/it]
Training 1/1 epoch (loss 2.5832): 68%|βββββββ | 844/1250 [20:38<10:12, 1.51s/it]
Training 1/1 epoch (loss 2.5832): 68%|βββββββ | 845/1250 [20:38<09:48, 1.45s/it]
Training 1/1 epoch (loss 2.6906): 68%|βββββββ | 845/1250 [20:40<09:48, 1.45s/it]
Training 1/1 epoch (loss 2.6906): 68%|βββββββ | 846/1250 [20:40<10:41, 1.59s/it]
Training 1/1 epoch (loss 2.5068): 68%|βββββββ | 846/1250 [20:42<10:41, 1.59s/it]
Training 1/1 epoch (loss 2.5068): 68%|βββββββ | 847/1250 [20:42<10:35, 1.58s/it]
Training 1/1 epoch (loss 2.6124): 68%|βββββββ | 847/1250 [20:44<10:35, 1.58s/it]
Training 1/1 epoch (loss 2.6124): 68%|βββββββ | 848/1250 [20:44<10:52, 1.62s/it]
Training 1/1 epoch (loss 2.6230): 68%|βββββββ | 848/1250 [20:45<10:52, 1.62s/it]
Training 1/1 epoch (loss 2.6230): 68%|βββββββ | 849/1250 [20:45<10:35, 1.59s/it]
Training 1/1 epoch (loss 2.7716): 68%|βββββββ | 849/1250 [20:46<10:35, 1.59s/it]
Training 1/1 epoch (loss 2.7716): 68%|βββββββ | 850/1250 [20:46<08:25, 1.26s/it]
Training 1/1 epoch (loss 2.5207): 68%|βββββββ | 850/1250 [20:47<08:25, 1.26s/it]
Training 1/1 epoch (loss 2.5207): 68%|βββββββ | 851/1250 [20:47<09:20, 1.40s/it]
Training 1/1 epoch (loss 2.4372): 68%|βββββββ | 851/1250 [20:49<09:20, 1.40s/it]
Training 1/1 epoch (loss 2.4372): 68%|βββββββ | 852/1250 [20:49<09:14, 1.39s/it]
Training 1/1 epoch (loss 2.7808): 68%|βββββββ | 852/1250 [20:49<09:14, 1.39s/it]
Training 1/1 epoch (loss 2.7808): 68%|βββββββ | 853/1250 [20:49<07:18, 1.10s/it]
Training 1/1 epoch (loss 2.5053): 68%|βββββββ | 853/1250 [20:50<07:18, 1.10s/it]
Training 1/1 epoch (loss 2.5053): 68%|βββββββ | 854/1250 [20:50<07:40, 1.16s/it]
Training 1/1 epoch (loss 2.7193): 68%|βββββββ | 854/1250 [20:53<07:40, 1.16s/it]
Training 1/1 epoch (loss 2.7193): 68%|βββββββ | 855/1250 [20:53<09:58, 1.51s/it]
Training 1/1 epoch (loss 2.3415): 68%|βββββββ | 855/1250 [20:53<09:58, 1.51s/it]
Training 1/1 epoch (loss 2.3415): 68%|βββββββ | 856/1250 [20:53<08:09, 1.24s/it]
Training 1/1 epoch (loss 2.6195): 68%|βββββββ | 856/1250 [20:55<08:09, 1.24s/it]
Training 1/1 epoch (loss 2.6195): 69%|βββββββ | 857/1250 [20:55<08:46, 1.34s/it]
Training 1/1 epoch (loss 2.5837): 69%|βββββββ | 857/1250 [20:56<08:46, 1.34s/it]
Training 1/1 epoch (loss 2.5837): 69%|βββββββ | 858/1250 [20:56<08:41, 1.33s/it]
Training 1/1 epoch (loss 2.5090): 69%|βββββββ | 858/1250 [20:57<08:41, 1.33s/it]
Training 1/1 epoch (loss 2.5090): 69%|βββββββ | 859/1250 [20:57<07:28, 1.15s/it]
Training 1/1 epoch (loss 2.5764): 69%|βββββββ | 859/1250 [20:59<07:28, 1.15s/it]
Training 1/1 epoch (loss 2.5764): 69%|βββββββ | 860/1250 [20:59<08:19, 1.28s/it]
Training 1/1 epoch (loss 2.6460): 69%|βββββββ | 860/1250 [21:00<08:19, 1.28s/it]
Training 1/1 epoch (loss 2.6460): 69%|βββββββ | 861/1250 [21:00<07:58, 1.23s/it]
Training 1/1 epoch (loss 2.5679): 69%|βββββββ | 861/1250 [21:01<07:58, 1.23s/it]
Training 1/1 epoch (loss 2.5679): 69%|βββββββ | 862/1250 [21:01<07:49, 1.21s/it]
Training 1/1 epoch (loss 2.6031): 69%|βββββββ | 862/1250 [21:02<07:49, 1.21s/it]
Training 1/1 epoch (loss 2.6031): 69%|βββββββ | 863/1250 [21:02<08:01, 1.24s/it]
Training 1/1 epoch (loss 2.5348): 69%|βββββββ | 863/1250 [21:03<08:01, 1.24s/it]
Training 1/1 epoch (loss 2.5348): 69%|βββββββ | 864/1250 [21:03<07:21, 1.14s/it]
Training 1/1 epoch (loss 2.5406): 69%|βββββββ | 864/1250 [21:05<07:21, 1.14s/it]
Training 1/1 epoch (loss 2.5406): 69%|βββββββ | 865/1250 [21:05<08:58, 1.40s/it]
Training 1/1 epoch (loss 2.4610): 69%|βββββββ | 865/1250 [21:07<08:58, 1.40s/it]
Training 1/1 epoch (loss 2.4610): 69%|βββββββ | 866/1250 [21:07<09:48, 1.53s/it]
Training 1/1 epoch (loss 2.6391): 69%|βββββββ | 866/1250 [21:08<09:48, 1.53s/it]
Training 1/1 epoch (loss 2.6391): 69%|βββββββ | 867/1250 [21:08<07:57, 1.25s/it]
Training 1/1 epoch (loss 2.8108): 69%|βββββββ | 867/1250 [21:09<07:57, 1.25s/it]
Training 1/1 epoch (loss 2.8108): 69%|βββββββ | 868/1250 [21:09<09:09, 1.44s/it]
Training 1/1 epoch (loss 2.4407): 69%|βββββββ | 868/1250 [21:12<09:09, 1.44s/it]
Training 1/1 epoch (loss 2.4407): 70%|βββββββ | 869/1250 [21:12<11:08, 1.76s/it]
Training 1/1 epoch (loss 2.4503): 70%|βββββββ | 869/1250 [21:12<11:08, 1.76s/it]
Training 1/1 epoch (loss 2.4503): 70%|βββββββ | 870/1250 [21:12<08:38, 1.36s/it]
Training 1/1 epoch (loss 2.5171): 70%|βββββββ | 870/1250 [21:15<08:38, 1.36s/it]
Training 1/1 epoch (loss 2.5171): 70%|βββββββ | 871/1250 [21:15<10:37, 1.68s/it]
Training 1/1 epoch (loss 2.5934): 70%|βββββββ | 871/1250 [21:16<10:37, 1.68s/it]
Training 1/1 epoch (loss 2.5934): 70%|βββββββ | 872/1250 [21:16<10:01, 1.59s/it]
Training 1/1 epoch (loss 2.7718): 70%|βββββββ | 872/1250 [21:17<10:01, 1.59s/it]
Training 1/1 epoch (loss 2.7718): 70%|βββββββ | 873/1250 [21:17<08:28, 1.35s/it]
Training 1/1 epoch (loss 2.5617): 70%|βββββββ | 873/1250 [21:19<08:28, 1.35s/it]
Training 1/1 epoch (loss 2.5617): 70%|βββββββ | 874/1250 [21:19<10:34, 1.69s/it]
Training 1/1 epoch (loss 2.6053): 70%|βββββββ | 874/1250 [21:21<10:34, 1.69s/it]
Training 1/1 epoch (loss 2.6053): 70%|βββββββ | 875/1250 [21:21<10:18, 1.65s/it]
Training 1/1 epoch (loss 2.6612): 70%|βββββββ | 875/1250 [21:22<10:18, 1.65s/it]
Training 1/1 epoch (loss 2.6612): 70%|βββββββ | 876/1250 [21:22<08:55, 1.43s/it]
Training 1/1 epoch (loss 2.7251): 70%|βββββββ | 876/1250 [21:23<08:55, 1.43s/it]
Training 1/1 epoch (loss 2.7251): 70%|βββββββ | 877/1250 [21:23<08:39, 1.39s/it]
Training 1/1 epoch (loss 2.8156): 70%|βββββββ | 877/1250 [21:24<08:39, 1.39s/it]
Training 1/1 epoch (loss 2.8156): 70%|βββββββ | 878/1250 [21:24<08:22, 1.35s/it]
Training 1/1 epoch (loss 2.6279): 70%|βββββββ | 878/1250 [21:25<08:22, 1.35s/it]
Training 1/1 epoch (loss 2.6279): 70%|βββββββ | 879/1250 [21:25<07:33, 1.22s/it]
Training 1/1 epoch (loss 2.7227): 70%|βββββββ | 879/1250 [21:27<07:33, 1.22s/it]
Training 1/1 epoch (loss 2.7227): 70%|βββββββ | 880/1250 [21:27<08:10, 1.33s/it]
Training 1/1 epoch (loss 2.4996): 70%|βββββββ | 880/1250 [21:28<08:10, 1.33s/it]
Training 1/1 epoch (loss 2.4996): 70%|βββββββ | 881/1250 [21:28<07:23, 1.20s/it]
Training 1/1 epoch (loss 2.6115): 70%|βββββββ | 881/1250 [21:29<07:23, 1.20s/it]
Training 1/1 epoch (loss 2.6115): 71%|βββββββ | 882/1250 [21:29<06:33, 1.07s/it]
Training 1/1 epoch (loss 2.7681): 71%|βββββββ | 882/1250 [21:31<06:33, 1.07s/it]
Training 1/1 epoch (loss 2.7681): 71%|βββββββ | 883/1250 [21:31<08:10, 1.34s/it]
Training 1/1 epoch (loss 2.6786): 71%|βββββββ | 883/1250 [21:32<08:10, 1.34s/it]
Training 1/1 epoch (loss 2.6786): 71%|βββββββ | 884/1250 [21:32<08:06, 1.33s/it]
Training 1/1 epoch (loss 2.5621): 71%|βββββββ | 884/1250 [21:33<08:06, 1.33s/it]
Training 1/1 epoch (loss 2.5621): 71%|βββββββ | 885/1250 [21:33<07:56, 1.30s/it]
Training 1/1 epoch (loss 2.6602): 71%|βββββββ | 885/1250 [21:35<07:56, 1.30s/it]
Training 1/1 epoch (loss 2.6602): 71%|βββββββ | 886/1250 [21:35<08:55, 1.47s/it]
Training 1/1 epoch (loss 2.5998): 71%|βββββββ | 886/1250 [21:36<08:55, 1.47s/it]
Training 1/1 epoch (loss 2.5998): 71%|βββββββ | 887/1250 [21:36<08:02, 1.33s/it]
Training 1/1 epoch (loss 2.5609): 71%|βββββββ | 887/1250 [21:38<08:02, 1.33s/it]
Training 1/1 epoch (loss 2.5609): 71%|βββββββ | 888/1250 [21:38<08:52, 1.47s/it]
Training 1/1 epoch (loss 2.6031): 71%|βββββββ | 888/1250 [21:40<08:52, 1.47s/it]
Training 1/1 epoch (loss 2.6031): 71%|βββββββ | 889/1250 [21:40<09:23, 1.56s/it]
Training 1/1 epoch (loss 2.6048): 71%|βββββββ | 889/1250 [21:40<09:23, 1.56s/it]
Training 1/1 epoch (loss 2.6048): 71%|βββββββ | 890/1250 [21:40<07:25, 1.24s/it]
Training 1/1 epoch (loss 2.7464): 71%|βββββββ | 890/1250 [21:42<07:25, 1.24s/it]
Training 1/1 epoch (loss 2.7464): 71%|ββββββββ | 891/1250 [21:42<09:29, 1.59s/it]
Training 1/1 epoch (loss 2.5917): 71%|ββββββββ | 891/1250 [21:45<09:29, 1.59s/it]
Training 1/1 epoch (loss 2.5917): 71%|ββββββββ | 892/1250 [21:45<10:59, 1.84s/it]
Training 1/1 epoch (loss 2.4278): 71%|ββββββββ | 892/1250 [21:45<10:59, 1.84s/it]
Training 1/1 epoch (loss 2.4278): 71%|ββββββββ | 893/1250 [21:45<08:28, 1.42s/it]
Training 1/1 epoch (loss 2.4345): 71%|ββββββββ | 893/1250 [21:47<08:28, 1.42s/it]
Training 1/1 epoch (loss 2.4345): 72%|ββββββββ | 894/1250 [21:47<09:16, 1.56s/it]
Training 1/1 epoch (loss 2.9056): 72%|ββββββββ | 894/1250 [21:49<09:16, 1.56s/it]
Training 1/1 epoch (loss 2.9056): 72%|ββββββββ | 895/1250 [21:49<09:30, 1.61s/it]
Training 1/1 epoch (loss 2.2784): 72%|ββββββββ | 895/1250 [21:50<09:30, 1.61s/it]
Training 1/1 epoch (loss 2.2784): 72%|ββββββββ | 896/1250 [21:50<08:15, 1.40s/it]
Training 1/1 epoch (loss 2.5317): 72%|ββββββββ | 896/1250 [21:52<08:15, 1.40s/it]
Training 1/1 epoch (loss 2.5317): 72%|ββββββββ | 897/1250 [21:52<09:09, 1.56s/it]
Training 1/1 epoch (loss 2.6922): 72%|ββββββββ | 897/1250 [21:53<09:09, 1.56s/it]
Training 1/1 epoch (loss 2.6922): 72%|ββββββββ | 898/1250 [21:53<07:58, 1.36s/it]
Training 1/1 epoch (loss 2.5874): 72%|ββββββββ | 898/1250 [21:55<07:58, 1.36s/it]
Training 1/1 epoch (loss 2.5874): 72%|ββββββββ | 899/1250 [21:55<09:00, 1.54s/it]
Training 1/1 epoch (loss 2.7648): 72%|ββββββββ | 899/1250 [21:56<09:00, 1.54s/it]
Training 1/1 epoch (loss 2.7648): 72%|ββββββββ | 900/1250 [21:56<08:58, 1.54s/it]
Training 1/1 epoch (loss 2.6152): 72%|ββββββββ | 900/1250 [21:57<08:58, 1.54s/it]
Training 1/1 epoch (loss 2.6152): 72%|ββββββββ | 901/1250 [21:57<07:53, 1.36s/it]
Training 1/1 epoch (loss 2.6407): 72%|ββββββββ | 901/1250 [21:59<07:53, 1.36s/it]
Training 1/1 epoch (loss 2.6407): 72%|ββββββββ | 902/1250 [21:59<09:17, 1.60s/it]
Training 1/1 epoch (loss 2.5982): 72%|ββββββββ | 902/1250 [22:02<09:17, 1.60s/it]
Training 1/1 epoch (loss 2.5982): 72%|ββββββββ | 903/1250 [22:02<10:26, 1.80s/it]
Training 1/1 epoch (loss 2.6316): 72%|ββββββββ | 903/1250 [22:03<10:26, 1.80s/it]
Training 1/1 epoch (loss 2.6316): 72%|ββββββββ | 904/1250 [22:03<09:40, 1.68s/it]
Training 1/1 epoch (loss 2.5046): 72%|ββββββββ | 904/1250 [22:05<09:40, 1.68s/it]
Training 1/1 epoch (loss 2.5046): 72%|ββββββββ | 905/1250 [22:05<10:04, 1.75s/it]
Training 1/1 epoch (loss 2.4795): 72%|ββββββββ | 905/1250 [22:06<10:04, 1.75s/it]
Training 1/1 epoch (loss 2.4795): 72%|ββββββββ | 906/1250 [22:06<08:55, 1.56s/it]
Training 1/1 epoch (loss 2.4291): 72%|ββββββββ | 906/1250 [22:07<08:55, 1.56s/it]
Training 1/1 epoch (loss 2.4291): 73%|ββββββββ | 907/1250 [22:07<07:39, 1.34s/it]
Training 1/1 epoch (loss 2.4618): 73%|ββββββββ | 907/1250 [22:08<07:39, 1.34s/it]
Training 1/1 epoch (loss 2.4618): 73%|ββββββββ | 908/1250 [22:08<08:00, 1.41s/it]
Training 1/1 epoch (loss 2.7279): 73%|ββββββββ | 908/1250 [22:09<08:00, 1.41s/it]
Training 1/1 epoch (loss 2.7279): 73%|ββββββββ | 909/1250 [22:09<07:28, 1.32s/it]
Training 1/1 epoch (loss 2.5039): 73%|ββββββββ | 909/1250 [22:11<07:28, 1.32s/it]
Training 1/1 epoch (loss 2.5039): 73%|ββββββββ | 910/1250 [22:11<07:51, 1.39s/it]
Training 1/1 epoch (loss 2.3722): 73%|ββββββββ | 910/1250 [22:13<07:51, 1.39s/it]
Training 1/1 epoch (loss 2.3722): 73%|ββββββββ | 911/1250 [22:13<08:08, 1.44s/it]
Training 1/1 epoch (loss 2.6333): 73%|ββββββββ | 911/1250 [22:13<08:08, 1.44s/it]
Training 1/1 epoch (loss 2.6333): 73%|ββββββββ | 912/1250 [22:13<06:51, 1.22s/it]
Training 1/1 epoch (loss 2.4461): 73%|ββββββββ | 912/1250 [22:15<06:51, 1.22s/it]
Training 1/1 epoch (loss 2.4461): 73%|ββββββββ | 913/1250 [22:15<07:27, 1.33s/it]
Training 1/1 epoch (loss 2.5899): 73%|ββββββββ | 913/1250 [22:16<07:27, 1.33s/it]
Training 1/1 epoch (loss 2.5899): 73%|ββββββββ | 914/1250 [22:16<07:43, 1.38s/it]
Training 1/1 epoch (loss 2.6921): 73%|ββββββββ | 914/1250 [22:18<07:43, 1.38s/it]
Training 1/1 epoch (loss 2.6921): 73%|ββββββββ | 915/1250 [22:18<07:33, 1.35s/it]
Training 1/1 epoch (loss 2.6312): 73%|ββββββββ | 915/1250 [22:19<07:33, 1.35s/it]
Training 1/1 epoch (loss 2.6312): 73%|ββββββββ | 916/1250 [22:19<07:33, 1.36s/it]
Training 1/1 epoch (loss 2.3663): 73%|ββββββββ | 916/1250 [22:20<07:33, 1.36s/it]
Training 1/1 epoch (loss 2.3663): 73%|ββββββββ | 917/1250 [22:20<06:39, 1.20s/it]
Training 1/1 epoch (loss 2.7745): 73%|ββββββββ | 917/1250 [22:21<06:39, 1.20s/it]
Training 1/1 epoch (loss 2.7745): 73%|ββββββββ | 918/1250 [22:21<06:20, 1.15s/it]
Training 1/1 epoch (loss 2.5045): 73%|ββββββββ | 918/1250 [22:22<06:20, 1.15s/it]
Training 1/1 epoch (loss 2.5045): 74%|ββββββββ | 919/1250 [22:22<06:51, 1.24s/it]
Training 1/1 epoch (loss 2.6749): 74%|ββββββββ | 919/1250 [22:23<06:51, 1.24s/it]
Training 1/1 epoch (loss 2.6749): 74%|ββββββββ | 920/1250 [22:23<05:49, 1.06s/it]
Training 1/1 epoch (loss 2.5836): 74%|ββββββββ | 920/1250 [22:25<05:49, 1.06s/it]
Training 1/1 epoch (loss 2.5836): 74%|ββββββββ | 921/1250 [22:25<08:04, 1.47s/it]
Training 1/1 epoch (loss 2.5361): 74%|ββββββββ | 921/1250 [22:27<08:04, 1.47s/it]
Training 1/1 epoch (loss 2.5361): 74%|ββββββββ | 922/1250 [22:27<08:20, 1.53s/it]
Training 1/1 epoch (loss 2.4722): 74%|ββββββββ | 922/1250 [22:28<08:20, 1.53s/it]
Training 1/1 epoch (loss 2.4722): 74%|ββββββββ | 923/1250 [22:28<06:45, 1.24s/it]
Training 1/1 epoch (loss 2.7755): 74%|ββββββββ | 923/1250 [22:29<06:45, 1.24s/it]
Training 1/1 epoch (loss 2.7755): 74%|ββββββββ | 924/1250 [22:29<07:26, 1.37s/it]
Training 1/1 epoch (loss 2.6048): 74%|ββββββββ | 924/1250 [22:31<07:26, 1.37s/it]
Training 1/1 epoch (loss 2.6048): 74%|ββββββββ | 925/1250 [22:31<07:55, 1.46s/it]
Training 1/1 epoch (loss 2.6251): 74%|ββββββββ | 925/1250 [22:32<07:55, 1.46s/it]
Training 1/1 epoch (loss 2.6251): 74%|ββββββββ | 926/1250 [22:32<07:13, 1.34s/it]
Training 1/1 epoch (loss 2.5272): 74%|ββββββββ | 926/1250 [22:34<07:13, 1.34s/it]
Training 1/1 epoch (loss 2.5272): 74%|ββββββββ | 927/1250 [22:34<08:15, 1.54s/it]
Training 1/1 epoch (loss 2.8139): 74%|ββββββββ | 927/1250 [22:36<08:15, 1.54s/it]
Training 1/1 epoch (loss 2.8139): 74%|ββββββββ | 928/1250 [22:36<08:21, 1.56s/it]
Training 1/1 epoch (loss 2.6710): 74%|ββββββββ | 928/1250 [22:37<08:21, 1.56s/it]
Training 1/1 epoch (loss 2.6710): 74%|ββββββββ | 929/1250 [22:37<07:21, 1.37s/it]
Training 1/1 epoch (loss 2.7887): 74%|ββββββββ | 929/1250 [22:39<07:21, 1.37s/it]
Training 1/1 epoch (loss 2.7887): 74%|ββββββββ | 930/1250 [22:39<09:05, 1.70s/it]
Training 1/1 epoch (loss 2.6824): 74%|ββββββββ | 930/1250 [22:40<09:05, 1.70s/it]
Training 1/1 epoch (loss 2.6824): 74%|ββββββββ | 931/1250 [22:40<08:11, 1.54s/it]
Training 1/1 epoch (loss 2.5978): 74%|ββββββββ | 931/1250 [22:42<08:11, 1.54s/it]
Training 1/1 epoch (loss 2.5978): 75%|ββββββββ | 932/1250 [22:42<08:37, 1.63s/it]
Training 1/1 epoch (loss 2.6021): 75%|ββββββββ | 932/1250 [22:44<08:37, 1.63s/it]
Training 1/1 epoch (loss 2.6021): 75%|ββββββββ | 933/1250 [22:44<08:54, 1.69s/it]
Training 1/1 epoch (loss 2.4795): 75%|ββββββββ | 933/1250 [22:45<08:54, 1.69s/it]
Training 1/1 epoch (loss 2.4795): 75%|ββββββββ | 934/1250 [22:45<07:13, 1.37s/it]
Training 1/1 epoch (loss 2.5124): 75%|ββββββββ | 934/1250 [22:46<07:13, 1.37s/it]
Training 1/1 epoch (loss 2.5124): 75%|ββββββββ | 935/1250 [22:46<07:23, 1.41s/it]
Training 1/1 epoch (loss 2.6426): 75%|ββββββββ | 935/1250 [22:48<07:23, 1.41s/it]
Training 1/1 epoch (loss 2.6426): 75%|ββββββββ | 936/1250 [22:48<07:39, 1.46s/it]
Training 1/1 epoch (loss 2.3844): 75%|ββββββββ | 936/1250 [22:48<07:39, 1.46s/it]
Training 1/1 epoch (loss 2.3844): 75%|ββββββββ | 937/1250 [22:48<06:38, 1.27s/it]
Training 1/1 epoch (loss 2.5120): 75%|ββββββββ | 937/1250 [22:50<06:38, 1.27s/it]
Training 1/1 epoch (loss 2.5120): 75%|ββββββββ | 938/1250 [22:50<07:24, 1.43s/it]
Training 1/1 epoch (loss 2.6529): 75%|ββββββββ | 938/1250 [22:51<07:24, 1.43s/it]
Training 1/1 epoch (loss 2.6529): 75%|ββββββββ | 939/1250 [22:51<06:58, 1.34s/it]
Training 1/1 epoch (loss 2.6475): 75%|ββββββββ | 939/1250 [22:53<06:58, 1.34s/it]
Training 1/1 epoch (loss 2.6475): 75%|ββββββββ | 940/1250 [22:53<07:34, 1.46s/it]
Training 1/1 epoch (loss 2.5826): 75%|ββββββββ | 940/1250 [22:55<07:34, 1.46s/it]
Training 1/1 epoch (loss 2.5826): 75%|ββββββββ | 941/1250 [22:55<07:41, 1.49s/it]
Training 1/1 epoch (loss 2.5538): 75%|ββββββββ | 941/1250 [22:55<07:41, 1.49s/it]
Training 1/1 epoch (loss 2.5538): 75%|ββββββββ | 942/1250 [22:55<06:20, 1.23s/it]
Training 1/1 epoch (loss 2.4481): 75%|ββββββββ | 942/1250 [22:57<06:20, 1.23s/it]
Training 1/1 epoch (loss 2.4481): 75%|ββββββββ | 943/1250 [22:57<07:13, 1.41s/it]
Training 1/1 epoch (loss 2.3013): 75%|ββββββββ | 943/1250 [22:58<07:13, 1.41s/it]
Training 1/1 epoch (loss 2.3013): 76%|ββββββββ | 944/1250 [22:58<07:04, 1.39s/it]
Training 1/1 epoch (loss 2.5500): 76%|ββββββββ | 944/1250 [22:59<07:04, 1.39s/it]
Training 1/1 epoch (loss 2.5500): 76%|ββββββββ | 945/1250 [22:59<05:56, 1.17s/it]
Training 1/1 epoch (loss 2.6140): 76%|ββββββββ | 945/1250 [23:02<05:56, 1.17s/it]
Training 1/1 epoch (loss 2.6140): 76%|ββββββββ | 946/1250 [23:02<07:54, 1.56s/it]
Training 1/1 epoch (loss 2.5047): 76%|ββββββββ | 946/1250 [23:03<07:54, 1.56s/it]
Training 1/1 epoch (loss 2.5047): 76%|ββββββββ | 947/1250 [23:03<07:33, 1.50s/it]
Training 1/1 epoch (loss 2.4620): 76%|ββββββββ | 947/1250 [23:04<07:33, 1.50s/it]
Training 1/1 epoch (loss 2.4620): 76%|ββββββββ | 948/1250 [23:04<06:09, 1.22s/it]
Training 1/1 epoch (loss 2.6158): 76%|ββββββββ | 948/1250 [23:05<06:09, 1.22s/it]
Training 1/1 epoch (loss 2.6158): 76%|ββββββββ | 949/1250 [23:05<06:59, 1.39s/it]
Training 1/1 epoch (loss 2.4767): 76%|ββββββββ | 949/1250 [23:07<06:59, 1.39s/it]
Training 1/1 epoch (loss 2.4767): 76%|ββββββββ | 950/1250 [23:07<07:34, 1.51s/it]
Training 1/1 epoch (loss 2.3984): 76%|ββββββββ | 950/1250 [23:08<07:34, 1.51s/it]
Training 1/1 epoch (loss 2.3984): 76%|ββββββββ | 951/1250 [23:08<06:09, 1.24s/it]
Training 1/1 epoch (loss 2.7034): 76%|ββββββββ | 951/1250 [23:10<06:09, 1.24s/it]
Training 1/1 epoch (loss 2.7034): 76%|ββββββββ | 952/1250 [23:10<07:44, 1.56s/it]
Training 1/1 epoch (loss 2.4641): 76%|ββββββββ | 952/1250 [23:12<07:44, 1.56s/it]
Training 1/1 epoch (loss 2.4641): 76%|ββββββββ | 953/1250 [23:12<07:49, 1.58s/it]
Training 1/1 epoch (loss 2.5437): 76%|ββββββββ | 953/1250 [23:12<07:49, 1.58s/it]
Training 1/1 epoch (loss 2.5437): 76%|ββββββββ | 954/1250 [23:12<06:08, 1.24s/it]
Training 1/1 epoch (loss 2.5106): 76%|ββββββββ | 954/1250 [23:13<06:08, 1.24s/it]
Training 1/1 epoch (loss 2.5106): 76%|ββββββββ | 955/1250 [23:13<06:16, 1.28s/it]
Training 1/1 epoch (loss 2.5911): 76%|ββββββββ | 955/1250 [23:15<06:16, 1.28s/it]
Training 1/1 epoch (loss 2.5911): 76%|ββββββββ | 956/1250 [23:15<06:50, 1.40s/it]
Training 1/1 epoch (loss 2.7537): 76%|ββββββββ | 956/1250 [23:16<06:50, 1.40s/it]
Training 1/1 epoch (loss 2.7537): 77%|ββββββββ | 957/1250 [23:16<05:30, 1.13s/it]
Training 1/1 epoch (loss 2.5620): 77%|ββββββββ | 957/1250 [23:17<05:30, 1.13s/it]
Training 1/1 epoch (loss 2.5620): 77%|ββββββββ | 958/1250 [23:17<06:11, 1.27s/it]
Training 1/1 epoch (loss 2.7914): 77%|ββββββββ | 958/1250 [23:20<06:11, 1.27s/it]
Training 1/1 epoch (loss 2.7914): 77%|ββββββββ | 959/1250 [23:20<07:36, 1.57s/it]
Training 1/1 epoch (loss 2.3507): 77%|ββββββββ | 959/1250 [23:20<07:36, 1.57s/it]
Training 1/1 epoch (loss 2.3507): 77%|ββββββββ | 960/1250 [23:20<05:54, 1.22s/it]
Training 1/1 epoch (loss 2.3302): 77%|ββββββββ | 960/1250 [23:22<05:54, 1.22s/it]
Training 1/1 epoch (loss 2.3302): 77%|ββββββββ | 961/1250 [23:22<06:30, 1.35s/it]
Training 1/1 epoch (loss 2.3539): 77%|ββββββββ | 961/1250 [23:23<06:30, 1.35s/it]
Training 1/1 epoch (loss 2.3539): 77%|ββββββββ | 962/1250 [23:23<06:52, 1.43s/it]
Training 1/1 epoch (loss 2.7459): 77%|ββββββββ | 962/1250 [23:25<06:52, 1.43s/it]
Training 1/1 epoch (loss 2.7459): 77%|ββββββββ | 963/1250 [23:25<06:46, 1.42s/it]
Training 1/1 epoch (loss 2.6836): 77%|ββββββββ | 963/1250 [23:26<06:46, 1.42s/it]
Training 1/1 epoch (loss 2.6836): 77%|ββββββββ | 964/1250 [23:26<06:48, 1.43s/it]
Training 1/1 epoch (loss 2.3475): 77%|ββββββββ | 964/1250 [23:27<06:48, 1.43s/it]
Training 1/1 epoch (loss 2.3475): 77%|ββββββββ | 965/1250 [23:27<05:54, 1.24s/it]
Training 1/1 epoch (loss 2.5563): 77%|ββββββββ | 965/1250 [23:29<05:54, 1.24s/it]
Training 1/1 epoch (loss 2.5563): 77%|ββββββββ | 966/1250 [23:29<06:29, 1.37s/it]
Training 1/1 epoch (loss 2.5153): 77%|ββββββββ | 966/1250 [23:31<06:29, 1.37s/it]
Training 1/1 epoch (loss 2.5153): 77%|ββββββββ | 967/1250 [23:31<07:27, 1.58s/it]
Training 1/1 epoch (loss 2.5604): 77%|ββββββββ | 967/1250 [23:31<07:27, 1.58s/it]
Training 1/1 epoch (loss 2.5604): 77%|ββββββββ | 968/1250 [23:31<06:22, 1.36s/it]
Training 1/1 epoch (loss 2.8741): 77%|ββββββββ | 968/1250 [23:33<06:22, 1.36s/it]
Training 1/1 epoch (loss 2.8741): 78%|ββββββββ | 969/1250 [23:33<07:10, 1.53s/it]
Training 1/1 epoch (loss 2.4924): 78%|ββββββββ | 969/1250 [23:35<07:10, 1.53s/it]
Training 1/1 epoch (loss 2.4924): 78%|ββββββββ | 970/1250 [23:35<07:07, 1.53s/it]
Training 1/1 epoch (loss 2.8911): 78%|ββββββββ | 970/1250 [23:36<07:07, 1.53s/it]
Training 1/1 epoch (loss 2.8911): 78%|ββββββββ | 971/1250 [23:36<05:53, 1.27s/it]
Training 1/1 epoch (loss 2.5558): 78%|ββββββββ | 971/1250 [23:37<05:53, 1.27s/it]
Training 1/1 epoch (loss 2.5558): 78%|ββββββββ | 972/1250 [23:37<05:54, 1.28s/it]
Training 1/1 epoch (loss 2.5788): 78%|ββββββββ | 972/1250 [23:39<05:54, 1.28s/it]
Training 1/1 epoch (loss 2.5788): 78%|ββββββββ | 973/1250 [23:39<06:59, 1.52s/it]
Training 1/1 epoch (loss 2.4504): 78%|ββββββββ | 973/1250 [23:40<06:59, 1.52s/it]
Training 1/1 epoch (loss 2.4504): 78%|ββββββββ | 974/1250 [23:40<05:50, 1.27s/it]
Training 1/1 epoch (loss 2.5891): 78%|ββββββββ | 974/1250 [23:42<05:50, 1.27s/it]
Training 1/1 epoch (loss 2.5891): 78%|ββββββββ | 975/1250 [23:42<06:49, 1.49s/it]
Training 1/1 epoch (loss 2.4772): 78%|ββββββββ | 975/1250 [23:43<06:49, 1.49s/it]
Training 1/1 epoch (loss 2.4772): 78%|ββββββββ | 976/1250 [23:43<06:19, 1.39s/it]
Training 1/1 epoch (loss 2.6558): 78%|ββββββββ | 976/1250 [23:44<06:19, 1.39s/it]
Training 1/1 epoch (loss 2.6558): 78%|ββββββββ | 977/1250 [23:44<06:02, 1.33s/it]
Training 1/1 epoch (loss 2.6577): 78%|ββββββββ | 977/1250 [23:46<06:02, 1.33s/it]
Training 1/1 epoch (loss 2.6577): 78%|ββββββββ | 978/1250 [23:46<06:40, 1.47s/it]
Training 1/1 epoch (loss 2.8601): 78%|ββββββββ | 978/1250 [23:47<06:40, 1.47s/it]
Training 1/1 epoch (loss 2.8601): 78%|ββββββββ | 979/1250 [23:47<06:15, 1.39s/it]
Training 1/1 epoch (loss 2.6597): 78%|ββββββββ | 979/1250 [23:48<06:15, 1.39s/it]
Training 1/1 epoch (loss 2.6597): 78%|ββββββββ | 980/1250 [23:48<05:58, 1.33s/it]
Training 1/1 epoch (loss 2.6060): 78%|ββββββββ | 980/1250 [23:50<05:58, 1.33s/it]
Training 1/1 epoch (loss 2.6060): 78%|ββββββββ | 981/1250 [23:50<06:37, 1.48s/it]
Training 1/1 epoch (loss 2.5961): 78%|ββββββββ | 981/1250 [23:51<06:37, 1.48s/it]
Training 1/1 epoch (loss 2.5961): 79%|ββββββββ | 982/1250 [23:51<05:42, 1.28s/it]
Training 1/1 epoch (loss 2.6592): 79%|ββββββββ | 982/1250 [23:52<05:42, 1.28s/it]
Training 1/1 epoch (loss 2.6592): 79%|ββββββββ | 983/1250 [23:52<05:39, 1.27s/it]
Training 1/1 epoch (loss 2.6301): 79%|ββββββββ | 983/1250 [23:54<05:39, 1.27s/it]
Training 1/1 epoch (loss 2.6301): 79%|ββββββββ | 984/1250 [23:54<06:47, 1.53s/it]
Training 1/1 epoch (loss 2.7192): 79%|ββββββββ | 984/1250 [23:55<06:47, 1.53s/it]
Training 1/1 epoch (loss 2.7192): 79%|ββββββββ | 985/1250 [23:55<05:24, 1.22s/it]
Training 1/1 epoch (loss 2.4560): 79%|ββββββββ | 985/1250 [23:57<05:24, 1.22s/it]
Training 1/1 epoch (loss 2.4560): 79%|ββββββββ | 986/1250 [23:57<06:35, 1.50s/it]
Training 1/1 epoch (loss 2.5943): 79%|ββββββββ | 986/1250 [23:59<06:35, 1.50s/it]
Training 1/1 epoch (loss 2.5943): 79%|ββββββββ | 987/1250 [23:59<07:15, 1.65s/it]
Training 1/1 epoch (loss 2.5223): 79%|ββββββββ | 987/1250 [24:00<07:15, 1.65s/it]
Training 1/1 epoch (loss 2.5223): 79%|ββββββββ | 988/1250 [24:00<06:20, 1.45s/it]
Training 1/1 epoch (loss 2.8114): 79%|ββββββββ | 988/1250 [24:01<06:20, 1.45s/it]
Training 1/1 epoch (loss 2.8114): 79%|ββββββββ | 989/1250 [24:01<06:23, 1.47s/it]
Training 1/1 epoch (loss 2.4353): 79%|ββββββββ | 989/1250 [24:03<06:23, 1.47s/it]
Training 1/1 epoch (loss 2.4353): 79%|ββββββββ | 990/1250 [24:03<06:32, 1.51s/it]
Training 1/1 epoch (loss 2.5145): 79%|ββββββββ | 990/1250 [24:04<06:32, 1.51s/it]
Training 1/1 epoch (loss 2.5145): 79%|ββββββββ | 991/1250 [24:04<05:32, 1.28s/it]
Training 1/1 epoch (loss 2.2521): 79%|ββββββββ | 991/1250 [24:06<05:32, 1.28s/it]
Training 1/1 epoch (loss 2.2521): 79%|ββββββββ | 992/1250 [24:06<06:25, 1.50s/it]
Training 1/1 epoch (loss 2.5101): 79%|ββββββββ | 992/1250 [24:07<06:25, 1.50s/it]
Training 1/1 epoch (loss 2.5101): 79%|ββββββββ | 993/1250 [24:07<06:10, 1.44s/it]
Training 1/1 epoch (loss 2.4350): 79%|ββββββββ | 993/1250 [24:08<06:10, 1.44s/it]
Training 1/1 epoch (loss 2.4350): 80%|ββββββββ | 994/1250 [24:08<06:01, 1.41s/it]
Training 1/1 epoch (loss 2.2831): 80%|ββββββββ | 994/1250 [24:10<06:01, 1.41s/it]
Training 1/1 epoch (loss 2.2831): 80%|ββββββββ | 995/1250 [24:10<05:58, 1.40s/it]
Training 1/1 epoch (loss 2.5321): 80%|ββββββββ | 995/1250 [24:11<05:58, 1.40s/it]
Training 1/1 epoch (loss 2.5321): 80%|ββββββββ | 996/1250 [24:11<05:35, 1.32s/it]
Training 1/1 epoch (loss 2.6333): 80%|ββββββββ | 996/1250 [24:12<05:35, 1.32s/it]
Training 1/1 epoch (loss 2.6333): 80%|ββββββββ | 997/1250 [24:12<04:57, 1.18s/it]
Training 1/1 epoch (loss 2.5478): 80%|ββββββββ | 997/1250 [24:14<04:57, 1.18s/it]
Training 1/1 epoch (loss 2.5478): 80%|ββββββββ | 998/1250 [24:14<06:31, 1.55s/it]
Training 1/1 epoch (loss 2.5820): 80%|ββββββββ | 998/1250 [24:15<06:31, 1.55s/it]
Training 1/1 epoch (loss 2.5820): 80%|ββββββββ | 999/1250 [24:15<05:43, 1.37s/it]
Training 1/1 epoch (loss 2.5452): 80%|ββββββββ | 999/1250 [24:17<05:43, 1.37s/it]
Training 1/1 epoch (loss 2.5452): 80%|ββββββββ | 1000/1250 [24:17<06:05, 1.46s/it]
Training 1/1 epoch (loss 2.6420): 80%|ββββββββ | 1000/1250 [24:19<06:05, 1.46s/it]
Training 1/1 epoch (loss 2.6420): 80%|ββββββββ | 1001/1250 [24:19<06:45, 1.63s/it]
Training 1/1 epoch (loss 2.3777): 80%|ββββββββ | 1001/1250 [24:19<06:45, 1.63s/it]
Training 1/1 epoch (loss 2.3777): 80%|ββββββββ | 1002/1250 [24:19<05:16, 1.28s/it]
Training 1/1 epoch (loss 2.4638): 80%|ββββββββ | 1002/1250 [24:21<05:16, 1.28s/it]
Training 1/1 epoch (loss 2.4638): 80%|ββββββββ | 1003/1250 [24:21<05:31, 1.34s/it]
Training 1/1 epoch (loss 2.4468): 80%|ββββββββ | 1003/1250 [24:23<05:31, 1.34s/it]
Training 1/1 epoch (loss 2.4468): 80%|ββββββββ | 1004/1250 [24:23<06:23, 1.56s/it]
Training 1/1 epoch (loss 2.6731): 80%|ββββββββ | 1004/1250 [24:23<06:23, 1.56s/it]
Training 1/1 epoch (loss 2.6731): 80%|ββββββββ | 1005/1250 [24:23<05:19, 1.30s/it]
Training 1/1 epoch (loss 2.5950): 80%|ββββββββ | 1005/1250 [24:25<05:19, 1.30s/it]
Training 1/1 epoch (loss 2.5950): 80%|ββββββββ | 1006/1250 [24:25<05:11, 1.28s/it]
Training 1/1 epoch (loss 2.6667): 80%|ββββββββ | 1006/1250 [24:27<05:11, 1.28s/it]
Training 1/1 epoch (loss 2.6667): 81%|ββββββββ | 1007/1250 [24:27<06:02, 1.49s/it]
Training 1/1 epoch (loss 2.6168): 81%|ββββββββ | 1007/1250 [24:28<06:02, 1.49s/it]
Training 1/1 epoch (loss 2.6168): 81%|ββββββββ | 1008/1250 [24:28<05:55, 1.47s/it]
Training 1/1 epoch (loss 2.7035): 81%|ββββββββ | 1008/1250 [24:30<05:55, 1.47s/it]
Training 1/1 epoch (loss 2.7035): 81%|ββββββββ | 1009/1250 [24:30<06:26, 1.61s/it]
Training 1/1 epoch (loss 2.6358): 81%|ββββββββ | 1009/1250 [24:31<06:26, 1.61s/it]
Training 1/1 epoch (loss 2.6358): 81%|ββββββββ | 1010/1250 [24:31<05:58, 1.49s/it]
Training 1/1 epoch (loss 2.7557): 81%|ββββββββ | 1010/1250 [24:34<05:58, 1.49s/it]
Training 1/1 epoch (loss 2.7557): 81%|ββββββββ | 1011/1250 [24:34<07:05, 1.78s/it]
Training 1/1 epoch (loss 2.6186): 81%|ββββββββ | 1011/1250 [24:36<07:05, 1.78s/it]
Training 1/1 epoch (loss 2.6186): 81%|ββββββββ | 1012/1250 [24:36<07:50, 1.98s/it]
Training 1/1 epoch (loss 2.4513): 81%|ββββββββ | 1012/1250 [24:37<07:50, 1.98s/it]
Training 1/1 epoch (loss 2.4513): 81%|ββββββββ | 1013/1250 [24:37<06:45, 1.71s/it]
Training 1/1 epoch (loss 2.3893): 81%|ββββββββ | 1013/1250 [24:39<06:45, 1.71s/it]
Training 1/1 epoch (loss 2.3893): 81%|ββββββββ | 1014/1250 [24:39<07:13, 1.84s/it]
Training 1/1 epoch (loss 2.5520): 81%|ββββββββ | 1014/1250 [24:42<07:13, 1.84s/it]
Training 1/1 epoch (loss 2.5520): 81%|ββββββββ | 1015/1250 [24:42<07:57, 2.03s/it]
Training 1/1 epoch (loss 2.5779): 81%|ββββββββ | 1015/1250 [24:43<07:57, 2.03s/it]
Training 1/1 epoch (loss 2.5779): 81%|βββββββββ | 1016/1250 [24:43<06:45, 1.73s/it]
Training 1/1 epoch (loss 2.5612): 81%|βββββββββ | 1016/1250 [24:44<06:45, 1.73s/it]
Training 1/1 epoch (loss 2.5612): 81%|βββββββββ | 1017/1250 [24:44<05:43, 1.48s/it]
Training 1/1 epoch (loss 2.6684): 81%|βββββββββ | 1017/1250 [24:45<05:43, 1.48s/it]
Training 1/1 epoch (loss 2.6684): 81%|βββββββββ | 1018/1250 [24:45<05:04, 1.31s/it]
Training 1/1 epoch (loss 2.5985): 81%|βββββββββ | 1018/1250 [24:47<05:04, 1.31s/it]
Training 1/1 epoch (loss 2.5985): 82%|βββββββββ | 1019/1250 [24:47<05:42, 1.48s/it]
Training 1/1 epoch (loss 2.7760): 82%|βββββββββ | 1019/1250 [24:48<05:42, 1.48s/it]
Training 1/1 epoch (loss 2.7760): 82%|βββββββββ | 1020/1250 [24:48<05:55, 1.55s/it]
Training 1/1 epoch (loss 2.5240): 82%|βββββββββ | 1020/1250 [24:49<05:55, 1.55s/it]
Training 1/1 epoch (loss 2.5240): 82%|βββββββββ | 1021/1250 [24:49<04:53, 1.28s/it]
Training 1/1 epoch (loss 2.8070): 82%|βββββββββ | 1021/1250 [24:51<04:53, 1.28s/it]
Training 1/1 epoch (loss 2.8070): 82%|βββββββββ | 1022/1250 [24:51<05:18, 1.40s/it]
Training 1/1 epoch (loss 2.4870): 82%|βββββββββ | 1022/1250 [24:52<05:18, 1.40s/it]
Training 1/1 epoch (loss 2.4870): 82%|βββββββββ | 1023/1250 [24:52<05:26, 1.44s/it]
Training 1/1 epoch (loss 2.5521): 82%|βββββββββ | 1023/1250 [24:53<05:26, 1.44s/it]
Training 1/1 epoch (loss 2.5521): 82%|βββββββββ | 1024/1250 [24:53<04:54, 1.30s/it]
Training 1/1 epoch (loss 2.4295): 82%|βββββββββ | 1024/1250 [24:56<04:54, 1.30s/it]
Training 1/1 epoch (loss 2.4295): 82%|βββββββββ | 1025/1250 [24:56<06:09, 1.64s/it]
Training 1/1 epoch (loss 2.6075): 82%|βββββββββ | 1025/1250 [24:57<06:09, 1.64s/it]
Training 1/1 epoch (loss 2.6075): 82%|βββββββββ | 1026/1250 [24:57<05:53, 1.58s/it]
Training 1/1 epoch (loss 2.4976): 82%|βββββββββ | 1026/1250 [24:57<05:53, 1.58s/it]
Training 1/1 epoch (loss 2.4976): 82%|βββββββββ | 1027/1250 [24:57<04:38, 1.25s/it]
Training 1/1 epoch (loss 2.4923): 82%|βββββββββ | 1027/1250 [24:59<04:38, 1.25s/it]
Training 1/1 epoch (loss 2.4923): 82%|βββββββββ | 1028/1250 [24:59<05:19, 1.44s/it]
Training 1/1 epoch (loss 2.6077): 82%|βββββββββ | 1028/1250 [25:01<05:19, 1.44s/it]
Training 1/1 epoch (loss 2.6077): 82%|βββββββββ | 1029/1250 [25:01<05:50, 1.59s/it]
Training 1/1 epoch (loss 2.6690): 82%|βββββββββ | 1029/1250 [25:02<05:50, 1.59s/it]
Training 1/1 epoch (loss 2.6690): 82%|βββββββββ | 1030/1250 [25:02<04:41, 1.28s/it]
Training 1/1 epoch (loss 2.6047): 82%|βββββββββ | 1030/1250 [25:03<04:41, 1.28s/it]
Training 1/1 epoch (loss 2.6047): 82%|βββββββββ | 1031/1250 [25:03<04:37, 1.27s/it]
Training 1/1 epoch (loss 2.5900): 82%|βββββββββ | 1031/1250 [25:05<04:37, 1.27s/it]
Training 1/1 epoch (loss 2.5900): 83%|βββββββββ | 1032/1250 [25:05<05:26, 1.50s/it]
Training 1/1 epoch (loss 2.4710): 83%|βββββββββ | 1032/1250 [25:06<05:26, 1.50s/it]
Training 1/1 epoch (loss 2.4710): 83%|βββββββββ | 1033/1250 [25:06<05:00, 1.38s/it]
Training 1/1 epoch (loss 2.4462): 83%|βββββββββ | 1033/1250 [25:08<05:00, 1.38s/it]
Training 1/1 epoch (loss 2.4462): 83%|βββββββββ | 1034/1250 [25:08<05:34, 1.55s/it]
Training 1/1 epoch (loss 2.3863): 83%|βββββββββ | 1034/1250 [25:10<05:34, 1.55s/it]
Training 1/1 epoch (loss 2.3863): 83%|βββββββββ | 1035/1250 [25:10<05:54, 1.65s/it]
Training 1/1 epoch (loss 2.5109): 83%|βββββββββ | 1035/1250 [25:11<05:54, 1.65s/it]
Training 1/1 epoch (loss 2.5109): 83%|βββββββββ | 1036/1250 [25:11<05:08, 1.44s/it]
Training 1/1 epoch (loss 2.6931): 83%|βββββββββ | 1036/1250 [25:13<05:08, 1.44s/it]
Training 1/1 epoch (loss 2.6931): 83%|βββββββββ | 1037/1250 [25:13<06:09, 1.73s/it]
Training 1/1 epoch (loss 2.4604): 83%|βββββββββ | 1037/1250 [25:15<06:09, 1.73s/it]
Training 1/1 epoch (loss 2.4604): 83%|βββββββββ | 1038/1250 [25:15<06:05, 1.72s/it]
Training 1/1 epoch (loss 2.3465): 83%|βββββββββ | 1038/1250 [25:17<06:05, 1.72s/it]
Training 1/1 epoch (loss 2.3465): 83%|βββββββββ | 1039/1250 [25:17<05:44, 1.63s/it]
Training 1/1 epoch (loss 2.4537): 83%|βββββββββ | 1039/1250 [25:19<05:44, 1.63s/it]
Training 1/1 epoch (loss 2.4537): 83%|βββββββββ | 1040/1250 [25:19<06:18, 1.80s/it]
Training 1/1 epoch (loss 2.3101): 83%|βββββββββ | 1040/1250 [25:20<06:18, 1.80s/it]
Training 1/1 epoch (loss 2.3101): 83%|βββββββββ | 1041/1250 [25:20<05:11, 1.49s/it]
Training 1/1 epoch (loss 2.7219): 83%|βββββββββ | 1041/1250 [25:22<05:11, 1.49s/it]
Training 1/1 epoch (loss 2.7219): 83%|βββββββββ | 1042/1250 [25:22<05:48, 1.67s/it]
Training 1/1 epoch (loss 2.6239): 83%|βββββββββ | 1042/1250 [25:23<05:48, 1.67s/it]
Training 1/1 epoch (loss 2.6239): 83%|βββββββββ | 1043/1250 [25:23<05:30, 1.60s/it]
Training 1/1 epoch (loss 2.6034): 83%|βββββββββ | 1043/1250 [25:24<05:30, 1.60s/it]
Training 1/1 epoch (loss 2.6034): 84%|βββββββββ | 1044/1250 [25:24<04:19, 1.26s/it]
Training 1/1 epoch (loss 2.6666): 84%|βββββββββ | 1044/1250 [25:25<04:19, 1.26s/it]
Training 1/1 epoch (loss 2.6666): 84%|βββββββββ | 1045/1250 [25:25<04:32, 1.33s/it]
Training 1/1 epoch (loss 2.6361): 84%|βββββββββ | 1045/1250 [25:27<04:32, 1.33s/it]
Training 1/1 epoch (loss 2.6361): 84%|βββββββββ | 1046/1250 [25:27<05:36, 1.65s/it]
Training 1/1 epoch (loss 2.6398): 84%|βββββββββ | 1046/1250 [25:28<05:36, 1.65s/it]
Training 1/1 epoch (loss 2.6398): 84%|βββββββββ | 1047/1250 [25:28<04:26, 1.31s/it]
Training 1/1 epoch (loss 2.6086): 84%|βββββββββ | 1047/1250 [25:30<04:26, 1.31s/it]
Training 1/1 epoch (loss 2.6086): 84%|βββββββββ | 1048/1250 [25:30<05:22, 1.60s/it]
Training 1/1 epoch (loss 2.4828): 84%|βββββββββ | 1048/1250 [25:32<05:22, 1.60s/it]
Training 1/1 epoch (loss 2.4828): 84%|βββββββββ | 1049/1250 [25:32<05:35, 1.67s/it]
Training 1/1 epoch (loss 2.7476): 84%|βββββββββ | 1049/1250 [25:33<05:35, 1.67s/it]
Training 1/1 epoch (loss 2.7476): 84%|βββββββββ | 1050/1250 [25:33<04:40, 1.40s/it]
Training 1/1 epoch (loss 2.6126): 84%|βββββββββ | 1050/1250 [25:35<04:40, 1.40s/it]
Training 1/1 epoch (loss 2.6126): 84%|βββββββββ | 1051/1250 [25:35<05:07, 1.55s/it]
Training 1/1 epoch (loss 2.5927): 84%|βββββββββ | 1051/1250 [25:36<05:07, 1.55s/it]
Training 1/1 epoch (loss 2.5927): 84%|βββββββββ | 1052/1250 [25:36<04:28, 1.36s/it]
Training 1/1 epoch (loss 2.4628): 84%|βββββββββ | 1052/1250 [25:37<04:28, 1.36s/it]
Training 1/1 epoch (loss 2.4628): 84%|βββββββββ | 1053/1250 [25:37<04:40, 1.42s/it]
Training 1/1 epoch (loss 2.4557): 84%|βββββββββ | 1053/1250 [25:38<04:40, 1.42s/it]
Training 1/1 epoch (loss 2.4557): 84%|βββββββββ | 1054/1250 [25:38<04:18, 1.32s/it]
Training 1/1 epoch (loss 2.3544): 84%|βββββββββ | 1054/1250 [25:40<04:18, 1.32s/it]
Training 1/1 epoch (loss 2.3544): 84%|βββββββββ | 1055/1250 [25:40<04:24, 1.35s/it]
Training 1/1 epoch (loss 2.4873): 84%|βββββββββ | 1055/1250 [25:42<04:24, 1.35s/it]
Training 1/1 epoch (loss 2.4873): 84%|βββββββββ | 1056/1250 [25:42<04:59, 1.54s/it]
Training 1/1 epoch (loss 2.6093): 84%|βββββββββ | 1056/1250 [25:44<04:59, 1.54s/it]
Training 1/1 epoch (loss 2.6093): 85%|βββββββββ | 1057/1250 [25:44<05:49, 1.81s/it]
Training 1/1 epoch (loss 2.6188): 85%|βββββββββ | 1057/1250 [25:45<05:49, 1.81s/it]
Training 1/1 epoch (loss 2.6188): 85%|βββββββββ | 1058/1250 [25:45<04:37, 1.45s/it]
Training 1/1 epoch (loss 2.5976): 85%|βββββββββ | 1058/1250 [25:46<04:37, 1.45s/it]
Training 1/1 epoch (loss 2.5976): 85%|βββββββββ | 1059/1250 [25:46<04:49, 1.51s/it]
Training 1/1 epoch (loss 2.6862): 85%|βββββββββ | 1059/1250 [25:48<04:49, 1.51s/it]
Training 1/1 epoch (loss 2.6862): 85%|βββββββββ | 1060/1250 [25:48<05:11, 1.64s/it]
Training 1/1 epoch (loss 2.4173): 85%|βββββββββ | 1060/1250 [25:49<05:11, 1.64s/it]
Training 1/1 epoch (loss 2.4173): 85%|βββββββββ | 1061/1250 [25:49<04:08, 1.32s/it]
Training 1/1 epoch (loss 2.8002): 85%|βββββββββ | 1061/1250 [25:51<04:08, 1.32s/it]
Training 1/1 epoch (loss 2.8002): 85%|βββββββββ | 1062/1250 [25:51<04:37, 1.48s/it]
Training 1/1 epoch (loss 2.7227): 85%|βββββββββ | 1062/1250 [25:52<04:37, 1.48s/it]
Training 1/1 epoch (loss 2.7227): 85%|βββββββββ | 1063/1250 [25:52<04:31, 1.45s/it]
Training 1/1 epoch (loss 2.6381): 85%|βββββββββ | 1063/1250 [25:53<04:31, 1.45s/it]
Training 1/1 epoch (loss 2.6381): 85%|βββββββββ | 1064/1250 [25:53<04:15, 1.37s/it]
Training 1/1 epoch (loss 2.5342): 85%|βββββββββ | 1064/1250 [25:55<04:15, 1.37s/it]
Training 1/1 epoch (loss 2.5342): 85%|βββββββββ | 1065/1250 [25:55<04:35, 1.49s/it]
Training 1/1 epoch (loss 2.7228): 85%|βββββββββ | 1065/1250 [25:56<04:35, 1.49s/it]
Training 1/1 epoch (loss 2.7228): 85%|βββββββββ | 1066/1250 [25:56<04:14, 1.38s/it]
Training 1/1 epoch (loss 2.5571): 85%|βββββββββ | 1066/1250 [25:58<04:14, 1.38s/it]
Training 1/1 epoch (loss 2.5571): 85%|βββββββββ | 1067/1250 [25:58<04:31, 1.49s/it]
Training 1/1 epoch (loss 2.5155): 85%|βββββββββ | 1067/1250 [26:00<04:31, 1.49s/it]
Training 1/1 epoch (loss 2.5155): 85%|βββββββββ | 1068/1250 [26:00<04:38, 1.53s/it]
Training 1/1 epoch (loss 2.4969): 85%|βββββββββ | 1068/1250 [26:00<04:38, 1.53s/it]
Training 1/1 epoch (loss 2.4969): 86%|βββββββββ | 1069/1250 [26:00<03:42, 1.23s/it]
Training 1/1 epoch (loss 2.5663): 86%|βββββββββ | 1069/1250 [26:02<03:42, 1.23s/it]
Training 1/1 epoch (loss 2.5663): 86%|βββββββββ | 1070/1250 [26:02<04:36, 1.53s/it]
Training 1/1 epoch (loss 2.6728): 86%|βββββββββ | 1070/1250 [26:04<04:36, 1.53s/it]
Training 1/1 epoch (loss 2.6728): 86%|βββββββββ | 1071/1250 [26:04<04:52, 1.63s/it]
Training 1/1 epoch (loss 2.5853): 86%|βββββββββ | 1071/1250 [26:05<04:52, 1.63s/it]
Training 1/1 epoch (loss 2.5853): 86%|βββββββββ | 1072/1250 [26:05<03:58, 1.34s/it]
Training 1/1 epoch (loss 2.5546): 86%|βββββββββ | 1072/1250 [26:07<03:58, 1.34s/it]
Training 1/1 epoch (loss 2.5546): 86%|βββββββββ | 1073/1250 [26:07<05:01, 1.70s/it]
Training 1/1 epoch (loss 2.6339): 86%|βββββββββ | 1073/1250 [26:09<05:01, 1.70s/it]
Training 1/1 epoch (loss 2.6339): 86%|βββββββββ | 1074/1250 [26:09<04:56, 1.69s/it]
Training 1/1 epoch (loss 2.6324): 86%|βββββββββ | 1074/1250 [26:10<04:56, 1.69s/it]
Training 1/1 epoch (loss 2.6324): 86%|βββββββββ | 1075/1250 [26:10<04:39, 1.60s/it]
Training 1/1 epoch (loss 2.6647): 86%|βββββββββ | 1075/1250 [26:13<04:39, 1.60s/it]
Training 1/1 epoch (loss 2.6647): 86%|βββββββββ | 1076/1250 [26:13<05:12, 1.80s/it]
Training 1/1 epoch (loss 2.5929): 86%|βββββββββ | 1076/1250 [26:14<05:12, 1.80s/it]
Training 1/1 epoch (loss 2.5929): 86%|βββββββββ | 1077/1250 [26:14<04:22, 1.52s/it]
Training 1/1 epoch (loss 2.5930): 86%|βββββββββ | 1077/1250 [26:16<04:22, 1.52s/it]
Training 1/1 epoch (loss 2.5930): 86%|βββββββββ | 1078/1250 [26:16<05:03, 1.77s/it]
Training 1/1 epoch (loss 2.5875): 86%|βββββββββ | 1078/1250 [26:18<05:03, 1.77s/it]
Training 1/1 epoch (loss 2.5875): 86%|βββββββββ | 1079/1250 [26:18<05:12, 1.83s/it]
Training 1/1 epoch (loss 2.7598): 86%|βββββββββ | 1079/1250 [26:19<05:12, 1.83s/it]
Training 1/1 epoch (loss 2.7598): 86%|βββββββββ | 1080/1250 [26:19<04:47, 1.69s/it]
Training 1/1 epoch (loss 2.7287): 86%|βββββββββ | 1080/1250 [26:21<04:47, 1.69s/it]
Training 1/1 epoch (loss 2.7287): 86%|βββββββββ | 1081/1250 [26:21<04:59, 1.77s/it]
Training 1/1 epoch (loss 2.3162): 86%|βββββββββ | 1081/1250 [26:22<04:59, 1.77s/it]
Training 1/1 epoch (loss 2.3162): 87%|βββββββββ | 1082/1250 [26:22<04:26, 1.59s/it]
Training 1/1 epoch (loss 2.6266): 87%|βββββββββ | 1082/1250 [26:23<04:26, 1.59s/it]
Training 1/1 epoch (loss 2.6266): 87%|βββββββββ | 1083/1250 [26:23<03:45, 1.35s/it]
Training 1/1 epoch (loss 2.7282): 87%|βββββββββ | 1083/1250 [26:25<03:45, 1.35s/it]
Training 1/1 epoch (loss 2.7282): 87%|βββββββββ | 1084/1250 [26:25<04:24, 1.59s/it]
Training 1/1 epoch (loss 2.4837): 87%|βββββββββ | 1084/1250 [26:27<04:24, 1.59s/it]
Training 1/1 epoch (loss 2.4837): 87%|βββββββββ | 1085/1250 [26:27<04:06, 1.50s/it]
Training 1/1 epoch (loss 2.5547): 87%|βββββββββ | 1085/1250 [26:28<04:06, 1.50s/it]
Training 1/1 epoch (loss 2.5547): 87%|βββββββββ | 1086/1250 [26:28<03:57, 1.45s/it]
Training 1/1 epoch (loss 2.6983): 87%|βββββββββ | 1086/1250 [26:30<03:57, 1.45s/it]
Training 1/1 epoch (loss 2.6983): 87%|βββββββββ | 1087/1250 [26:30<04:23, 1.62s/it]
Training 1/1 epoch (loss 2.6428): 87%|βββββββββ | 1087/1250 [26:31<04:23, 1.62s/it]
Training 1/1 epoch (loss 2.6428): 87%|βββββββββ | 1088/1250 [26:31<04:07, 1.53s/it]
Training 1/1 epoch (loss 2.4744): 87%|βββββββββ | 1088/1250 [26:33<04:07, 1.53s/it]
Training 1/1 epoch (loss 2.4744): 87%|βββββββββ | 1089/1250 [26:33<04:33, 1.70s/it]
Training 1/1 epoch (loss 2.5946): 87%|βββββββββ | 1089/1250 [26:35<04:33, 1.70s/it]
Training 1/1 epoch (loss 2.5946): 87%|βββββββββ | 1090/1250 [26:35<04:32, 1.70s/it]
Training 1/1 epoch (loss 2.6582): 87%|βββββββββ | 1090/1250 [26:36<04:32, 1.70s/it]
Training 1/1 epoch (loss 2.6582): 87%|βββββββββ | 1091/1250 [26:36<03:29, 1.32s/it]
Training 1/1 epoch (loss 2.4311): 87%|βββββββββ | 1091/1250 [26:37<03:29, 1.32s/it]
Training 1/1 epoch (loss 2.4311): 87%|βββββββββ | 1092/1250 [26:37<03:34, 1.36s/it]
Training 1/1 epoch (loss 2.4783): 87%|βββββββββ | 1092/1250 [26:39<03:34, 1.36s/it]
Training 1/1 epoch (loss 2.4783): 87%|βββββββββ | 1093/1250 [26:39<03:58, 1.52s/it]
Training 1/1 epoch (loss 2.4859): 87%|βββββββββ | 1093/1250 [26:39<03:58, 1.52s/it]
Training 1/1 epoch (loss 2.4859): 88%|βββββββββ | 1094/1250 [26:39<03:11, 1.23s/it]
Training 1/1 epoch (loss 2.6309): 88%|βββββββββ | 1094/1250 [26:41<03:11, 1.23s/it]
Training 1/1 epoch (loss 2.6309): 88%|βββββββββ | 1095/1250 [26:41<03:12, 1.24s/it]
Training 1/1 epoch (loss 2.4778): 88%|βββββββββ | 1095/1250 [26:43<03:12, 1.24s/it]
Training 1/1 epoch (loss 2.4778): 88%|βββββββββ | 1096/1250 [26:43<03:41, 1.44s/it]
Training 1/1 epoch (loss 2.7625): 88%|βββββββββ | 1096/1250 [26:43<03:41, 1.44s/it]
Training 1/1 epoch (loss 2.7625): 88%|βββββββββ | 1097/1250 [26:43<03:04, 1.21s/it]
Training 1/1 epoch (loss 2.6985): 88%|βββββββββ | 1097/1250 [26:45<03:04, 1.21s/it]
Training 1/1 epoch (loss 2.6985): 88%|βββββββββ | 1098/1250 [26:45<03:25, 1.35s/it]
Training 1/1 epoch (loss 2.6662): 88%|βββββββββ | 1098/1250 [26:47<03:25, 1.35s/it]
Training 1/1 epoch (loss 2.6662): 88%|βββββββββ | 1099/1250 [26:47<03:43, 1.48s/it]
Training 1/1 epoch (loss 2.6642): 88%|βββββββββ | 1099/1250 [26:48<03:43, 1.48s/it]
Training 1/1 epoch (loss 2.6642): 88%|βββββββββ | 1100/1250 [26:48<03:13, 1.29s/it]
Training 1/1 epoch (loss 2.4185): 88%|βββββββββ | 1100/1250 [26:49<03:13, 1.29s/it]
Training 1/1 epoch (loss 2.4185): 88%|βββββββββ | 1101/1250 [26:49<03:10, 1.28s/it]
Training 1/1 epoch (loss 2.6703): 88%|βββββββββ | 1101/1250 [26:50<03:10, 1.28s/it]
Training 1/1 epoch (loss 2.6703): 88%|βββββββββ | 1102/1250 [26:50<03:20, 1.36s/it]
Training 1/1 epoch (loss 2.5991): 88%|βββββββββ | 1102/1250 [26:51<03:20, 1.36s/it]
Training 1/1 epoch (loss 2.5991): 88%|βββββββββ | 1103/1250 [26:51<02:58, 1.21s/it]
Training 1/1 epoch (loss 2.4457): 88%|βββββββββ | 1103/1250 [26:52<02:58, 1.21s/it]
Training 1/1 epoch (loss 2.4457): 88%|βββββββββ | 1104/1250 [26:52<02:58, 1.22s/it]
Training 1/1 epoch (loss 2.5200): 88%|βββββββββ | 1104/1250 [26:54<02:58, 1.22s/it]
Training 1/1 epoch (loss 2.5200): 88%|βββββββββ | 1105/1250 [26:54<03:00, 1.24s/it]
Training 1/1 epoch (loss 2.5257): 88%|βββββββββ | 1105/1250 [26:56<03:00, 1.24s/it]
Training 1/1 epoch (loss 2.5257): 88%|βββββββββ | 1106/1250 [26:56<03:35, 1.50s/it]
Training 1/1 epoch (loss 2.4245): 88%|βββββββββ | 1106/1250 [26:58<03:35, 1.50s/it]
Training 1/1 epoch (loss 2.4245): 89%|βββββββββ | 1107/1250 [26:58<03:48, 1.59s/it]
Training 1/1 epoch (loss 2.6259): 89%|βββββββββ | 1107/1250 [26:59<03:48, 1.59s/it]
Training 1/1 epoch (loss 2.6259): 89%|βββββββββ | 1108/1250 [26:59<03:31, 1.49s/it]
Training 1/1 epoch (loss 2.6796): 89%|βββββββββ | 1108/1250 [27:01<03:31, 1.49s/it]
Training 1/1 epoch (loss 2.6796): 89%|βββββββββ | 1109/1250 [27:01<03:49, 1.63s/it]
Training 1/1 epoch (loss 2.7158): 89%|βββββββββ | 1109/1250 [27:02<03:49, 1.63s/it]
Training 1/1 epoch (loss 2.7158): 89%|βββββββββ | 1110/1250 [27:02<03:18, 1.42s/it]
Training 1/1 epoch (loss 2.7189): 89%|βββββββββ | 1110/1250 [27:02<03:18, 1.42s/it]
Training 1/1 epoch (loss 2.7189): 89%|βββββββββ | 1111/1250 [27:02<02:44, 1.19s/it]
Training 1/1 epoch (loss 2.5364): 89%|βββββββββ | 1111/1250 [27:04<02:44, 1.19s/it]
Training 1/1 epoch (loss 2.5364): 89%|βββββββββ | 1112/1250 [27:04<03:05, 1.34s/it]
Training 1/1 epoch (loss 2.6516): 89%|βββββββββ | 1112/1250 [27:05<03:05, 1.34s/it]
Training 1/1 epoch (loss 2.6516): 89%|βββββββββ | 1113/1250 [27:05<02:44, 1.20s/it]
Training 1/1 epoch (loss 2.5622): 89%|βββββββββ | 1113/1250 [27:07<02:44, 1.20s/it]
Training 1/1 epoch (loss 2.5622): 89%|βββββββββ | 1114/1250 [27:07<03:07, 1.38s/it]
Training 1/1 epoch (loss 2.4541): 89%|βββββββββ | 1114/1250 [27:09<03:07, 1.38s/it]
Training 1/1 epoch (loss 2.4541): 89%|βββββββββ | 1115/1250 [27:09<03:33, 1.58s/it]
Training 1/1 epoch (loss 2.5162): 89%|βββββββββ | 1115/1250 [27:09<03:33, 1.58s/it]
Training 1/1 epoch (loss 2.5162): 89%|βββββββββ | 1116/1250 [27:09<02:51, 1.28s/it]
Training 1/1 epoch (loss 2.6401): 89%|βββββββββ | 1116/1250 [27:12<02:51, 1.28s/it]
Training 1/1 epoch (loss 2.6401): 89%|βββββββββ | 1117/1250 [27:12<03:36, 1.63s/it]
Training 1/1 epoch (loss 2.4001): 89%|βββββββββ | 1117/1250 [27:14<03:36, 1.63s/it]
Training 1/1 epoch (loss 2.4001): 89%|βββββββββ | 1118/1250 [27:14<03:57, 1.80s/it]
Training 1/1 epoch (loss 2.4936): 89%|βββββββββ | 1118/1250 [27:15<03:57, 1.80s/it]
Training 1/1 epoch (loss 2.4936): 90%|βββββββββ | 1119/1250 [27:15<03:03, 1.40s/it]
Training 1/1 epoch (loss 2.3757): 90%|βββββββββ | 1119/1250 [27:16<03:03, 1.40s/it]
Training 1/1 epoch (loss 2.3757): 90%|βββββββββ | 1120/1250 [27:16<03:08, 1.45s/it]
Training 1/1 epoch (loss 2.5726): 90%|βββββββββ | 1120/1250 [27:18<03:08, 1.45s/it]
Training 1/1 epoch (loss 2.5726): 90%|βββββββββ | 1121/1250 [27:18<03:20, 1.56s/it]
Training 1/1 epoch (loss 2.5466): 90%|βββββββββ | 1121/1250 [27:18<03:20, 1.56s/it]
Training 1/1 epoch (loss 2.5466): 90%|βββββββββ | 1122/1250 [27:18<02:37, 1.23s/it]
Training 1/1 epoch (loss 2.6105): 90%|βββββββββ | 1122/1250 [27:20<02:37, 1.23s/it]
Training 1/1 epoch (loss 2.6105): 90%|βββββββββ | 1123/1250 [27:20<02:58, 1.40s/it]
Training 1/1 epoch (loss 2.5412): 90%|βββββββββ | 1123/1250 [27:23<02:58, 1.40s/it]
Training 1/1 epoch (loss 2.5412): 90%|βββββββββ | 1124/1250 [27:23<03:34, 1.71s/it]
Training 1/1 epoch (loss 2.6791): 90%|βββββββββ | 1124/1250 [27:23<03:34, 1.71s/it]
Training 1/1 epoch (loss 2.6791): 90%|βββββββββ | 1125/1250 [27:23<02:51, 1.37s/it]
Training 1/1 epoch (loss 2.6925): 90%|βββββββββ | 1125/1250 [27:25<02:51, 1.37s/it]
Training 1/1 epoch (loss 2.6925): 90%|βββββββββ | 1126/1250 [27:25<02:49, 1.37s/it]
Training 1/1 epoch (loss 2.4991): 90%|βββββββββ | 1126/1250 [27:27<02:49, 1.37s/it]
Training 1/1 epoch (loss 2.4991): 90%|βββββββββ | 1127/1250 [27:27<03:09, 1.54s/it]
Training 1/1 epoch (loss 2.6352): 90%|βββββββββ | 1127/1250 [27:28<03:09, 1.54s/it]
Training 1/1 epoch (loss 2.6352): 90%|βββββββββ | 1128/1250 [27:28<03:06, 1.52s/it]
Training 1/1 epoch (loss 2.5169): 90%|βββββββββ | 1128/1250 [27:30<03:06, 1.52s/it]
Training 1/1 epoch (loss 2.5169): 90%|βββββββββ | 1129/1250 [27:30<03:34, 1.78s/it]
Training 1/1 epoch (loss 2.5729): 90%|βββββββββ | 1129/1250 [27:31<03:34, 1.78s/it]
Training 1/1 epoch (loss 2.5729): 90%|βββββββββ | 1130/1250 [27:31<03:07, 1.56s/it]
Training 1/1 epoch (loss 2.3904): 90%|βββββββββ | 1130/1250 [27:33<03:07, 1.56s/it]
Training 1/1 epoch (loss 2.3904): 90%|βββββββββ | 1131/1250 [27:33<03:19, 1.68s/it]
Training 1/1 epoch (loss 2.5233): 90%|βββββββββ | 1131/1250 [27:36<03:19, 1.68s/it]
Training 1/1 epoch (loss 2.5233): 91%|βββββββββ | 1132/1250 [27:36<03:48, 1.93s/it]
Training 1/1 epoch (loss 2.4717): 91%|βββββββββ | 1132/1250 [27:37<03:48, 1.93s/it]
Training 1/1 epoch (loss 2.4717): 91%|βββββββββ | 1133/1250 [27:37<03:00, 1.54s/it]
Training 1/1 epoch (loss 2.5917): 91%|βββββββββ | 1133/1250 [27:39<03:00, 1.54s/it]
Training 1/1 epoch (loss 2.5917): 91%|βββββββββ | 1134/1250 [27:39<03:15, 1.69s/it]
Training 1/1 epoch (loss 2.5820): 91%|βββββββββ | 1134/1250 [27:40<03:15, 1.69s/it]
Training 1/1 epoch (loss 2.5820): 91%|βββββββββ | 1135/1250 [27:40<03:01, 1.57s/it]
Training 1/1 epoch (loss 2.5021): 91%|βββββββββ | 1135/1250 [27:41<03:01, 1.57s/it]
Training 1/1 epoch (loss 2.5021): 91%|βββββββββ | 1136/1250 [27:41<02:38, 1.39s/it]
Training 1/1 epoch (loss 2.5755): 91%|βββββββββ | 1136/1250 [27:42<02:38, 1.39s/it]
Training 1/1 epoch (loss 2.5755): 91%|βββββββββ | 1137/1250 [27:42<02:40, 1.42s/it]
Training 1/1 epoch (loss 2.5868): 91%|βββββββββ | 1137/1250 [27:43<02:40, 1.42s/it]
Training 1/1 epoch (loss 2.5868): 91%|βββββββββ | 1138/1250 [27:43<02:13, 1.19s/it]
Training 1/1 epoch (loss 2.3728): 91%|βββββββββ | 1138/1250 [27:45<02:13, 1.19s/it]
Training 1/1 epoch (loss 2.3728): 91%|βββββββββ | 1139/1250 [27:45<02:38, 1.43s/it]
Training 1/1 epoch (loss 2.7479): 91%|βββββββββ | 1139/1250 [27:46<02:38, 1.43s/it]
Training 1/1 epoch (loss 2.7479): 91%|βββββββββ | 1140/1250 [27:46<02:38, 1.45s/it]
Training 1/1 epoch (loss 2.4649): 91%|βββββββββ | 1140/1250 [27:47<02:38, 1.45s/it]
Training 1/1 epoch (loss 2.4649): 91%|ββββββββββ| 1141/1250 [27:47<02:10, 1.19s/it]
Training 1/1 epoch (loss 2.5440): 91%|ββββββββββ| 1141/1250 [27:49<02:10, 1.19s/it]
Training 1/1 epoch (loss 2.5440): 91%|ββββββββββ| 1142/1250 [27:49<02:22, 1.32s/it]
Training 1/1 epoch (loss 2.5934): 91%|ββββββββββ| 1142/1250 [27:50<02:22, 1.32s/it]
Training 1/1 epoch (loss 2.5934): 91%|ββββββββββ| 1143/1250 [27:50<02:34, 1.44s/it]
Training 1/1 epoch (loss 2.4030): 91%|ββββββββββ| 1143/1250 [27:51<02:34, 1.44s/it]
Training 1/1 epoch (loss 2.4030): 92%|ββββββββββ| 1144/1250 [27:51<02:18, 1.30s/it]
Training 1/1 epoch (loss 2.5512): 92%|ββββββββββ| 1144/1250 [27:54<02:18, 1.30s/it]
Training 1/1 epoch (loss 2.5512): 92%|ββββββββββ| 1145/1250 [27:54<02:52, 1.64s/it]
Training 1/1 epoch (loss 2.5944): 92%|ββββββββββ| 1145/1250 [27:55<02:52, 1.64s/it]
Training 1/1 epoch (loss 2.5944): 92%|ββββββββββ| 1146/1250 [27:55<02:35, 1.49s/it]
Training 1/1 epoch (loss 2.4347): 92%|ββββββββββ| 1146/1250 [27:56<02:35, 1.49s/it]
Training 1/1 epoch (loss 2.4347): 92%|ββββββββββ| 1147/1250 [27:56<02:18, 1.34s/it]
Training 1/1 epoch (loss 2.6403): 92%|ββββββββββ| 1147/1250 [27:58<02:18, 1.34s/it]
Training 1/1 epoch (loss 2.6403): 92%|ββββββββββ| 1148/1250 [27:58<02:28, 1.46s/it]
Training 1/1 epoch (loss 2.4058): 92%|ββββββββββ| 1148/1250 [27:58<02:28, 1.46s/it]
Training 1/1 epoch (loss 2.4058): 92%|ββββββββββ| 1149/1250 [27:58<02:07, 1.26s/it]
Training 1/1 epoch (loss 2.4109): 92%|ββββββββββ| 1149/1250 [28:00<02:07, 1.26s/it]
Training 1/1 epoch (loss 2.4109): 92%|ββββββββββ| 1150/1250 [28:00<02:26, 1.47s/it]
Training 1/1 epoch (loss 2.7828): 92%|ββββββββββ| 1150/1250 [28:02<02:26, 1.47s/it]
Training 1/1 epoch (loss 2.7828): 92%|ββββββββββ| 1151/1250 [28:02<02:19, 1.41s/it]
Training 1/1 epoch (loss 2.5855): 92%|ββββββββββ| 1151/1250 [28:03<02:19, 1.41s/it]
Training 1/1 epoch (loss 2.5855): 92%|ββββββββββ| 1152/1250 [28:03<02:01, 1.24s/it]
Training 1/1 epoch (loss 2.5834): 92%|ββββββββββ| 1152/1250 [28:05<02:01, 1.24s/it]
Training 1/1 epoch (loss 2.5834): 92%|ββββββββββ| 1153/1250 [28:05<02:29, 1.54s/it]
Training 1/1 epoch (loss 2.4872): 92%|ββββββββββ| 1153/1250 [28:07<02:29, 1.54s/it]
Training 1/1 epoch (loss 2.4872): 92%|ββββββββββ| 1154/1250 [28:07<02:37, 1.64s/it]
Training 1/1 epoch (loss 2.5188): 92%|ββββββββββ| 1154/1250 [28:07<02:37, 1.64s/it]
Training 1/1 epoch (loss 2.5188): 92%|ββββββββββ| 1155/1250 [28:07<02:07, 1.34s/it]
Training 1/1 epoch (loss 2.5595): 92%|ββββββββββ| 1155/1250 [28:09<02:07, 1.34s/it]
Training 1/1 epoch (loss 2.5595): 92%|ββββββββββ| 1156/1250 [28:09<02:19, 1.48s/it]
Training 1/1 epoch (loss 2.5277): 92%|ββββββββββ| 1156/1250 [28:10<02:19, 1.48s/it]
Training 1/1 epoch (loss 2.5277): 93%|ββββββββββ| 1157/1250 [28:10<02:08, 1.39s/it]
Training 1/1 epoch (loss 2.5257): 93%|ββββββββββ| 1157/1250 [28:11<02:08, 1.39s/it]
Training 1/1 epoch (loss 2.5257): 93%|ββββββββββ| 1158/1250 [28:11<01:57, 1.28s/it]
Training 1/1 epoch (loss 2.5206): 93%|ββββββββββ| 1158/1250 [28:13<01:57, 1.28s/it]
Training 1/1 epoch (loss 2.5206): 93%|ββββββββββ| 1159/1250 [28:13<02:16, 1.50s/it]
Training 1/1 epoch (loss 2.5084): 93%|ββββββββββ| 1159/1250 [28:14<02:16, 1.50s/it]
Training 1/1 epoch (loss 2.5084): 93%|ββββββββββ| 1160/1250 [28:14<02:02, 1.36s/it]
Training 1/1 epoch (loss 2.3911): 93%|ββββββββββ| 1160/1250 [28:17<02:02, 1.36s/it]
Training 1/1 epoch (loss 2.3911): 93%|ββββββββββ| 1161/1250 [28:17<02:29, 1.68s/it]
Training 1/1 epoch (loss 2.5214): 93%|ββββββββββ| 1161/1250 [28:19<02:29, 1.68s/it]
Training 1/1 epoch (loss 2.5214): 93%|ββββββββββ| 1162/1250 [28:19<02:36, 1.78s/it]
Training 1/1 epoch (loss 2.4351): 93%|ββββββββββ| 1162/1250 [28:20<02:36, 1.78s/it]
Training 1/1 epoch (loss 2.4351): 93%|ββββββββββ| 1163/1250 [28:20<02:21, 1.62s/it]
Training 1/1 epoch (loss 2.3491): 93%|ββββββββββ| 1163/1250 [28:22<02:21, 1.62s/it]
Training 1/1 epoch (loss 2.3491): 93%|ββββββββββ| 1164/1250 [28:22<02:39, 1.86s/it]
Training 1/1 epoch (loss 2.5583): 93%|ββββββββββ| 1164/1250 [28:24<02:39, 1.86s/it]
Training 1/1 epoch (loss 2.5583): 93%|ββββββββββ| 1165/1250 [28:24<02:32, 1.79s/it]
Training 1/1 epoch (loss 2.6846): 93%|ββββββββββ| 1165/1250 [28:26<02:32, 1.79s/it]
Training 1/1 epoch (loss 2.6846): 93%|ββββββββββ| 1166/1250 [28:26<02:23, 1.71s/it]
Training 1/1 epoch (loss 2.7476): 93%|ββββββββββ| 1166/1250 [28:28<02:23, 1.71s/it]
Training 1/1 epoch (loss 2.7476): 93%|ββββββββββ| 1167/1250 [28:28<02:26, 1.77s/it]
Training 1/1 epoch (loss 2.6697): 93%|ββββββββββ| 1167/1250 [28:29<02:26, 1.77s/it]
Training 1/1 epoch (loss 2.6697): 93%|ββββββββββ| 1168/1250 [28:29<02:12, 1.62s/it]
Training 1/1 epoch (loss 2.3768): 93%|ββββββββββ| 1168/1250 [28:30<02:12, 1.62s/it]
Training 1/1 epoch (loss 2.3768): 94%|ββββββββββ| 1169/1250 [28:30<02:07, 1.58s/it]
Training 1/1 epoch (loss 2.4952): 94%|ββββββββββ| 1169/1250 [28:32<02:07, 1.58s/it]
Training 1/1 epoch (loss 2.4952): 94%|ββββββββββ| 1170/1250 [28:32<02:07, 1.59s/it]
Training 1/1 epoch (loss 2.5288): 94%|ββββββββββ| 1170/1250 [28:32<02:07, 1.59s/it]
Training 1/1 epoch (loss 2.5288): 94%|ββββββββββ| 1171/1250 [28:32<01:41, 1.29s/it]
Training 1/1 epoch (loss 2.6879): 94%|ββββββββββ| 1171/1250 [28:34<01:41, 1.29s/it]
Training 1/1 epoch (loss 2.6879): 94%|ββββββββββ| 1172/1250 [28:34<01:43, 1.33s/it]
Training 1/1 epoch (loss 2.7168): 94%|ββββββββββ| 1172/1250 [28:35<01:43, 1.33s/it]
Training 1/1 epoch (loss 2.7168): 94%|ββββββββββ| 1173/1250 [28:35<01:45, 1.37s/it]
Training 1/1 epoch (loss 2.6382): 94%|ββββββββββ| 1173/1250 [28:36<01:45, 1.37s/it]
Training 1/1 epoch (loss 2.6382): 94%|ββββββββββ| 1174/1250 [28:36<01:23, 1.10s/it]
Training 1/1 epoch (loss 2.6200): 94%|ββββββββββ| 1174/1250 [28:37<01:23, 1.10s/it]
Training 1/1 epoch (loss 2.6200): 94%|ββββββββββ| 1175/1250 [28:37<01:27, 1.17s/it]
Training 1/1 epoch (loss 2.5829): 94%|ββββββββββ| 1175/1250 [28:39<01:27, 1.17s/it]
Training 1/1 epoch (loss 2.5829): 94%|ββββββββββ| 1176/1250 [28:39<01:45, 1.43s/it]
Training 1/1 epoch (loss 2.6541): 94%|ββββββββββ| 1176/1250 [28:41<01:45, 1.43s/it]
Training 1/1 epoch (loss 2.6541): 94%|ββββββββββ| 1177/1250 [28:41<01:42, 1.40s/it]
Training 1/1 epoch (loss 2.6480): 94%|ββββββββββ| 1177/1250 [28:42<01:42, 1.40s/it]
Training 1/1 epoch (loss 2.6480): 94%|ββββββββββ| 1178/1250 [28:42<01:50, 1.53s/it]
Training 1/1 epoch (loss 2.5011): 94%|ββββββββββ| 1178/1250 [28:43<01:50, 1.53s/it]
Training 1/1 epoch (loss 2.5011): 94%|ββββββββββ| 1179/1250 [28:43<01:38, 1.39s/it]
Training 1/1 epoch (loss 2.6760): 94%|ββββββββββ| 1179/1250 [28:44<01:38, 1.39s/it]
Training 1/1 epoch (loss 2.6760): 94%|ββββββββββ| 1180/1250 [28:44<01:28, 1.26s/it]
Training 1/1 epoch (loss 2.6525): 94%|ββββββββββ| 1180/1250 [28:47<01:28, 1.26s/it]
Training 1/1 epoch (loss 2.6525): 94%|ββββββββββ| 1181/1250 [28:47<01:50, 1.60s/it]
Training 1/1 epoch (loss 2.6090): 94%|ββββββββββ| 1181/1250 [28:47<01:50, 1.60s/it]
Training 1/1 epoch (loss 2.6090): 95%|ββββββββββ| 1182/1250 [28:47<01:30, 1.33s/it]
Training 1/1 epoch (loss 2.5906): 95%|ββββββββββ| 1182/1250 [28:50<01:30, 1.33s/it]
Training 1/1 epoch (loss 2.5906): 95%|ββββββββββ| 1183/1250 [28:50<01:52, 1.67s/it]
Training 1/1 epoch (loss 2.5045): 95%|ββββββββββ| 1183/1250 [28:52<01:52, 1.67s/it]
Training 1/1 epoch (loss 2.5045): 95%|ββββββββββ| 1184/1250 [28:52<01:59, 1.81s/it]
Training 1/1 epoch (loss 2.6087): 95%|ββββββββββ| 1184/1250 [28:53<01:59, 1.81s/it]
Training 1/1 epoch (loss 2.6087): 95%|ββββββββββ| 1185/1250 [28:53<01:46, 1.64s/it]
Training 1/1 epoch (loss 2.5958): 95%|ββββββββββ| 1185/1250 [28:55<01:46, 1.64s/it]
Training 1/1 epoch (loss 2.5958): 95%|ββββββββββ| 1186/1250 [28:55<01:39, 1.56s/it]
Training 1/1 epoch (loss 2.7413): 95%|ββββββββββ| 1186/1250 [28:56<01:39, 1.56s/it]
Training 1/1 epoch (loss 2.7413): 95%|ββββββββββ| 1187/1250 [28:56<01:37, 1.55s/it]
Training 1/1 epoch (loss 2.3428): 95%|ββββββββββ| 1187/1250 [28:58<01:37, 1.55s/it]
Training 1/1 epoch (loss 2.3428): 95%|ββββββββββ| 1188/1250 [28:58<01:39, 1.60s/it]
Training 1/1 epoch (loss 2.4637): 95%|ββββββββββ| 1188/1250 [28:59<01:39, 1.60s/it]
Training 1/1 epoch (loss 2.4637): 95%|ββββββββββ| 1189/1250 [28:59<01:35, 1.57s/it]
Training 1/1 epoch (loss 2.8186): 95%|ββββββββββ| 1189/1250 [29:01<01:35, 1.57s/it]
Training 1/1 epoch (loss 2.8186): 95%|ββββββββββ| 1190/1250 [29:01<01:35, 1.58s/it]
Training 1/1 epoch (loss 2.7044): 95%|ββββββββββ| 1190/1250 [29:02<01:35, 1.58s/it]
Training 1/1 epoch (loss 2.7044): 95%|ββββββββββ| 1191/1250 [29:02<01:21, 1.38s/it]
Training 1/1 epoch (loss 2.5824): 95%|ββββββββββ| 1191/1250 [29:04<01:21, 1.38s/it]
Training 1/1 epoch (loss 2.5824): 95%|ββββββββββ| 1192/1250 [29:04<01:25, 1.47s/it]
Training 1/1 epoch (loss 2.5608): 95%|ββββββββββ| 1192/1250 [29:05<01:25, 1.47s/it]
Training 1/1 epoch (loss 2.5608): 95%|ββββββββββ| 1193/1250 [29:05<01:17, 1.36s/it]
Training 1/1 epoch (loss 2.6238): 95%|ββββββββββ| 1193/1250 [29:07<01:17, 1.36s/it]
Training 1/1 epoch (loss 2.6238): 96%|ββββββββββ| 1194/1250 [29:07<01:31, 1.63s/it]
Training 1/1 epoch (loss 2.7050): 96%|ββββββββββ| 1194/1250 [29:09<01:31, 1.63s/it]
Training 1/1 epoch (loss 2.7050): 96%|ββββββββββ| 1195/1250 [29:09<01:32, 1.68s/it]
Training 1/1 epoch (loss 2.8214): 96%|ββββββββββ| 1195/1250 [29:10<01:32, 1.68s/it]
Training 1/1 epoch (loss 2.8214): 96%|ββββββββββ| 1196/1250 [29:10<01:24, 1.56s/it]
Training 1/1 epoch (loss 2.6563): 96%|ββββββββββ| 1196/1250 [29:12<01:24, 1.56s/it]
Training 1/1 epoch (loss 2.6563): 96%|ββββββββββ| 1197/1250 [29:12<01:32, 1.74s/it]
Training 1/1 epoch (loss 2.4924): 96%|ββββββββββ| 1197/1250 [29:13<01:32, 1.74s/it]
Training 1/1 epoch (loss 2.4924): 96%|ββββββββββ| 1198/1250 [29:13<01:16, 1.48s/it]
Training 1/1 epoch (loss 2.5062): 96%|ββββββββββ| 1198/1250 [29:15<01:16, 1.48s/it]
Training 1/1 epoch (loss 2.5062): 96%|ββββββββββ| 1199/1250 [29:15<01:25, 1.69s/it]
Training 1/1 epoch (loss 2.4934): 96%|ββββββββββ| 1199/1250 [29:17<01:25, 1.69s/it]
Training 1/1 epoch (loss 2.4934): 96%|ββββββββββ| 1200/1250 [29:17<01:18, 1.57s/it]
Training 1/1 epoch (loss 2.4510): 96%|ββββββββββ| 1200/1250 [29:18<01:18, 1.57s/it]
Training 1/1 epoch (loss 2.4510): 96%|ββββββββββ| 1201/1250 [29:18<01:21, 1.66s/it]
Training 1/1 epoch (loss 2.3981): 96%|ββββββββββ| 1201/1250 [29:20<01:21, 1.66s/it]
Training 1/1 epoch (loss 2.3981): 96%|ββββββββββ| 1202/1250 [29:20<01:23, 1.75s/it]
Training 1/1 epoch (loss 2.6060): 96%|██████████| 1202/1250 [29:21<01:23, 1.75s/it]
Training 1/1 epoch (loss 2.6060): 96%|██████████| 1203/1250 [29:21<01:05, 1.39s/it]
Training 1/1 epoch (loss 2.4702): 96%|██████████| 1203/1250 [29:22<01:05, 1.39s/it]
Training 1/1 epoch (loss 2.4702): 96%|██████████| 1204/1250 [29:22<01:03, 1.38s/it]
Training 1/1 epoch (loss 2.8554): 96%|██████████| 1204/1250 [29:24<01:03, 1.38s/it]
Training 1/1 epoch (loss 2.8554): 96%|██████████| 1205/1250 [29:24<01:10, 1.56s/it]
Training 1/1 epoch (loss 2.5014): 96%|██████████| 1205/1250 [29:25<01:10, 1.56s/it]
Training 1/1 epoch (loss 2.5014): 96%|██████████| 1206/1250 [29:25<00:56, 1.29s/it]
Training 1/1 epoch (loss 2.5519): 96%|██████████| 1206/1250 [29:26<00:56, 1.29s/it]
Training 1/1 epoch (loss 2.5519): 97%|██████████| 1207/1250 [29:26<00:56, 1.31s/it]
Training 1/1 epoch (loss 2.5414): 97%|██████████| 1207/1250 [29:27<00:56, 1.31s/it]
Training 1/1 epoch (loss 2.5414): 97%|██████████| 1208/1250 [29:27<00:51, 1.23s/it]
Training 1/1 epoch (loss 2.5866): 97%|██████████| 1208/1250 [29:28<00:51, 1.23s/it]
Training 1/1 epoch (loss 2.5866): 97%|██████████| 1209/1250 [29:28<00:46, 1.13s/it]
Training 1/1 epoch (loss 2.7022): 97%|██████████| 1209/1250 [29:31<00:46, 1.13s/it]
Training 1/1 epoch (loss 2.7022): 97%|██████████| 1210/1250 [29:31<01:01, 1.53s/it]
Training 1/1 epoch (loss 2.5918): 97%|██████████| 1210/1250 [29:32<01:01, 1.53s/it]
Training 1/1 epoch (loss 2.5918): 97%|██████████| 1211/1250 [29:32<00:52, 1.34s/it]
Training 1/1 epoch (loss 2.5189): 97%|██████████| 1211/1250 [29:33<00:52, 1.34s/it]
Training 1/1 epoch (loss 2.5189): 97%|██████████| 1212/1250 [29:33<00:50, 1.32s/it]
Training 1/1 epoch (loss 2.6406): 97%|██████████| 1212/1250 [29:34<00:50, 1.32s/it]
Training 1/1 epoch (loss 2.6406): 97%|██████████| 1213/1250 [29:34<00:47, 1.28s/it]
Training 1/1 epoch (loss 2.3997): 97%|██████████| 1213/1250 [29:35<00:47, 1.28s/it]
Training 1/1 epoch (loss 2.3997): 97%|██████████| 1214/1250 [29:35<00:44, 1.24s/it]
Training 1/1 epoch (loss 2.5515): 97%|██████████| 1214/1250 [29:37<00:44, 1.24s/it]
Training 1/1 epoch (loss 2.5515): 97%|██████████| 1215/1250 [29:37<00:44, 1.26s/it]
Training 1/1 epoch (loss 2.8467): 97%|██████████| 1215/1250 [29:39<00:44, 1.26s/it]
Training 1/1 epoch (loss 2.8467): 97%|██████████| 1216/1250 [29:39<00:57, 1.70s/it]
Training 1/1 epoch (loss 2.4436): 97%|██████████| 1216/1250 [29:40<00:57, 1.70s/it]
Training 1/1 epoch (loss 2.4436): 97%|██████████| 1217/1250 [29:40<00:44, 1.35s/it]
Training 1/1 epoch (loss 2.5091): 97%|██████████| 1217/1250 [29:41<00:44, 1.35s/it]
Training 1/1 epoch (loss 2.5091): 97%|██████████| 1218/1250 [29:41<00:45, 1.43s/it]
Training 1/1 epoch (loss 2.6322): 97%|██████████| 1218/1250 [29:43<00:45, 1.43s/it]
Training 1/1 epoch (loss 2.6322): 98%|██████████| 1219/1250 [29:43<00:46, 1.52s/it]
Training 1/1 epoch (loss 2.5312): 98%|██████████| 1219/1250 [29:44<00:46, 1.52s/it]
Training 1/1 epoch (loss 2.5312): 98%|██████████| 1220/1250 [29:44<00:38, 1.28s/it]
Training 1/1 epoch (loss 2.4408): 98%|██████████| 1220/1250 [29:45<00:38, 1.28s/it]
Training 1/1 epoch (loss 2.4408): 98%|██████████| 1221/1250 [29:45<00:34, 1.18s/it]
Training 1/1 epoch (loss 2.4282): 98%|██████████| 1221/1250 [29:46<00:34, 1.18s/it]
Training 1/1 epoch (loss 2.4282): 98%|██████████| 1222/1250 [29:46<00:36, 1.31s/it]
Training 1/1 epoch (loss 2.4532): 98%|██████████| 1222/1250 [29:47<00:36, 1.31s/it]
Training 1/1 epoch (loss 2.4532): 98%|██████████| 1223/1250 [29:47<00:32, 1.19s/it]
Training 1/1 epoch (loss 2.5952): 98%|██████████| 1223/1250 [29:50<00:32, 1.19s/it]
Training 1/1 epoch (loss 2.5952): 98%|██████████| 1224/1250 [29:50<00:39, 1.53s/it]
Training 1/1 epoch (loss 2.6099): 98%|██████████| 1224/1250 [29:50<00:39, 1.53s/it]
Training 1/1 epoch (loss 2.6099): 98%|██████████| 1225/1250 [29:50<00:33, 1.32s/it]
Training 1/1 epoch (loss 2.6889): 98%|██████████| 1225/1250 [29:51<00:33, 1.32s/it]
Training 1/1 epoch (loss 2.6889): 98%|██████████| 1226/1250 [29:51<00:29, 1.23s/it]
Training 1/1 epoch (loss 2.5131): 98%|██████████| 1226/1250 [29:53<00:29, 1.23s/it]
Training 1/1 epoch (loss 2.5131): 98%|██████████| 1227/1250 [29:53<00:33, 1.44s/it]
Training 1/1 epoch (loss 2.6867): 98%|██████████| 1227/1250 [29:54<00:33, 1.44s/it]
Training 1/1 epoch (loss 2.6867): 98%|██████████| 1228/1250 [29:54<00:25, 1.16s/it]
Training 1/1 epoch (loss 2.3714): 98%|██████████| 1228/1250 [29:56<00:25, 1.16s/it]
Training 1/1 epoch (loss 2.3714): 98%|██████████| 1229/1250 [29:56<00:28, 1.36s/it]
Training 1/1 epoch (loss 2.6576): 98%|██████████| 1229/1250 [29:57<00:28, 1.36s/it]
Training 1/1 epoch (loss 2.6576): 98%|██████████| 1230/1250 [29:57<00:26, 1.35s/it]
Training 1/1 epoch (loss 2.4986): 98%|██████████| 1230/1250 [29:58<00:26, 1.35s/it]
Training 1/1 epoch (loss 2.4986): 98%|██████████| 1231/1250 [29:58<00:22, 1.17s/it]
Training 1/1 epoch (loss 2.6246): 98%|██████████| 1231/1250 [30:01<00:22, 1.17s/it]
Training 1/1 epoch (loss 2.6246): 99%|██████████| 1232/1250 [30:01<00:30, 1.71s/it]
Training 1/1 epoch (loss 2.6569): 99%|██████████| 1232/1250 [30:03<00:30, 1.71s/it]
Training 1/1 epoch (loss 2.6569): 99%|██████████| 1233/1250 [30:03<00:30, 1.80s/it]
Training 1/1 epoch (loss 2.6504): 99%|██████████| 1233/1250 [30:04<00:30, 1.80s/it]
Training 1/1 epoch (loss 2.6504): 99%|██████████| 1234/1250 [30:04<00:24, 1.50s/it]
Training 1/1 epoch (loss 2.5479): 99%|██████████| 1234/1250 [30:05<00:24, 1.50s/it]
Training 1/1 epoch (loss 2.5479): 99%|██████████| 1235/1250 [30:05<00:20, 1.39s/it]
Training 1/1 epoch (loss 2.4806): 99%|██████████| 1235/1250 [30:07<00:20, 1.39s/it]
Training 1/1 epoch (loss 2.4806): 99%|██████████| 1236/1250 [30:07<00:21, 1.52s/it]
Training 1/1 epoch (loss 2.4953): 99%|██████████| 1236/1250 [30:07<00:21, 1.52s/it]
Training 1/1 epoch (loss 2.4953): 99%|██████████| 1237/1250 [30:07<00:15, 1.22s/it]
Training 1/1 epoch (loss 2.5670): 99%|██████████| 1237/1250 [30:09<00:15, 1.22s/it]
Training 1/1 epoch (loss 2.5670): 99%|██████████| 1238/1250 [30:09<00:15, 1.28s/it]
Training 1/1 epoch (loss 2.6437): 99%|██████████| 1238/1250 [30:10<00:15, 1.28s/it]
Training 1/1 epoch (loss 2.6437): 99%|██████████| 1239/1250 [30:10<00:14, 1.28s/it]
Training 1/1 epoch (loss 2.5759): 99%|██████████| 1239/1250 [30:11<00:14, 1.28s/it]
Training 1/1 epoch (loss 2.5759): 99%|██████████| 1240/1250 [30:11<00:11, 1.16s/it]
Training 1/1 epoch (loss 2.6161): 99%|██████████| 1240/1250 [30:12<00:11, 1.16s/it]
Training 1/1 epoch (loss 2.6161): 99%|██████████| 1241/1250 [30:12<00:11, 1.30s/it]
Training 1/1 epoch (loss 2.6190): 99%|██████████| 1241/1250 [30:13<00:11, 1.30s/it]
Training 1/1 epoch (loss 2.6190): 99%|██████████| 1242/1250 [30:13<00:09, 1.15s/it]
Training 1/1 epoch (loss 2.7743): 99%|██████████| 1242/1250 [30:15<00:09, 1.15s/it]
Training 1/1 epoch (loss 2.7743): 99%|██████████| 1243/1250 [30:15<00:09, 1.30s/it]
Training 1/1 epoch (loss 2.7077): 99%|██████████| 1243/1250 [30:17<00:09, 1.30s/it]
Training 1/1 epoch (loss 2.7077): 100%|██████████| 1244/1250 [30:17<00:09, 1.55s/it]
Training 1/1 epoch (loss 2.8131): 100%|██████████| 1244/1250 [30:17<00:09, 1.55s/it]
Training 1/1 epoch (loss 2.8131): 100%|██████████| 1245/1250 [30:17<00:06, 1.21s/it]
Training 1/1 epoch (loss 2.4559): 100%|██████████| 1245/1250 [30:19<00:06, 1.21s/it]
Training 1/1 epoch (loss 2.4559): 100%|██████████| 1246/1250 [30:19<00:05, 1.40s/it]
Training 1/1 epoch (loss 2.7500): 100%|██████████| 1246/1250 [30:21<00:05, 1.40s/it]
Training 1/1 epoch (loss 2.7500): 100%|██████████| 1247/1250 [30:21<00:05, 1.67s/it]
Training 1/1 epoch (loss 2.6166): 100%|██████████| 1247/1250 [30:22<00:05, 1.67s/it]
Training 1/1 epoch (loss 2.6166): 100%|██████████| 1248/1250 [30:22<00:02, 1.32s/it]
Training 1/1 epoch (loss 2.7382): 100%|██████████| 1248/1250 [30:24<00:02, 1.32s/it]
Training 1/1 epoch (loss 2.7382): 100%|██████████| 1249/1250 [30:24<00:01, 1.67s/it]
Training 1/1 epoch (loss 2.3804): 100%|██████████| 1249/1250 [30:26<00:01, 1.67s/it]
Training 1/1 epoch (loss 2.3804): 100%|██████████| 1250/1250 [30:26<00:00, 1.51s/it]
Training 1/1 epoch (loss 2.3804): 100%|██████████| 1250/1250 [30:26<00:00, 1.46s/it] |