Training 1/1 epoch (loss 2.0044): 0%| | 0/625 [00:05<?, ?it/s]
Training 1/1 epoch (loss 2.0044): 0%| | 1/625 [00:05<1:00:37, 5.83s/it]
Training 1/1 epoch (loss 2.0360): 0%| | 1/625 [00:07<1:00:37, 5.83s/it]
Training 1/1 epoch (loss 2.0360): 0%| | 2/625 [00:07<36:41, 3.53s/it]
Training 1/1 epoch (loss 1.9986): 0%| | 2/625 [00:08<36:41, 3.53s/it]
Training 1/1 epoch (loss 1.9986): 0%| | 3/625 [00:08<22:05, 2.13s/it]
Training 1/1 epoch (loss 2.0484): 0%| | 3/625 [00:08<22:05, 2.13s/it]
Training 1/1 epoch (loss 2.0484): 1%| | 4/625 [00:08<14:56, 1.44s/it]
Training 1/1 epoch (loss 2.0813): 1%| | 4/625 [00:08<14:56, 1.44s/it]
Training 1/1 epoch (loss 2.0813): 1%| | 5/625 [00:08<10:54, 1.06s/it]
Training 1/1 epoch (loss 1.9533): 1%| | 5/625 [00:09<10:54, 1.06s/it]
Training 1/1 epoch (loss 1.9533): 1%| | 6/625 [00:09<08:25, 1.23it/s]
Training 1/1 epoch (loss 1.9202): 1%| | 6/625 [00:09<08:25, 1.23it/s]
Training 1/1 epoch (loss 1.9202): 1%| | 7/625 [00:09<06:49, 1.51it/s]
Training 1/1 epoch (loss 1.9225): 1%| | 7/625 [00:10<06:49, 1.51it/s]
Training 1/1 epoch (loss 1.9225): 1%|β | 8/625 [00:10<06:16, 1.64it/s]
Training 1/1 epoch (loss 1.8820): 1%|β | 8/625 [00:10<06:16, 1.64it/s]
Training 1/1 epoch (loss 1.8820): 1%|β | 9/625 [00:10<05:38, 1.82it/s]
Training 1/1 epoch (loss 1.8777): 1%|β | 9/625 [00:11<05:38, 1.82it/s]
Training 1/1 epoch (loss 1.8777): 2%|β | 10/625 [00:11<05:29, 1.87it/s]
Training 1/1 epoch (loss 1.8593): 2%|β | 10/625 [00:11<05:29, 1.87it/s]
Training 1/1 epoch (loss 1.8593): 2%|β | 11/625 [00:11<05:20, 1.92it/s]
Training 1/1 epoch (loss 1.7939): 2%|β | 11/625 [00:11<05:20, 1.92it/s]
Training 1/1 epoch (loss 1.7939): 2%|β | 12/625 [00:11<04:53, 2.09it/s]
Training 1/1 epoch (loss 1.8386): 2%|β | 12/625 [00:12<04:53, 2.09it/s]
Training 1/1 epoch (loss 1.8386): 2%|β | 13/625 [00:12<04:39, 2.19it/s]
Training 1/1 epoch (loss 1.8954): 2%|β | 13/625 [00:12<04:39, 2.19it/s]
Training 1/1 epoch (loss 1.8954): 2%|β | 14/625 [00:12<04:32, 2.24it/s]
Training 1/1 epoch (loss 1.8885): 2%|β | 14/625 [00:13<04:32, 2.24it/s]
Training 1/1 epoch (loss 1.8885): 2%|β | 15/625 [00:13<04:16, 2.38it/s]
Training 1/1 epoch (loss 2.0033): 2%|β | 15/625 [00:13<04:16, 2.38it/s]
Training 1/1 epoch (loss 2.0033): 3%|β | 16/625 [00:13<04:24, 2.30it/s]
Training 1/1 epoch (loss 1.8278): 3%|β | 16/625 [00:14<04:24, 2.30it/s]
Training 1/1 epoch (loss 1.8278): 3%|β | 17/625 [00:14<04:16, 2.37it/s]
Training 1/1 epoch (loss 1.6787): 3%|β | 17/625 [00:14<04:16, 2.37it/s]
Training 1/1 epoch (loss 1.6787): 3%|β | 18/625 [00:14<04:15, 2.37it/s]
Training 1/1 epoch (loss 1.7157): 3%|β | 18/625 [00:14<04:15, 2.37it/s]
Training 1/1 epoch (loss 1.7157): 3%|β | 19/625 [00:14<04:17, 2.35it/s]
Training 1/1 epoch (loss 1.7396): 3%|β | 19/625 [00:15<04:17, 2.35it/s]
Training 1/1 epoch (loss 1.7396): 3%|β | 20/625 [00:15<04:03, 2.49it/s]
Training 1/1 epoch (loss 1.7724): 3%|β | 20/625 [00:15<04:03, 2.49it/s]
Training 1/1 epoch (loss 1.7724): 3%|β | 21/625 [00:15<03:54, 2.58it/s]
Training 1/1 epoch (loss 1.7079): 3%|β | 21/625 [00:15<03:54, 2.58it/s]
Training 1/1 epoch (loss 1.7079): 4%|β | 22/625 [00:15<03:49, 2.62it/s]
Training 1/1 epoch (loss 1.6790): 4%|β | 22/625 [00:16<03:49, 2.62it/s]
Training 1/1 epoch (loss 1.6790): 4%|β | 23/625 [00:16<03:53, 2.58it/s]
Training 1/1 epoch (loss 1.7280): 4%|β | 23/625 [00:16<03:53, 2.58it/s]
Training 1/1 epoch (loss 1.7280): 4%|β | 24/625 [00:16<04:05, 2.45it/s]
Training 1/1 epoch (loss 1.6257): 4%|β | 24/625 [00:17<04:05, 2.45it/s]
Training 1/1 epoch (loss 1.6257): 4%|β | 25/625 [00:17<04:04, 2.46it/s]
Training 1/1 epoch (loss 1.6377): 4%|β | 25/625 [00:17<04:04, 2.46it/s]
Training 1/1 epoch (loss 1.6377): 4%|β | 26/625 [00:17<03:54, 2.56it/s]
Training 1/1 epoch (loss 1.7331): 4%|β | 26/625 [00:18<03:54, 2.56it/s]
Training 1/1 epoch (loss 1.7331): 4%|β | 27/625 [00:18<04:12, 2.37it/s]
Training 1/1 epoch (loss 1.6777): 4%|β | 27/625 [00:18<04:12, 2.37it/s]
Training 1/1 epoch (loss 1.6777): 4%|β | 28/625 [00:18<05:00, 1.99it/s]
Training 1/1 epoch (loss 1.6170): 4%|β | 28/625 [00:19<05:00, 1.99it/s]
Training 1/1 epoch (loss 1.6170): 5%|β | 29/625 [00:19<04:40, 2.13it/s]
Training 1/1 epoch (loss 1.7382): 5%|β | 29/625 [00:19<04:40, 2.13it/s]
Training 1/1 epoch (loss 1.7382): 5%|β | 30/625 [00:19<04:20, 2.28it/s]
Training 1/1 epoch (loss 1.7477): 5%|β | 30/625 [00:19<04:20, 2.28it/s]
Training 1/1 epoch (loss 1.7477): 5%|β | 31/625 [00:19<04:04, 2.43it/s]
Training 1/1 epoch (loss 1.7107): 5%|β | 31/625 [00:20<04:04, 2.43it/s]
Training 1/1 epoch (loss 1.7107): 5%|β | 32/625 [00:20<04:04, 2.43it/s]
Training 1/1 epoch (loss 1.7415): 5%|β | 32/625 [00:20<04:04, 2.43it/s]
Training 1/1 epoch (loss 1.7415): 5%|β | 33/625 [00:20<04:00, 2.46it/s]
Training 1/1 epoch (loss 1.5139): 5%|β | 33/625 [00:21<04:00, 2.46it/s]
Training 1/1 epoch (loss 1.5139): 5%|β | 34/625 [00:21<03:59, 2.46it/s]
Training 1/1 epoch (loss 1.5641): 5%|β | 34/625 [00:21<03:59, 2.46it/s]
Training 1/1 epoch (loss 1.5641): 6%|β | 35/625 [00:21<03:53, 2.52it/s]
Training 1/1 epoch (loss 1.5476): 6%|β | 35/625 [00:21<03:53, 2.52it/s]
Training 1/1 epoch (loss 1.5476): 6%|β | 36/625 [00:21<03:48, 2.58it/s]
Training 1/1 epoch (loss 1.7826): 6%|β | 36/625 [00:22<03:48, 2.58it/s]
Training 1/1 epoch (loss 1.7826): 6%|β | 37/625 [00:22<03:56, 2.49it/s]
Training 1/1 epoch (loss 1.6069): 6%|β | 37/625 [00:22<03:56, 2.49it/s]
Training 1/1 epoch (loss 1.6069): 6%|β | 38/625 [00:22<03:57, 2.47it/s]
Training 1/1 epoch (loss 1.6117): 6%|β | 38/625 [00:23<03:57, 2.47it/s]
Training 1/1 epoch (loss 1.6117): 6%|β | 39/625 [00:23<03:57, 2.47it/s]
Training 1/1 epoch (loss 1.4835): 6%|β | 39/625 [00:23<03:57, 2.47it/s]
Training 1/1 epoch (loss 1.4835): 6%|β | 40/625 [00:23<03:59, 2.44it/s]
Training 1/1 epoch (loss 1.5841): 6%|β | 40/625 [00:23<03:59, 2.44it/s]
Training 1/1 epoch (loss 1.5841): 7%|β | 41/625 [00:23<03:52, 2.51it/s]
Training 1/1 epoch (loss 1.5176): 7%|β | 41/625 [00:24<03:52, 2.51it/s]
Training 1/1 epoch (loss 1.5176): 7%|β | 42/625 [00:24<03:51, 2.52it/s]
Training 1/1 epoch (loss 1.5982): 7%|β | 42/625 [00:24<03:51, 2.52it/s]
Training 1/1 epoch (loss 1.5982): 7%|β | 43/625 [00:24<03:54, 2.48it/s]
Training 1/1 epoch (loss 1.5213): 7%|β | 43/625 [00:25<03:54, 2.48it/s]
Training 1/1 epoch (loss 1.5213): 7%|β | 44/625 [00:25<03:55, 2.47it/s]
Training 1/1 epoch (loss 1.5993): 7%|β | 44/625 [00:25<03:55, 2.47it/s]
Training 1/1 epoch (loss 1.5993): 7%|β | 45/625 [00:25<03:51, 2.51it/s]
Training 1/1 epoch (loss 1.6557): 7%|β | 45/625 [00:25<03:51, 2.51it/s]
Training 1/1 epoch (loss 1.6557): 7%|β | 46/625 [00:25<03:43, 2.59it/s]
Training 1/1 epoch (loss 1.6516): 7%|β | 46/625 [00:26<03:43, 2.59it/s]
Training 1/1 epoch (loss 1.6516): 8%|β | 47/625 [00:26<03:41, 2.61it/s]
Training 1/1 epoch (loss 1.6376): 8%|β | 47/625 [00:26<03:41, 2.61it/s]
Training 1/1 epoch (loss 1.6376): 8%|β | 48/625 [00:26<03:57, 2.43it/s]
Training 1/1 epoch (loss 1.5749): 8%|β | 48/625 [00:27<03:57, 2.43it/s]
Training 1/1 epoch (loss 1.5749): 8%|β | 49/625 [00:27<03:59, 2.41it/s]
Training 1/1 epoch (loss 1.5121): 8%|β | 49/625 [00:27<03:59, 2.41it/s]
Training 1/1 epoch (loss 1.5121): 8%|β | 50/625 [00:27<03:48, 2.52it/s]
Training 1/1 epoch (loss 1.4984): 8%|β | 50/625 [00:27<03:48, 2.52it/s]
Training 1/1 epoch (loss 1.4984): 8%|β | 51/625 [00:27<03:41, 2.60it/s]
Training 1/1 epoch (loss 1.5231): 8%|β | 51/625 [00:28<03:41, 2.60it/s]
Training 1/1 epoch (loss 1.5231): 8%|β | 52/625 [00:28<03:36, 2.64it/s]
Training 1/1 epoch (loss 1.5070): 8%|β | 52/625 [00:28<03:36, 2.64it/s]
Training 1/1 epoch (loss 1.5070): 8%|β | 53/625 [00:28<03:43, 2.56it/s]
Training 1/1 epoch (loss 1.5511): 8%|β | 53/625 [00:29<03:43, 2.56it/s]
Training 1/1 epoch (loss 1.5511): 9%|β | 54/625 [00:29<03:52, 2.46it/s]
Training 1/1 epoch (loss 1.6091): 9%|β | 54/625 [00:29<03:52, 2.46it/s]
Training 1/1 epoch (loss 1.6091): 9%|β | 55/625 [00:29<03:49, 2.48it/s]
Training 1/1 epoch (loss 1.6875): 9%|β | 55/625 [00:29<03:49, 2.48it/s]
Training 1/1 epoch (loss 1.6875): 9%|β | 56/625 [00:29<03:43, 2.55it/s]
Training 1/1 epoch (loss 1.5898): 9%|β | 56/625 [00:30<03:43, 2.55it/s]
Training 1/1 epoch (loss 1.5898): 9%|β | 57/625 [00:30<03:52, 2.44it/s]
Training 1/1 epoch (loss 1.5863): 9%|β | 57/625 [00:30<03:52, 2.44it/s]
Training 1/1 epoch (loss 1.5863): 9%|β | 58/625 [00:30<04:02, 2.34it/s]
Training 1/1 epoch (loss 1.4697): 9%|β | 58/625 [00:31<04:02, 2.34it/s]
Training 1/1 epoch (loss 1.4697): 9%|β | 59/625 [00:31<03:55, 2.40it/s]
Training 1/1 epoch (loss 1.5849): 9%|β | 59/625 [00:31<03:55, 2.40it/s]
Training 1/1 epoch (loss 1.5849): 10%|β | 60/625 [00:31<03:44, 2.52it/s]
Training 1/1 epoch (loss 1.5517): 10%|β | 60/625 [00:31<03:44, 2.52it/s]
Training 1/1 epoch (loss 1.5517): 10%|β | 61/625 [00:31<03:48, 2.46it/s]
Training 1/1 epoch (loss 1.5059): 10%|β | 61/625 [00:32<03:48, 2.46it/s]
Training 1/1 epoch (loss 1.5059): 10%|β | 62/625 [00:32<03:46, 2.49it/s]
Training 1/1 epoch (loss 1.4923): 10%|β | 62/625 [00:32<03:46, 2.49it/s]
Training 1/1 epoch (loss 1.4923): 10%|β | 63/625 [00:32<03:38, 2.57it/s]
Training 1/1 epoch (loss 1.5338): 10%|β | 63/625 [00:33<03:38, 2.57it/s]
Training 1/1 epoch (loss 1.5338): 10%|β | 64/625 [00:33<03:52, 2.42it/s]
Training 1/1 epoch (loss 1.4603): 10%|β | 64/625 [00:33<03:52, 2.42it/s]
Training 1/1 epoch (loss 1.4603): 10%|β | 65/625 [00:33<03:50, 2.43it/s]
Training 1/1 epoch (loss 1.5408): 10%|β | 65/625 [00:33<03:50, 2.43it/s]
Training 1/1 epoch (loss 1.5408): 11%|β | 66/625 [00:33<03:52, 2.40it/s]
Training 1/1 epoch (loss 1.5752): 11%|β | 66/625 [00:34<03:52, 2.40it/s]
Training 1/1 epoch (loss 1.5752): 11%|β | 67/625 [00:34<03:50, 2.42it/s]
Training 1/1 epoch (loss 1.6184): 11%|β | 67/625 [00:34<03:50, 2.42it/s]
Training 1/1 epoch (loss 1.6184): 11%|β | 68/625 [00:34<03:55, 2.37it/s]
Training 1/1 epoch (loss 1.6209): 11%|β | 68/625 [00:35<03:55, 2.37it/s]
Training 1/1 epoch (loss 1.6209): 11%|β | 69/625 [00:35<04:00, 2.31it/s]
Training 1/1 epoch (loss 1.4759): 11%|β | 69/625 [00:35<04:00, 2.31it/s]
Training 1/1 epoch (loss 1.4759): 11%|β | 70/625 [00:35<03:47, 2.44it/s]
Training 1/1 epoch (loss 1.5932): 11%|β | 70/625 [00:35<03:47, 2.44it/s]
Training 1/1 epoch (loss 1.5932): 11%|ββ | 71/625 [00:35<03:38, 2.53it/s]
Training 1/1 epoch (loss 1.5415): 11%|ββ | 71/625 [00:36<03:38, 2.53it/s]
Training 1/1 epoch (loss 1.5415): 12%|ββ | 72/625 [00:36<03:47, 2.43it/s]
Training 1/1 epoch (loss 1.5370): 12%|ββ | 72/625 [00:36<03:47, 2.43it/s]
Training 1/1 epoch (loss 1.5370): 12%|ββ | 73/625 [00:36<03:47, 2.43it/s]
Training 1/1 epoch (loss 1.5500): 12%|ββ | 73/625 [00:37<03:47, 2.43it/s]
Training 1/1 epoch (loss 1.5500): 12%|ββ | 74/625 [00:37<03:58, 2.31it/s]
Training 1/1 epoch (loss 1.5504): 12%|ββ | 74/625 [00:37<03:58, 2.31it/s]
Training 1/1 epoch (loss 1.5504): 12%|ββ | 75/625 [00:37<03:46, 2.43it/s]
Training 1/1 epoch (loss 1.5687): 12%|ββ | 75/625 [00:38<03:46, 2.43it/s]
Training 1/1 epoch (loss 1.5687): 12%|ββ | 76/625 [00:38<03:36, 2.54it/s]
Training 1/1 epoch (loss 1.6096): 12%|ββ | 76/625 [00:38<03:36, 2.54it/s]
Training 1/1 epoch (loss 1.6096): 12%|ββ | 77/625 [00:38<03:36, 2.54it/s]
Training 1/1 epoch (loss 1.5812): 12%|ββ | 77/625 [00:38<03:36, 2.54it/s]
Training 1/1 epoch (loss 1.5812): 12%|ββ | 78/625 [00:38<03:37, 2.52it/s]
Training 1/1 epoch (loss 1.5254): 12%|ββ | 78/625 [00:39<03:37, 2.52it/s]
Training 1/1 epoch (loss 1.5254): 13%|ββ | 79/625 [00:39<03:36, 2.52it/s]
Training 1/1 epoch (loss 1.5408): 13%|ββ | 79/625 [00:39<03:36, 2.52it/s]
Training 1/1 epoch (loss 1.5408): 13%|ββ | 80/625 [00:39<03:38, 2.50it/s]
Training 1/1 epoch (loss 1.5830): 13%|ββ | 80/625 [00:39<03:38, 2.50it/s]
Training 1/1 epoch (loss 1.5830): 13%|ββ | 81/625 [00:39<03:33, 2.55it/s]
Training 1/1 epoch (loss 1.4958): 13%|ββ | 81/625 [00:40<03:33, 2.55it/s]
Training 1/1 epoch (loss 1.4958): 13%|ββ | 82/625 [00:40<03:37, 2.50it/s]
Training 1/1 epoch (loss 1.4910): 13%|ββ | 82/625 [00:40<03:37, 2.50it/s]
Training 1/1 epoch (loss 1.4910): 13%|ββ | 83/625 [00:40<03:34, 2.52it/s]
Training 1/1 epoch (loss 1.4867): 13%|ββ | 83/625 [00:41<03:34, 2.52it/s]
Training 1/1 epoch (loss 1.4867): 13%|ββ | 84/625 [00:41<03:32, 2.55it/s]
Training 1/1 epoch (loss 1.5822): 13%|ββ | 84/625 [00:41<03:32, 2.55it/s]
Training 1/1 epoch (loss 1.5822): 14%|ββ | 85/625 [00:41<03:25, 2.63it/s]
Training 1/1 epoch (loss 1.5827): 14%|ββ | 85/625 [00:41<03:25, 2.63it/s]
Training 1/1 epoch (loss 1.5827): 14%|ββ | 86/625 [00:41<03:22, 2.67it/s]
Training 1/1 epoch (loss 1.4461): 14%|ββ | 86/625 [00:42<03:22, 2.67it/s]
Training 1/1 epoch (loss 1.4461): 14%|ββ | 87/625 [00:42<03:29, 2.57it/s]
Training 1/1 epoch (loss 1.5685): 14%|ββ | 87/625 [00:42<03:29, 2.57it/s]
Training 1/1 epoch (loss 1.5685): 14%|ββ | 88/625 [00:42<03:32, 2.53it/s]
Training 1/1 epoch (loss 1.5349): 14%|ββ | 88/625 [00:43<03:32, 2.53it/s]
Training 1/1 epoch (loss 1.5349): 14%|ββ | 89/625 [00:43<03:43, 2.40it/s]
Training 1/1 epoch (loss 1.6114): 14%|ββ | 89/625 [00:43<03:43, 2.40it/s]
Training 1/1 epoch (loss 1.6114): 14%|ββ | 90/625 [00:43<03:33, 2.51it/s]
Training 1/1 epoch (loss 1.5974): 14%|ββ | 90/625 [00:43<03:33, 2.51it/s]
Training 1/1 epoch (loss 1.5974): 15%|ββ | 91/625 [00:43<03:31, 2.52it/s]
Training 1/1 epoch (loss 1.4644): 15%|ββ | 91/625 [00:44<03:31, 2.52it/s]
Training 1/1 epoch (loss 1.4644): 15%|ββ | 92/625 [00:44<03:29, 2.55it/s]
Training 1/1 epoch (loss 1.4791): 15%|ββ | 92/625 [00:44<03:29, 2.55it/s]
Training 1/1 epoch (loss 1.4791): 15%|ββ | 93/625 [00:44<03:28, 2.55it/s]
Training 1/1 epoch (loss 1.4790): 15%|ββ | 93/625 [00:45<03:28, 2.55it/s]
Training 1/1 epoch (loss 1.4790): 15%|ββ | 94/625 [00:45<03:37, 2.44it/s]
Training 1/1 epoch (loss 1.5422): 15%|ββ | 94/625 [00:45<03:37, 2.44it/s]
Training 1/1 epoch (loss 1.5422): 15%|ββ | 95/625 [00:45<03:35, 2.46it/s]
Training 1/1 epoch (loss 1.6614): 15%|ββ | 95/625 [00:45<03:35, 2.46it/s]
Training 1/1 epoch (loss 1.6614): 15%|ββ | 96/625 [00:45<03:31, 2.50it/s]
Training 1/1 epoch (loss 1.4989): 15%|ββ | 96/625 [00:46<03:31, 2.50it/s]
Training 1/1 epoch (loss 1.4989): 16%|ββ | 97/625 [00:46<03:37, 2.43it/s]
Training 1/1 epoch (loss 1.4558): 16%|ββ | 97/625 [00:46<03:37, 2.43it/s]
Training 1/1 epoch (loss 1.4558): 16%|ββ | 98/625 [00:46<03:33, 2.46it/s]
Training 1/1 epoch (loss 1.5780): 16%|ββ | 98/625 [00:47<03:33, 2.46it/s]
Training 1/1 epoch (loss 1.5780): 16%|ββ | 99/625 [00:47<03:34, 2.46it/s]
Training 1/1 epoch (loss 1.5962): 16%|ββ | 99/625 [00:47<03:34, 2.46it/s]
Training 1/1 epoch (loss 1.5962): 16%|ββ | 100/625 [00:47<03:31, 2.48it/s]
Training 1/1 epoch (loss 1.7026): 16%|ββ | 100/625 [00:48<03:31, 2.48it/s]
Training 1/1 epoch (loss 1.7026): 16%|ββ | 101/625 [00:48<03:41, 2.37it/s]
Training 1/1 epoch (loss 1.6318): 16%|ββ | 101/625 [00:48<03:41, 2.37it/s]
Training 1/1 epoch (loss 1.6318): 16%|ββ | 102/625 [00:48<03:56, 2.22it/s]
Training 1/1 epoch (loss 1.5483): 16%|ββ | 102/625 [00:49<03:56, 2.22it/s]
Training 1/1 epoch (loss 1.5483): 16%|ββ | 103/625 [00:49<03:55, 2.22it/s]
Training 1/1 epoch (loss 1.5309): 16%|ββ | 103/625 [00:49<03:55, 2.22it/s]
Training 1/1 epoch (loss 1.5309): 17%|ββ | 104/625 [00:49<03:48, 2.28it/s]
Training 1/1 epoch (loss 1.4608): 17%|ββ | 104/625 [00:49<03:48, 2.28it/s]
Training 1/1 epoch (loss 1.4608): 17%|ββ | 105/625 [00:49<03:35, 2.41it/s]
Training 1/1 epoch (loss 1.3976): 17%|ββ | 105/625 [00:50<03:35, 2.41it/s]
Training 1/1 epoch (loss 1.3976): 17%|ββ | 106/625 [00:50<03:26, 2.51it/s]
Training 1/1 epoch (loss 1.4493): 17%|ββ | 106/625 [00:50<03:26, 2.51it/s]
Training 1/1 epoch (loss 1.4493): 17%|ββ | 107/625 [00:50<03:20, 2.58it/s]
Training 1/1 epoch (loss 1.6155): 17%|ββ | 107/625 [00:50<03:20, 2.58it/s]
Training 1/1 epoch (loss 1.6155): 17%|ββ | 108/625 [00:50<03:21, 2.57it/s]
Training 1/1 epoch (loss 1.4827): 17%|ββ | 108/625 [00:51<03:21, 2.57it/s]
Training 1/1 epoch (loss 1.4827): 17%|ββ | 109/625 [00:51<03:20, 2.58it/s]
Training 1/1 epoch (loss 1.5230): 17%|ββ | 109/625 [00:51<03:20, 2.58it/s]
Training 1/1 epoch (loss 1.5230): 18%|ββ | 110/625 [00:51<03:18, 2.60it/s]
Training 1/1 epoch (loss 1.5804): 18%|ββ | 110/625 [00:52<03:18, 2.60it/s]
Training 1/1 epoch (loss 1.5804): 18%|ββ | 111/625 [00:52<03:11, 2.69it/s]
Training 1/1 epoch (loss 1.5234): 18%|ββ | 111/625 [00:52<03:11, 2.69it/s]
Training 1/1 epoch (loss 1.5234): 18%|ββ | 112/625 [00:52<03:17, 2.60it/s]
Training 1/1 epoch (loss 1.4626): 18%|ββ | 112/625 [00:52<03:17, 2.60it/s]
Training 1/1 epoch (loss 1.4626): 18%|ββ | 113/625 [00:52<03:29, 2.44it/s]
Training 1/1 epoch (loss 1.4844): 18%|ββ | 113/625 [00:53<03:29, 2.44it/s]
Training 1/1 epoch (loss 1.4844): 18%|ββ | 114/625 [00:53<03:32, 2.41it/s]
Training 1/1 epoch (loss 1.6099): 18%|ββ | 114/625 [00:53<03:32, 2.41it/s]
Training 1/1 epoch (loss 1.6099): 18%|ββ | 115/625 [00:53<03:23, 2.51it/s]
Training 1/1 epoch (loss 1.5950): 18%|ββ | 115/625 [00:54<03:23, 2.51it/s]
Training 1/1 epoch (loss 1.5950): 19%|ββ | 116/625 [00:54<03:22, 2.51it/s]
Training 1/1 epoch (loss 1.5435): 19%|ββ | 116/625 [00:54<03:22, 2.51it/s]
Training 1/1 epoch (loss 1.5435): 19%|ββ | 117/625 [00:54<03:20, 2.54it/s]
Training 1/1 epoch (loss 1.5871): 19%|ββ | 117/625 [00:54<03:20, 2.54it/s]
Training 1/1 epoch (loss 1.5871): 19%|ββ | 118/625 [00:54<03:14, 2.60it/s]
Training 1/1 epoch (loss 1.3456): 19%|ββ | 118/625 [00:55<03:14, 2.60it/s]
Training 1/1 epoch (loss 1.3456): 19%|ββ | 119/625 [00:55<03:19, 2.54it/s]
Training 1/1 epoch (loss 1.5737): 19%|ββ | 119/625 [00:55<03:19, 2.54it/s]
Training 1/1 epoch (loss 1.5737): 19%|ββ | 120/625 [00:55<03:17, 2.56it/s]
Training 1/1 epoch (loss 1.5598): 19%|ββ | 120/625 [00:55<03:17, 2.56it/s]
Training 1/1 epoch (loss 1.5598): 19%|ββ | 121/625 [00:55<03:12, 2.62it/s]
Training 1/1 epoch (loss 1.4611): 19%|ββ | 121/625 [00:56<03:12, 2.62it/s]
Training 1/1 epoch (loss 1.4611): 20%|ββ | 122/625 [00:56<03:13, 2.60it/s]
Training 1/1 epoch (loss 1.5637): 20%|ββ | 122/625 [00:56<03:13, 2.60it/s]
Training 1/1 epoch (loss 1.5637): 20%|ββ | 123/625 [00:56<03:13, 2.59it/s]
Training 1/1 epoch (loss 1.4838): 20%|ββ | 123/625 [00:57<03:13, 2.59it/s]
Training 1/1 epoch (loss 1.4838): 20%|ββ | 124/625 [00:57<03:13, 2.58it/s]
Training 1/1 epoch (loss 1.5471): 20%|ββ | 124/625 [00:57<03:13, 2.58it/s]
Training 1/1 epoch (loss 1.5471): 20%|ββ | 125/625 [00:57<03:22, 2.46it/s]
Training 1/1 epoch (loss 1.6359): 20%|ββ | 125/625 [00:57<03:22, 2.46it/s]
Training 1/1 epoch (loss 1.6359): 20%|ββ | 126/625 [00:57<03:17, 2.53it/s]
Training 1/1 epoch (loss 1.4733): 20%|ββ | 126/625 [00:58<03:17, 2.53it/s]
Training 1/1 epoch (loss 1.4733): 20%|ββ | 127/625 [00:58<03:15, 2.55it/s]
Training 1/1 epoch (loss 1.4793): 20%|ββ | 127/625 [00:58<03:15, 2.55it/s]
Training 1/1 epoch (loss 1.4793): 20%|ββ | 128/625 [00:58<03:13, 2.57it/s]
Training 1/1 epoch (loss 1.5390): 20%|ββ | 128/625 [00:59<03:13, 2.57it/s]
Training 1/1 epoch (loss 1.5390): 21%|ββ | 129/625 [00:59<03:17, 2.51it/s]
Training 1/1 epoch (loss 1.5765): 21%|ββ | 129/625 [00:59<03:17, 2.51it/s]
Training 1/1 epoch (loss 1.5765): 21%|ββ | 130/625 [00:59<03:17, 2.51it/s]
Training 1/1 epoch (loss 1.4901): 21%|ββ | 130/625 [00:59<03:17, 2.51it/s]
Training 1/1 epoch (loss 1.4901): 21%|ββ | 131/625 [00:59<03:13, 2.55it/s]
Training 1/1 epoch (loss 1.4386): 21%|ββ | 131/625 [01:00<03:13, 2.55it/s]
Training 1/1 epoch (loss 1.4386): 21%|ββ | 132/625 [01:00<03:14, 2.54it/s]
Training 1/1 epoch (loss 1.6155): 21%|ββ | 132/625 [01:00<03:14, 2.54it/s]
Training 1/1 epoch (loss 1.6155): 21%|βββ | 133/625 [01:00<03:21, 2.44it/s]
Training 1/1 epoch (loss 1.5511): 21%|βββ | 133/625 [01:01<03:21, 2.44it/s]
Training 1/1 epoch (loss 1.5511): 21%|βββ | 134/625 [01:01<03:18, 2.47it/s]
Training 1/1 epoch (loss 1.5507): 21%|βββ | 134/625 [01:01<03:18, 2.47it/s]
Training 1/1 epoch (loss 1.5507): 22%|βββ | 135/625 [01:01<03:16, 2.50it/s]
Training 1/1 epoch (loss 1.5054): 22%|βββ | 135/625 [01:01<03:16, 2.50it/s]
Training 1/1 epoch (loss 1.5054): 22%|βββ | 136/625 [01:01<03:11, 2.55it/s]
Training 1/1 epoch (loss 1.5368): 22%|βββ | 136/625 [01:02<03:11, 2.55it/s]
Training 1/1 epoch (loss 1.5368): 22%|βββ | 137/625 [01:02<03:11, 2.54it/s]
Training 1/1 epoch (loss 1.5253): 22%|βββ | 137/625 [01:02<03:11, 2.54it/s]
Training 1/1 epoch (loss 1.5253): 22%|βββ | 138/625 [01:02<03:13, 2.52it/s]
Training 1/1 epoch (loss 1.5704): 22%|βββ | 138/625 [01:03<03:13, 2.52it/s]
Training 1/1 epoch (loss 1.5704): 22%|βββ | 139/625 [01:03<03:13, 2.51it/s]
Training 1/1 epoch (loss 1.5449): 22%|βββ | 139/625 [01:03<03:13, 2.51it/s]
Training 1/1 epoch (loss 1.5449): 22%|βββ | 140/625 [01:03<03:20, 2.42it/s]
Training 1/1 epoch (loss 1.5182): 22%|βββ | 140/625 [01:03<03:20, 2.42it/s]
Training 1/1 epoch (loss 1.5182): 23%|βββ | 141/625 [01:03<03:17, 2.45it/s]
Training 1/1 epoch (loss 1.5215): 23%|βββ | 141/625 [01:04<03:17, 2.45it/s]
Training 1/1 epoch (loss 1.5215): 23%|βββ | 142/625 [01:04<03:16, 2.46it/s]
Training 1/1 epoch (loss 1.5453): 23%|βββ | 142/625 [01:04<03:16, 2.46it/s]
Training 1/1 epoch (loss 1.5453): 23%|βββ | 143/625 [01:04<03:12, 2.51it/s]
Training 1/1 epoch (loss 1.4291): 23%|βββ | 143/625 [01:05<03:12, 2.51it/s]
Training 1/1 epoch (loss 1.4291): 23%|βββ | 144/625 [01:05<03:16, 2.44it/s]
Training 1/1 epoch (loss 1.5193): 23%|βββ | 144/625 [01:05<03:16, 2.44it/s]
Training 1/1 epoch (loss 1.5193): 23%|βββ | 145/625 [01:05<03:14, 2.47it/s]
Training 1/1 epoch (loss 1.5296): 23%|βββ | 145/625 [01:05<03:14, 2.47it/s]
Training 1/1 epoch (loss 1.5296): 23%|βββ | 146/625 [01:05<03:06, 2.56it/s]
Training 1/1 epoch (loss 1.4324): 23%|βββ | 146/625 [01:06<03:06, 2.56it/s]
Training 1/1 epoch (loss 1.4324): 24%|βββ | 147/625 [01:06<03:04, 2.59it/s]
Training 1/1 epoch (loss 1.4908): 24%|βββ | 147/625 [01:06<03:04, 2.59it/s]
Training 1/1 epoch (loss 1.4908): 24%|βββ | 148/625 [01:06<03:05, 2.58it/s]
Training 1/1 epoch (loss 1.5217): 24%|βββ | 148/625 [01:07<03:05, 2.58it/s]
Training 1/1 epoch (loss 1.5217): 24%|βββ | 149/625 [01:07<03:04, 2.58it/s]
Training 1/1 epoch (loss 1.5059): 24%|βββ | 149/625 [01:07<03:04, 2.58it/s]
Training 1/1 epoch (loss 1.5059): 24%|βββ | 150/625 [01:07<03:02, 2.60it/s]
Training 1/1 epoch (loss 1.4610): 24%|βββ | 150/625 [01:07<03:02, 2.60it/s]
Training 1/1 epoch (loss 1.4610): 24%|βββ | 151/625 [01:07<02:57, 2.67it/s]
Training 1/1 epoch (loss 1.6311): 24%|βββ | 151/625 [01:08<02:57, 2.67it/s]
Training 1/1 epoch (loss 1.6311): 24%|βββ | 152/625 [01:08<03:03, 2.57it/s]
Training 1/1 epoch (loss 1.4932): 24%|βββ | 152/625 [01:08<03:03, 2.57it/s]
Training 1/1 epoch (loss 1.4932): 24%|βββ | 153/625 [01:08<03:04, 2.55it/s]
Training 1/1 epoch (loss 1.5422): 24%|βββ | 153/625 [01:09<03:04, 2.55it/s]
Training 1/1 epoch (loss 1.5422): 25%|βββ | 154/625 [01:09<03:02, 2.58it/s]
Training 1/1 epoch (loss 1.5509): 25%|βββ | 154/625 [01:09<03:02, 2.58it/s]
Training 1/1 epoch (loss 1.5509): 25%|βββ | 155/625 [01:09<03:14, 2.41it/s]
Training 1/1 epoch (loss 1.4774): 25%|βββ | 155/625 [01:09<03:14, 2.41it/s]
Training 1/1 epoch (loss 1.4774): 25%|βββ | 156/625 [01:09<03:06, 2.52it/s]
Training 1/1 epoch (loss 1.4669): 25%|βββ | 156/625 [01:10<03:06, 2.52it/s]
Training 1/1 epoch (loss 1.4669): 25%|βββ | 157/625 [01:10<03:06, 2.51it/s]
Training 1/1 epoch (loss 1.4791): 25%|βββ | 157/625 [01:10<03:06, 2.51it/s]
Training 1/1 epoch (loss 1.4791): 25%|βββ | 158/625 [01:10<03:00, 2.58it/s]
Training 1/1 epoch (loss 1.5420): 25%|βββ | 158/625 [01:11<03:00, 2.58it/s]
Training 1/1 epoch (loss 1.5420): 25%|βββ | 159/625 [01:11<02:59, 2.60it/s]
Training 1/1 epoch (loss 1.5367): 25%|βββ | 159/625 [01:11<02:59, 2.60it/s]
Training 1/1 epoch (loss 1.5367): 26%|βββ | 160/625 [01:11<03:04, 2.52it/s]
Training 1/1 epoch (loss 1.5333): 26%|βββ | 160/625 [01:11<03:04, 2.52it/s]
Training 1/1 epoch (loss 1.5333): 26%|βββ | 161/625 [01:11<02:58, 2.60it/s]
Training 1/1 epoch (loss 1.4291): 26%|βββ | 161/625 [01:12<02:58, 2.60it/s]
Training 1/1 epoch (loss 1.4291): 26%|βββ | 162/625 [01:12<02:54, 2.65it/s]
Training 1/1 epoch (loss 1.5482): 26%|βββ | 162/625 [01:12<02:54, 2.65it/s]
Training 1/1 epoch (loss 1.5482): 26%|βββ | 163/625 [01:12<03:01, 2.54it/s]
Training 1/1 epoch (loss 1.4900): 26%|βββ | 163/625 [01:12<03:01, 2.54it/s]
Training 1/1 epoch (loss 1.4900): 26%|βββ | 164/625 [01:12<03:00, 2.55it/s]
Training 1/1 epoch (loss 1.4003): 26%|βββ | 164/625 [01:13<03:00, 2.55it/s]
Training 1/1 epoch (loss 1.4003): 26%|βββ | 165/625 [01:13<03:07, 2.45it/s]
Training 1/1 epoch (loss 1.6168): 26%|βββ | 165/625 [01:13<03:07, 2.45it/s]
Training 1/1 epoch (loss 1.6168): 27%|βββ | 166/625 [01:13<03:09, 2.42it/s]
Training 1/1 epoch (loss 1.4420): 27%|βββ | 166/625 [01:14<03:09, 2.42it/s]
Training 1/1 epoch (loss 1.4420): 27%|βββ | 167/625 [01:14<03:07, 2.45it/s]
Training 1/1 epoch (loss 1.4721): 27%|βββ | 167/625 [01:14<03:07, 2.45it/s]
Training 1/1 epoch (loss 1.4721): 27%|βββ | 168/625 [01:14<03:09, 2.41it/s]
Training 1/1 epoch (loss 1.5181): 27%|βββ | 168/625 [01:15<03:09, 2.41it/s]
Training 1/1 epoch (loss 1.5181): 27%|βββ | 169/625 [01:15<03:23, 2.24it/s]
Training 1/1 epoch (loss 1.5710): 27%|βββ | 169/625 [01:15<03:23, 2.24it/s]
Training 1/1 epoch (loss 1.5710): 27%|βββ | 170/625 [01:15<03:17, 2.30it/s]
Training 1/1 epoch (loss 1.4871): 27%|βββ | 170/625 [01:15<03:17, 2.30it/s]
Training 1/1 epoch (loss 1.4871): 27%|βββ | 171/625 [01:15<03:10, 2.38it/s]
Training 1/1 epoch (loss 1.5128): 27%|βββ | 171/625 [01:16<03:10, 2.38it/s]
Training 1/1 epoch (loss 1.5128): 28%|βββ | 172/625 [01:16<03:03, 2.47it/s]
Training 1/1 epoch (loss 1.4884): 28%|βββ | 172/625 [01:16<03:03, 2.47it/s]
Training 1/1 epoch (loss 1.4884): 28%|βββ | 173/625 [01:16<02:58, 2.53it/s]
Training 1/1 epoch (loss 1.6001): 28%|βββ | 173/625 [01:17<02:58, 2.53it/s]
Training 1/1 epoch (loss 1.6001): 28%|βββ | 174/625 [01:17<02:57, 2.54it/s]
Training 1/1 epoch (loss 1.4670): 28%|βββ | 174/625 [01:17<02:57, 2.54it/s]
Training 1/1 epoch (loss 1.4670): 28%|βββ | 175/625 [01:17<02:57, 2.54it/s]
Training 1/1 epoch (loss 1.5071): 28%|βββ | 175/625 [01:18<02:57, 2.54it/s]
Training 1/1 epoch (loss 1.5071): 28%|βββ | 176/625 [01:18<03:19, 2.25it/s]
Training 1/1 epoch (loss 1.5458): 28%|βββ | 176/625 [01:18<03:19, 2.25it/s]
Training 1/1 epoch (loss 1.5458): 28%|βββ | 177/625 [01:18<03:26, 2.17it/s]
Training 1/1 epoch (loss 1.4993): 28%|βββ | 177/625 [01:19<03:26, 2.17it/s]
Training 1/1 epoch (loss 1.4993): 28%|βββ | 178/625 [01:19<03:36, 2.06it/s]
Training 1/1 epoch (loss 1.5266): 28%|βββ | 178/625 [01:19<03:36, 2.06it/s]
Training 1/1 epoch (loss 1.5266): 29%|βββ | 179/625 [01:19<03:23, 2.19it/s]
Training 1/1 epoch (loss 1.4982): 29%|βββ | 179/625 [01:19<03:23, 2.19it/s]
Training 1/1 epoch (loss 1.4982): 29%|βββ | 180/625 [01:19<03:07, 2.37it/s]
Training 1/1 epoch (loss 1.4429): 29%|βββ | 180/625 [01:20<03:07, 2.37it/s]
Training 1/1 epoch (loss 1.4429): 29%|βββ | 181/625 [01:20<03:12, 2.31it/s]
Training 1/1 epoch (loss 1.5039): 29%|βββ | 181/625 [01:20<03:12, 2.31it/s]
Training 1/1 epoch (loss 1.5039): 29%|βββ | 182/625 [01:20<03:06, 2.38it/s]
Training 1/1 epoch (loss 1.5060): 29%|βββ | 182/625 [01:21<03:06, 2.38it/s]
Training 1/1 epoch (loss 1.5060): 29%|βββ | 183/625 [01:21<03:05, 2.39it/s]
Training 1/1 epoch (loss 1.4411): 29%|βββ | 183/625 [01:21<03:05, 2.39it/s]
Training 1/1 epoch (loss 1.4411): 29%|βββ | 184/625 [01:21<03:07, 2.35it/s]
Training 1/1 epoch (loss 1.4669): 29%|βββ | 184/625 [01:21<03:07, 2.35it/s]
Training 1/1 epoch (loss 1.4669): 30%|βββ | 185/625 [01:21<02:57, 2.48it/s]
Training 1/1 epoch (loss 1.4923): 30%|βββ | 185/625 [01:22<02:57, 2.48it/s]
Training 1/1 epoch (loss 1.4923): 30%|βββ | 186/625 [01:22<02:51, 2.55it/s]
Training 1/1 epoch (loss 1.5038): 30%|βββ | 186/625 [01:22<02:51, 2.55it/s]
Training 1/1 epoch (loss 1.5038): 30%|βββ | 187/625 [01:22<02:48, 2.60it/s]
Training 1/1 epoch (loss 1.5332): 30%|βββ | 187/625 [01:23<02:48, 2.60it/s]
Training 1/1 epoch (loss 1.5332): 30%|βββ | 188/625 [01:23<02:51, 2.54it/s]
Training 1/1 epoch (loss 1.5267): 30%|βββ | 188/625 [01:23<02:51, 2.54it/s]
Training 1/1 epoch (loss 1.5267): 30%|βββ | 189/625 [01:23<02:50, 2.55it/s]
Training 1/1 epoch (loss 1.5391): 30%|βββ | 189/625 [01:23<02:50, 2.55it/s]
Training 1/1 epoch (loss 1.5391): 30%|βββ | 190/625 [01:23<02:49, 2.57it/s]
Training 1/1 epoch (loss 1.5961): 30%|βββ | 190/625 [01:24<02:49, 2.57it/s]
Training 1/1 epoch (loss 1.5961): 31%|βββ | 191/625 [01:24<02:46, 2.61it/s]
Training 1/1 epoch (loss 1.5961): 31%|βββ | 191/625 [01:24<02:46, 2.61it/s]
Training 1/1 epoch (loss 1.5961): 31%|βββ | 192/625 [01:24<02:57, 2.44it/s]
Training 1/1 epoch (loss 1.5058): 31%|βββ | 192/625 [01:25<02:57, 2.44it/s]
Training 1/1 epoch (loss 1.5058): 31%|βββ | 193/625 [01:25<02:52, 2.51it/s]
Training 1/1 epoch (loss 1.4693): 31%|βββ | 193/625 [01:25<02:52, 2.51it/s]
Training 1/1 epoch (loss 1.4693): 31%|βββ | 194/625 [01:25<02:52, 2.50it/s]
Training 1/1 epoch (loss 1.5073): 31%|βββ | 194/625 [01:25<02:52, 2.50it/s]
Training 1/1 epoch (loss 1.5073): 31%|βββ | 195/625 [01:25<02:51, 2.50it/s]
Training 1/1 epoch (loss 1.6337): 31%|βββ | 195/625 [01:26<02:51, 2.50it/s]
Training 1/1 epoch (loss 1.6337): 31%|ββββ | 196/625 [01:26<02:45, 2.60it/s]
Training 1/1 epoch (loss 1.5163): 31%|ββββ | 196/625 [01:26<02:45, 2.60it/s]
Training 1/1 epoch (loss 1.5163): 32%|ββββ | 197/625 [01:26<02:45, 2.58it/s]
Training 1/1 epoch (loss 1.3197): 32%|ββββ | 197/625 [01:27<02:45, 2.58it/s]
Training 1/1 epoch (loss 1.3197): 32%|ββββ | 198/625 [01:27<02:52, 2.47it/s]
Training 1/1 epoch (loss 1.4320): 32%|ββββ | 198/625 [01:27<02:52, 2.47it/s]
Training 1/1 epoch (loss 1.4320): 32%|ββββ | 199/625 [01:27<02:48, 2.52it/s]
Training 1/1 epoch (loss 1.6854): 32%|ββββ | 199/625 [01:27<02:48, 2.52it/s]
Training 1/1 epoch (loss 1.6854): 32%|ββββ | 200/625 [01:27<02:51, 2.48it/s]
Training 1/1 epoch (loss 1.6034): 32%|ββββ | 200/625 [01:28<02:51, 2.48it/s]
Training 1/1 epoch (loss 1.6034): 32%|ββββ | 201/625 [01:28<02:44, 2.57it/s]
Training 1/1 epoch (loss 1.4560): 32%|ββββ | 201/625 [01:28<02:44, 2.57it/s]
Training 1/1 epoch (loss 1.4560): 32%|ββββ | 202/625 [01:28<02:48, 2.51it/s]
Training 1/1 epoch (loss 1.5919): 32%|ββββ | 202/625 [01:29<02:48, 2.51it/s]
Training 1/1 epoch (loss 1.5919): 32%|ββββ | 203/625 [01:29<02:52, 2.45it/s]
Training 1/1 epoch (loss 1.5472): 32%|ββββ | 203/625 [01:29<02:52, 2.45it/s]
Training 1/1 epoch (loss 1.5472): 33%|ββββ | 204/625 [01:29<02:51, 2.46it/s]
Training 1/1 epoch (loss 1.5072): 33%|ββββ | 204/625 [01:29<02:51, 2.46it/s]
Training 1/1 epoch (loss 1.5072): 33%|ββββ | 205/625 [01:29<02:47, 2.51it/s]
Training 1/1 epoch (loss 1.4462): 33%|ββββ | 205/625 [01:30<02:47, 2.51it/s]
Training 1/1 epoch (loss 1.4462): 33%|ββββ | 206/625 [01:30<02:46, 2.51it/s]
Training 1/1 epoch (loss 1.6021): 33%|ββββ | 206/625 [01:30<02:46, 2.51it/s]
Training 1/1 epoch (loss 1.6021): 33%|ββββ | 207/625 [01:30<02:59, 2.33it/s]
Training 1/1 epoch (loss 1.6002): 33%|ββββ | 207/625 [01:31<02:59, 2.33it/s]
Training 1/1 epoch (loss 1.6002): 33%|ββββ | 208/625 [01:31<02:57, 2.35it/s]
Training 1/1 epoch (loss 1.5239): 33%|ββββ | 208/625 [01:31<02:57, 2.35it/s]
Training 1/1 epoch (loss 1.5239): 33%|ββββ | 209/625 [01:31<02:56, 2.36it/s]
Training 1/1 epoch (loss 1.6070): 33%|ββββ | 209/625 [01:31<02:56, 2.36it/s]
Training 1/1 epoch (loss 1.6070): 34%|ββββ | 210/625 [01:31<02:46, 2.50it/s]
Training 1/1 epoch (loss 1.4662): 34%|ββββ | 210/625 [01:32<02:46, 2.50it/s]
Training 1/1 epoch (loss 1.4662): 34%|ββββ | 211/625 [01:32<02:40, 2.57it/s]
Training 1/1 epoch (loss 1.5439): 34%|ββββ | 211/625 [01:32<02:40, 2.57it/s]
Training 1/1 epoch (loss 1.5439): 34%|ββββ | 212/625 [01:32<02:40, 2.58it/s]
Training 1/1 epoch (loss 1.5120): 34%|ββββ | 212/625 [01:33<02:40, 2.58it/s]
Training 1/1 epoch (loss 1.5120): 34%|ββββ | 213/625 [01:33<02:42, 2.54it/s]
Training 1/1 epoch (loss 1.5135): 34%|ββββ | 213/625 [01:33<02:42, 2.54it/s]
Training 1/1 epoch (loss 1.5135): 34%|ββββ | 214/625 [01:33<02:43, 2.52it/s]
Training 1/1 epoch (loss 1.5384): 34%|ββββ | 214/625 [01:33<02:43, 2.52it/s]
Training 1/1 epoch (loss 1.5384): 34%|ββββ | 215/625 [01:33<02:41, 2.55it/s]
Training 1/1 epoch (loss 1.5841): 34%|ββββ | 215/625 [01:34<02:41, 2.55it/s]
Training 1/1 epoch (loss 1.5841): 35%|ββββ | 216/625 [01:34<02:42, 2.52it/s]
Training 1/1 epoch (loss 1.4363): 35%|ββββ | 216/625 [01:34<02:42, 2.52it/s]
Training 1/1 epoch (loss 1.4363): 35%|ββββ | 217/625 [01:34<02:45, 2.46it/s]
Training 1/1 epoch (loss 1.5278): 35%|ββββ | 217/625 [01:35<02:45, 2.46it/s]
Training 1/1 epoch (loss 1.5278): 35%|ββββ | 218/625 [01:35<02:53, 2.35it/s]
Training 1/1 epoch (loss 1.5331): 35%|ββββ | 218/625 [01:35<02:53, 2.35it/s]
Training 1/1 epoch (loss 1.5331): 35%|ββββ | 219/625 [01:35<02:47, 2.42it/s]
Training 1/1 epoch (loss 1.4774): 35%|ββββ | 219/625 [01:35<02:47, 2.42it/s]
Training 1/1 epoch (loss 1.4774): 35%|ββββ | 220/625 [01:35<02:40, 2.52it/s]
Training 1/1 epoch (loss 1.4442): 35%|ββββ | 220/625 [01:36<02:40, 2.52it/s]
Training 1/1 epoch (loss 1.4442): 35%|ββββ | 221/625 [01:36<02:36, 2.58it/s]
Training 1/1 epoch (loss 1.5551): 35%|ββββ | 221/625 [01:36<02:36, 2.58it/s]
Training 1/1 epoch (loss 1.5551): 36%|ββββ | 222/625 [01:36<02:34, 2.60it/s]
Training 1/1 epoch (loss 1.5252): 36%|ββββ | 222/625 [01:37<02:34, 2.60it/s]
Training 1/1 epoch (loss 1.5252): 36%|ββββ | 223/625 [01:37<02:36, 2.58it/s]
Training 1/1 epoch (loss 1.3923): 36%|ββββ | 223/625 [01:37<02:36, 2.58it/s]
Training 1/1 epoch (loss 1.3923): 36%|ββββ | 224/625 [01:37<02:41, 2.49it/s]
Training 1/1 epoch (loss 1.4565): 36%|ββββ | 224/625 [01:37<02:41, 2.49it/s]
Training 1/1 epoch (loss 1.4565): 36%|ββββ | 225/625 [01:37<02:34, 2.59it/s]
Training 1/1 epoch (loss 1.6296): 36%|ββββ | 225/625 [01:38<02:34, 2.59it/s]
Training 1/1 epoch (loss 1.6296): 36%|ββββ | 226/625 [01:38<02:29, 2.67it/s]
Training 1/1 epoch (loss 1.3853): 36%|ββββ | 226/625 [01:38<02:29, 2.67it/s]
Training 1/1 epoch (loss 1.3853): 36%|ββββ | 227/625 [01:38<02:35, 2.56it/s]
Training 1/1 epoch (loss 1.4992): 36%|ββββ | 227/625 [01:39<02:35, 2.56it/s]
Training 1/1 epoch (loss 1.4992): 36%|ββββ | 228/625 [01:39<02:42, 2.45it/s]
Training 1/1 epoch (loss 1.4387): 36%|ββββ | 228/625 [01:39<02:42, 2.45it/s]
Training 1/1 epoch (loss 1.4387): 37%|ββββ | 229/625 [01:39<02:45, 2.39it/s]
Training 1/1 epoch (loss 1.4937): 37%|ββββ | 229/625 [01:39<02:45, 2.39it/s]
Training 1/1 epoch (loss 1.4937): 37%|ββββ | 230/625 [01:39<02:37, 2.51it/s]
Training 1/1 epoch (loss 1.4792): 37%|ββββ | 230/625 [01:40<02:37, 2.51it/s]
Training 1/1 epoch (loss 1.4792): 37%|ββββ | 231/625 [01:40<02:33, 2.57it/s]
Training 1/1 epoch (loss 1.6404): 37%|ββββ | 231/625 [01:40<02:33, 2.57it/s]
Training 1/1 epoch (loss 1.6404): 37%|ββββ | 232/625 [01:40<02:40, 2.45it/s]
Training 1/1 epoch (loss 1.4880): 37%|ββββ | 232/625 [01:41<02:40, 2.45it/s]
Training 1/1 epoch (loss 1.4880): 37%|ββββ | 233/625 [01:41<02:42, 2.42it/s]
Training 1/1 epoch (loss 1.4203): 37%|ββββ | 233/625 [01:41<02:42, 2.42it/s]
Training 1/1 epoch (loss 1.4203): 37%|ββββ | 234/625 [01:41<02:42, 2.41it/s]
Training 1/1 epoch (loss 1.5266): 37%|ββββ | 234/625 [01:41<02:42, 2.41it/s]
Training 1/1 epoch (loss 1.5266): 38%|ββββ | 235/625 [01:41<02:35, 2.51it/s]
Training 1/1 epoch (loss 1.4095): 38%|ββββ | 235/625 [01:42<02:35, 2.51it/s]
Training 1/1 epoch (loss 1.4095): 38%|ββββ | 236/625 [01:42<02:32, 2.56it/s]
Training 1/1 epoch (loss 1.4800): 38%|ββββ | 236/625 [01:42<02:32, 2.56it/s]
Training 1/1 epoch (loss 1.4800): 38%|ββββ | 237/625 [01:42<02:28, 2.61it/s]
Training 1/1 epoch (loss 1.4607): 38%|ββββ | 237/625 [01:42<02:28, 2.61it/s]
Training 1/1 epoch (loss 1.4607): 38%|ββββ | 238/625 [01:42<02:30, 2.57it/s]
Training 1/1 epoch (loss 1.5606): 38%|ββββ | 238/625 [01:43<02:30, 2.57it/s]
Training 1/1 epoch (loss 1.5606): 38%|ββββ | 239/625 [01:43<02:31, 2.54it/s]
Training 1/1 epoch (loss 1.5172): 38%|ββββ | 239/625 [01:43<02:31, 2.54it/s]
Training 1/1 epoch (loss 1.5172): 38%|ββββ | 240/625 [01:43<02:36, 2.47it/s]
Training 1/1 epoch (loss 1.5723): 38%|ββββ | 240/625 [01:44<02:36, 2.47it/s]
Training 1/1 epoch (loss 1.5723): 39%|ββββ | 241/625 [01:44<02:34, 2.48it/s]
Training 1/1 epoch (loss 1.4518): 39%|ββββ | 241/625 [01:44<02:34, 2.48it/s]
Training 1/1 epoch (loss 1.4518): 39%|ββββ | 242/625 [01:44<02:32, 2.50it/s]
Training 1/1 epoch (loss 1.4827): 39%|ββββ | 242/625 [01:45<02:32, 2.50it/s]
Training 1/1 epoch (loss 1.4827): 39%|ββββ | 243/625 [01:45<02:35, 2.45it/s]
Training 1/1 epoch (loss 1.5401): 39%|ββββ | 243/625 [01:45<02:35, 2.45it/s]
Training 1/1 epoch (loss 1.5401): 39%|ββββ | 244/625 [01:45<02:37, 2.42it/s]
Training 1/1 epoch (loss 1.4326): 39%|ββββ | 244/625 [01:45<02:37, 2.42it/s]
Training 1/1 epoch (loss 1.4326): 39%|ββββ | 245/625 [01:45<02:31, 2.51it/s]
Training 1/1 epoch (loss 1.5162): 39%|ββββ | 245/625 [01:46<02:31, 2.51it/s]
Training 1/1 epoch (loss 1.5162): 39%|ββββ | 246/625 [01:46<02:27, 2.57it/s]
Training 1/1 epoch (loss 1.4917): 39%|ββββ | 246/625 [01:46<02:27, 2.57it/s]
Training 1/1 epoch (loss 1.4917): 40%|ββββ | 247/625 [01:46<02:23, 2.63it/s]
Training 1/1 epoch (loss 1.5841): 40%|ββββ | 247/625 [01:46<02:23, 2.63it/s]
Training 1/1 epoch (loss 1.5841): 40%|ββββ | 248/625 [01:46<02:28, 2.54it/s]
Training 1/1 epoch (loss 1.5114): 40%|ββββ | 248/625 [01:47<02:28, 2.54it/s]
Training 1/1 epoch (loss 1.5114): 40%|ββββ | 249/625 [01:47<02:32, 2.46it/s]
Training 1/1 epoch (loss 1.5620): 40%|ββββ | 249/625 [01:47<02:32, 2.46it/s]
Training 1/1 epoch (loss 1.5620): 40%|ββββ | 250/625 [01:47<02:34, 2.43it/s]
Training 1/1 epoch (loss 1.5526): 40%|ββββ | 250/625 [01:48<02:34, 2.43it/s]
Training 1/1 epoch (loss 1.5526): 40%|ββββ | 251/625 [01:48<02:54, 2.14it/s]
Training 1/1 epoch (loss 1.4828): 40%|ββββ | 251/625 [01:48<02:54, 2.14it/s]
Training 1/1 epoch (loss 1.4828): 40%|ββββ | 252/625 [01:48<02:49, 2.20it/s]
Training 1/1 epoch (loss 1.4726): 40%|ββββ | 252/625 [01:49<02:49, 2.20it/s]
Training 1/1 epoch (loss 1.4726): 40%|ββββ | 253/625 [01:49<02:45, 2.24it/s]
Training 1/1 epoch (loss 1.4668): 40%|ββββ | 253/625 [01:49<02:45, 2.24it/s]
Training 1/1 epoch (loss 1.4668): 41%|ββββ | 254/625 [01:49<02:38, 2.34it/s]
Training 1/1 epoch (loss 1.4896): 41%|ββββ | 254/625 [01:50<02:38, 2.34it/s]
Training 1/1 epoch (loss 1.4896): 41%|ββββ | 255/625 [01:50<02:34, 2.39it/s]
Training 1/1 epoch (loss 1.4607): 41%|ββββ | 255/625 [01:50<02:34, 2.39it/s]
Training 1/1 epoch (loss 1.4607): 41%|ββββ | 256/625 [01:50<02:31, 2.43it/s]
Training 1/1 epoch (loss 1.4393): 41%|ββββ | 256/625 [01:50<02:31, 2.43it/s]
Training 1/1 epoch (loss 1.4393): 41%|ββββ | 257/625 [01:50<02:39, 2.31it/s]
Training 1/1 epoch (loss 1.3780): 41%|ββββ | 257/625 [01:51<02:39, 2.31it/s]
Training 1/1 epoch (loss 1.3780): 41%|βββββ | 258/625 [01:51<02:37, 2.34it/s]
Training 1/1 epoch (loss 1.6029): 41%|βββββ | 258/625 [01:51<02:37, 2.34it/s]
Training 1/1 epoch (loss 1.6029): 41%|βββββ | 259/625 [01:51<02:31, 2.42it/s]
Training 1/1 epoch (loss 1.4698): 41%|βββββ | 259/625 [01:52<02:31, 2.42it/s]
Training 1/1 epoch (loss 1.4698): 42%|βββββ | 260/625 [01:52<02:24, 2.52it/s]
Training 1/1 epoch (loss 1.5323): 42%|βββββ | 260/625 [01:52<02:24, 2.52it/s]
Training 1/1 epoch (loss 1.5323): 42%|βββββ | 261/625 [01:52<02:23, 2.54it/s]
Training 1/1 epoch (loss 1.4729): 42%|βββββ | 261/625 [01:52<02:23, 2.54it/s]
Training 1/1 epoch (loss 1.4729): 42%|βββββ | 262/625 [01:52<02:21, 2.57it/s]
Training 1/1 epoch (loss 1.5105): 42%|βββββ | 262/625 [01:53<02:21, 2.57it/s]
Training 1/1 epoch (loss 1.5105): 42%|βββββ | 263/625 [01:53<02:23, 2.52it/s]
Training 1/1 epoch (loss 1.5217): 42%|βββββ | 263/625 [01:53<02:23, 2.52it/s]
Training 1/1 epoch (loss 1.5217): 42%|βββββ | 264/625 [01:53<02:29, 2.42it/s]
Training 1/1 epoch (loss 1.4718): 42%|βββββ | 264/625 [01:54<02:29, 2.42it/s]
Training 1/1 epoch (loss 1.4718): 42%|βββββ | 265/625 [01:54<02:25, 2.47it/s]
Training 1/1 epoch (loss 1.4982): 42%|βββββ | 265/625 [01:54<02:25, 2.47it/s]
Training 1/1 epoch (loss 1.4982): 43%|βββββ | 266/625 [01:54<02:22, 2.52it/s]
Training 1/1 epoch (loss 1.4600): 43%|βββββ | 266/625 [01:54<02:22, 2.52it/s]
Training 1/1 epoch (loss 1.4600): 43%|βββββ | 267/625 [01:54<02:22, 2.51it/s]
Training 1/1 epoch (loss 1.3519): 43%|βββββ | 267/625 [01:55<02:22, 2.51it/s]
Training 1/1 epoch (loss 1.3519): 43%|βββββ | 268/625 [01:55<02:20, 2.54it/s]
Training 1/1 epoch (loss 1.5043): 43%|βββββ | 268/625 [01:55<02:20, 2.54it/s]
Training 1/1 epoch (loss 1.5043): 43%|βββββ | 269/625 [01:55<02:24, 2.46it/s]
Training 1/1 epoch (loss 1.4712): 43%|βββββ | 269/625 [01:56<02:24, 2.46it/s]
Training 1/1 epoch (loss 1.4712): 43%|βββββ | 270/625 [01:56<02:25, 2.44it/s]
Training 1/1 epoch (loss 1.5517): 43%|βββββ | 270/625 [01:56<02:25, 2.44it/s]
Training 1/1 epoch (loss 1.5517): 43%|βββββ | 271/625 [01:56<02:18, 2.56it/s]
Training 1/1 epoch (loss 1.4325): 43%|βββββ | 271/625 [01:56<02:18, 2.56it/s]
Training 1/1 epoch (loss 1.4325): 44%|βββββ | 272/625 [01:56<02:19, 2.52it/s]
Training 1/1 epoch (loss 1.5552): 44%|βββββ | 272/625 [01:57<02:19, 2.52it/s]
Training 1/1 epoch (loss 1.5552): 44%|βββββ | 273/625 [01:57<02:17, 2.56it/s]
Training 1/1 epoch (loss 1.4782): 44%|βββββ | 273/625 [01:57<02:17, 2.56it/s]
Training 1/1 epoch (loss 1.4782): 44%|βββββ | 274/625 [01:57<02:19, 2.52it/s]
Training 1/1 epoch (loss 1.5099): 44%|βββββ | 274/625 [01:58<02:19, 2.52it/s]
Training 1/1 epoch (loss 1.5099): 44%|βββββ | 275/625 [01:58<02:13, 2.61it/s]
Training 1/1 epoch (loss 1.4225): 44%|βββββ | 275/625 [01:58<02:13, 2.61it/s]
Training 1/1 epoch (loss 1.4225): 44%|βββββ | 276/625 [01:58<02:11, 2.65it/s]
Training 1/1 epoch (loss 1.6312): 44%|βββββ | 276/625 [01:58<02:11, 2.65it/s]
Training 1/1 epoch (loss 1.6312): 44%|βββββ | 277/625 [01:58<02:11, 2.66it/s]
Training 1/1 epoch (loss 1.5230): 44%|βββββ | 277/625 [01:59<02:11, 2.66it/s]
Training 1/1 epoch (loss 1.5230): 44%|βββββ | 278/625 [01:59<02:09, 2.67it/s]
Training 1/1 epoch (loss 1.4297): 44%|βββββ | 278/625 [01:59<02:09, 2.67it/s]
Training 1/1 epoch (loss 1.4297): 45%|βββββ | 279/625 [01:59<02:11, 2.63it/s]
Training 1/1 epoch (loss 1.5498): 45%|βββββ | 279/625 [01:59<02:11, 2.63it/s]
Training 1/1 epoch (loss 1.5498): 45%|βββββ | 280/625 [01:59<02:10, 2.65it/s]
Training 1/1 epoch (loss 1.5744): 45%|βββββ | 280/625 [02:00<02:10, 2.65it/s]
Training 1/1 epoch (loss 1.5744): 45%|βββββ | 281/625 [02:00<02:07, 2.69it/s]
Training 1/1 epoch (loss 1.4359): 45%|βββββ | 281/625 [02:00<02:07, 2.69it/s]
Training 1/1 epoch (loss 1.4359): 45%|βββββ | 282/625 [02:00<02:13, 2.56it/s]
Training 1/1 epoch (loss 1.4746): 45%|βββββ | 282/625 [02:01<02:13, 2.56it/s]
Training 1/1 epoch (loss 1.4746): 45%|βββββ | 283/625 [02:01<02:15, 2.53it/s]
Training 1/1 epoch (loss 1.5358): 45%|βββββ | 283/625 [02:01<02:15, 2.53it/s]
Training 1/1 epoch (loss 1.5358): 45%|βββββ | 284/625 [02:01<02:24, 2.37it/s]
Training 1/1 epoch (loss 1.4825): 45%|βββββ | 284/625 [02:01<02:24, 2.37it/s]
Training 1/1 epoch (loss 1.4825): 46%|βββββ | 285/625 [02:01<02:18, 2.46it/s]
Training 1/1 epoch (loss 1.4520): 46%|βββββ | 285/625 [02:02<02:18, 2.46it/s]
Training 1/1 epoch (loss 1.4520): 46%|βββββ | 286/625 [02:02<02:12, 2.55it/s]
Training 1/1 epoch (loss 1.3451): 46%|βββββ | 286/625 [02:02<02:12, 2.55it/s]
Training 1/1 epoch (loss 1.3451): 46%|βββββ | 287/625 [02:02<02:13, 2.53it/s]
Training 1/1 epoch (loss 1.3708): 46%|βββββ | 287/625 [02:03<02:13, 2.53it/s]
Training 1/1 epoch (loss 1.3708): 46%|βββββ | 288/625 [02:03<02:17, 2.45it/s]
Training 1/1 epoch (loss 1.4415): 46%|βββββ | 288/625 [02:03<02:17, 2.45it/s]
Training 1/1 epoch (loss 1.4415): 46%|βββββ | 289/625 [02:03<02:19, 2.41it/s]
Training 1/1 epoch (loss 1.4836): 46%|βββββ | 289/625 [02:03<02:19, 2.41it/s]
Training 1/1 epoch (loss 1.4836): 46%|βββββ | 290/625 [02:03<02:16, 2.46it/s]
Training 1/1 epoch (loss 1.4766): 46%|βββββ | 290/625 [02:04<02:16, 2.46it/s]
Training 1/1 epoch (loss 1.4766): 47%|βββββ | 291/625 [02:04<02:12, 2.52it/s]
Training 1/1 epoch (loss 1.4795): 47%|βββββ | 291/625 [02:04<02:12, 2.52it/s]
Training 1/1 epoch (loss 1.4795): 47%|βββββ | 292/625 [02:04<02:10, 2.55it/s]
Training 1/1 epoch (loss 1.5177): 47%|βββββ | 292/625 [02:05<02:10, 2.55it/s]
Training 1/1 epoch (loss 1.5177): 47%|βββββ | 293/625 [02:05<02:10, 2.55it/s]
Training 1/1 epoch (loss 1.3925): 47%|βββββ | 293/625 [02:05<02:10, 2.55it/s]
Training 1/1 epoch (loss 1.3925): 47%|βββββ | 294/625 [02:05<02:11, 2.52it/s]
Training 1/1 epoch (loss 1.5103): 47%|βββββ | 294/625 [02:05<02:11, 2.52it/s]
Training 1/1 epoch (loss 1.5103): 47%|βββββ | 295/625 [02:05<02:08, 2.58it/s]
Training 1/1 epoch (loss 1.4626): 47%|βββββ | 295/625 [02:06<02:08, 2.58it/s]
Training 1/1 epoch (loss 1.4626): 47%|βββββ | 296/625 [02:06<02:07, 2.59it/s]
Training 1/1 epoch (loss 1.4053): 47%|βββββ | 296/625 [02:06<02:07, 2.59it/s]
Training 1/1 epoch (loss 1.4053): 48%|βββββ | 297/625 [02:06<02:13, 2.46it/s]
Training 1/1 epoch (loss 1.5007): 48%|βββββ | 297/625 [02:07<02:13, 2.46it/s]
Training 1/1 epoch (loss 1.5007): 48%|βββββ | 298/625 [02:07<02:09, 2.53it/s]
Training 1/1 epoch (loss 1.5221): 48%|βββββ | 298/625 [02:07<02:09, 2.53it/s]
Training 1/1 epoch (loss 1.5221): 48%|βββββ | 299/625 [02:07<02:15, 2.40it/s]
Training 1/1 epoch (loss 1.4758): 48%|βββββ | 299/625 [02:07<02:15, 2.40it/s]
Training 1/1 epoch (loss 1.4758): 48%|βββββ | 300/625 [02:07<02:10, 2.49it/s]
Training 1/1 epoch (loss 1.4315): 48%|βββββ | 300/625 [02:08<02:10, 2.49it/s]
Training 1/1 epoch (loss 1.4315): 48%|βββββ | 301/625 [02:08<02:04, 2.60it/s]
Training 1/1 epoch (loss 1.4748): 48%|βββββ | 301/625 [02:08<02:04, 2.60it/s]
Training 1/1 epoch (loss 1.4748): 48%|βββββ | 302/625 [02:08<02:04, 2.59it/s]
Training 1/1 epoch (loss 1.4651): 48%|βββββ | 302/625 [02:09<02:04, 2.59it/s]
Training 1/1 epoch (loss 1.4651): 48%|βββββ | 303/625 [02:09<02:05, 2.57it/s]
Training 1/1 epoch (loss 1.4439): 48%|βββββ | 303/625 [02:09<02:05, 2.57it/s]
Training 1/1 epoch (loss 1.4439): 49%|βββββ | 304/625 [02:09<02:09, 2.48it/s]
Training 1/1 epoch (loss 1.4381): 49%|βββββ | 304/625 [02:09<02:09, 2.48it/s]
Training 1/1 epoch (loss 1.4381): 49%|βββββ | 305/625 [02:09<02:09, 2.47it/s]
Training 1/1 epoch (loss 1.5293): 49%|βββββ | 305/625 [02:10<02:09, 2.47it/s]
Training 1/1 epoch (loss 1.5293): 49%|βββββ | 306/625 [02:10<02:04, 2.56it/s]
Training 1/1 epoch (loss 1.4096): 49%|βββββ | 306/625 [02:10<02:04, 2.56it/s]
Training 1/1 epoch (loss 1.4096): 49%|βββββ | 307/625 [02:10<01:59, 2.66it/s]
Training 1/1 epoch (loss 1.4584): 49%|βββββ | 307/625 [02:10<01:59, 2.66it/s]
Training 1/1 epoch (loss 1.4584): 49%|βββββ | 308/625 [02:10<02:00, 2.63it/s]
Training 1/1 epoch (loss 1.5291): 49%|βββββ | 308/625 [02:11<02:00, 2.63it/s]
Training 1/1 epoch (loss 1.5291): 49%|βββββ | 309/625 [02:11<02:00, 2.62it/s]
Training 1/1 epoch (loss 1.4581): 49%|βββββ | 309/625 [02:11<02:00, 2.62it/s]
Training 1/1 epoch (loss 1.4581): 50%|βββββ | 310/625 [02:11<02:11, 2.40it/s]
Training 1/1 epoch (loss 1.3882): 50%|βββββ | 310/625 [02:12<02:11, 2.40it/s]
Training 1/1 epoch (loss 1.3882): 50%|βββββ | 311/625 [02:12<02:05, 2.51it/s]
Training 1/1 epoch (loss 1.5057): 50%|βββββ | 311/625 [02:12<02:05, 2.51it/s]
Training 1/1 epoch (loss 1.5057): 50%|βββββ | 312/625 [02:12<02:02, 2.55it/s]
Training 1/1 epoch (loss 1.3009): 50%|βββββ | 312/625 [02:13<02:02, 2.55it/s]
Training 1/1 epoch (loss 1.3009): 50%|βββββ | 313/625 [02:13<02:03, 2.52it/s]
Training 1/1 epoch (loss 1.4903): 50%|βββββ | 313/625 [02:13<02:03, 2.52it/s]
Training 1/1 epoch (loss 1.4903): 50%|βββββ | 314/625 [02:13<02:06, 2.45it/s]
Training 1/1 epoch (loss 1.3044): 50%|βββββ | 314/625 [02:13<02:06, 2.45it/s]
Training 1/1 epoch (loss 1.3044): 50%|βββββ | 315/625 [02:13<02:11, 2.36it/s]
Training 1/1 epoch (loss 1.3753): 50%|βββββ | 315/625 [02:14<02:11, 2.36it/s]
Training 1/1 epoch (loss 1.3753): 51%|βββββ | 316/625 [02:14<02:04, 2.49it/s]
Training 1/1 epoch (loss 1.4724): 51%|βββββ | 316/625 [02:14<02:04, 2.49it/s]
Training 1/1 epoch (loss 1.4724): 51%|βββββ | 317/625 [02:14<02:04, 2.48it/s]
Training 1/1 epoch (loss 1.4835): 51%|βββββ | 317/625 [02:15<02:04, 2.48it/s]
Training 1/1 epoch (loss 1.4835): 51%|βββββ | 318/625 [02:15<02:04, 2.47it/s]
Training 1/1 epoch (loss 1.4005): 51%|βββββ | 318/625 [02:15<02:04, 2.47it/s]
Training 1/1 epoch (loss 1.4005): 51%|βββββ | 319/625 [02:15<02:02, 2.50it/s]
Training 1/1 epoch (loss 1.4327): 51%|βββββ | 319/625 [02:15<02:02, 2.50it/s]
Training 1/1 epoch (loss 1.4327): 51%|βββββ | 320/625 [02:15<02:00, 2.53it/s]
Training 1/1 epoch (loss 1.4760): 51%|βββββ | 320/625 [02:16<02:00, 2.53it/s]
Training 1/1 epoch (loss 1.4760): 51%|ββββββ | 321/625 [02:16<01:58, 2.57it/s]
Training 1/1 epoch (loss 1.3856): 51%|ββββββ | 321/625 [02:16<01:58, 2.57it/s]
Training 1/1 epoch (loss 1.3856): 52%|ββββββ | 322/625 [02:16<01:56, 2.60it/s]
Training 1/1 epoch (loss 1.4851): 52%|ββββββ | 322/625 [02:17<01:56, 2.60it/s]
Training 1/1 epoch (loss 1.4851): 52%|ββββββ | 323/625 [02:17<01:56, 2.58it/s]
Training 1/1 epoch (loss 1.3985): 52%|ββββββ | 323/625 [02:17<01:56, 2.58it/s]
Training 1/1 epoch (loss 1.3985): 52%|ββββββ | 324/625 [02:17<01:59, 2.51it/s]
Training 1/1 epoch (loss 1.4802): 52%|ββββββ | 324/625 [02:17<01:59, 2.51it/s]
Training 1/1 epoch (loss 1.4802): 52%|ββββββ | 325/625 [02:17<01:54, 2.61it/s]
Training 1/1 epoch (loss 1.4816): 52%|ββββββ | 325/625 [02:18<01:54, 2.61it/s]
Training 1/1 epoch (loss 1.4816): 52%|ββββββ | 326/625 [02:18<02:31, 1.98it/s]
Training 1/1 epoch (loss 1.4360): 52%|ββββββ | 326/625 [02:19<02:31, 1.98it/s]
Training 1/1 epoch (loss 1.4360): 52%|ββββββ | 327/625 [02:19<02:26, 2.04it/s]
Training 1/1 epoch (loss 1.4153): 52%|ββββββ | 327/625 [02:19<02:26, 2.04it/s]
Training 1/1 epoch (loss 1.4153): 52%|ββββββ | 328/625 [02:19<02:22, 2.08it/s]
Training 1/1 epoch (loss 1.4889): 52%|ββββββ | 328/625 [02:19<02:22, 2.08it/s]
Training 1/1 epoch (loss 1.4889): 53%|ββββββ | 329/625 [02:19<02:13, 2.21it/s]
Training 1/1 epoch (loss 1.4248): 53%|ββββββ | 329/625 [02:20<02:13, 2.21it/s]
Training 1/1 epoch (loss 1.4248): 53%|ββββββ | 330/625 [02:20<02:06, 2.34it/s]
Training 1/1 epoch (loss 1.3870): 53%|ββββββ | 330/625 [02:20<02:06, 2.34it/s]
Training 1/1 epoch (loss 1.3870): 53%|ββββββ | 331/625 [02:20<02:02, 2.39it/s]
Training 1/1 epoch (loss 1.4496): 53%|ββββββ | 331/625 [02:20<02:02, 2.39it/s]
Training 1/1 epoch (loss 1.4496): 53%|ββββββ | 332/625 [02:20<01:57, 2.48it/s]
Training 1/1 epoch (loss 1.5437): 53%|ββββββ | 332/625 [02:21<01:57, 2.48it/s]
Training 1/1 epoch (loss 1.5437): 53%|ββββββ | 333/625 [02:21<01:57, 2.48it/s]
Training 1/1 epoch (loss 1.6237): 53%|ββββββ | 333/625 [02:21<01:57, 2.48it/s]
Training 1/1 epoch (loss 1.6237): 53%|ββββββ | 334/625 [02:21<01:54, 2.53it/s]
Training 1/1 epoch (loss 1.4348): 53%|ββββββ | 334/625 [02:22<01:54, 2.53it/s]
Training 1/1 epoch (loss 1.4348): 54%|ββββββ | 335/625 [02:22<01:58, 2.44it/s]
Training 1/1 epoch (loss 1.4704): 54%|ββββββ | 335/625 [02:22<01:58, 2.44it/s]
Training 1/1 epoch (loss 1.4704): 54%|ββββββ | 336/625 [02:22<01:59, 2.42it/s]
Training 1/1 epoch (loss 1.6021): 54%|ββββββ | 336/625 [02:23<01:59, 2.42it/s]
Training 1/1 epoch (loss 1.6021): 54%|ββββββ | 337/625 [02:23<01:58, 2.44it/s]
Training 1/1 epoch (loss 1.4535): 54%|ββββββ | 337/625 [02:24<01:58, 2.44it/s]
Training 1/1 epoch (loss 1.4535): 54%|ββββββ | 338/625 [02:24<02:46, 1.73it/s]
Training 1/1 epoch (loss 1.4395): 54%|ββββββ | 338/625 [02:24<02:46, 1.73it/s]
Training 1/1 epoch (loss 1.4395): 54%|ββββββ | 339/625 [02:24<02:31, 1.89it/s]
Training 1/1 epoch (loss 1.3825): 54%|ββββββ | 339/625 [02:24<02:31, 1.89it/s]
Training 1/1 epoch (loss 1.3825): 54%|ββββββ | 340/625 [02:24<02:20, 2.02it/s]
Training 1/1 epoch (loss 1.3388): 54%|ββββββ | 340/625 [02:25<02:20, 2.02it/s]
Training 1/1 epoch (loss 1.3388): 55%|ββββββ | 341/625 [02:25<02:15, 2.10it/s]
Training 1/1 epoch (loss 1.5246): 55%|ββββββ | 341/625 [02:25<02:15, 2.10it/s]
Training 1/1 epoch (loss 1.5246): 55%|ββββββ | 342/625 [02:25<02:06, 2.24it/s]
Training 1/1 epoch (loss 1.5570): 55%|ββββββ | 342/625 [02:26<02:06, 2.24it/s]
Training 1/1 epoch (loss 1.5570): 55%|ββββββ | 343/625 [02:26<02:00, 2.35it/s]
Training 1/1 epoch (loss 1.4983): 55%|ββββββ | 343/625 [02:26<02:00, 2.35it/s]
Training 1/1 epoch (loss 1.4983): 55%|ββββββ | 344/625 [02:26<01:59, 2.35it/s]
Training 1/1 epoch (loss 1.4875): 55%|ββββββ | 344/625 [02:26<01:59, 2.35it/s]
Training 1/1 epoch (loss 1.4875): 55%|ββββββ | 345/625 [02:26<01:55, 2.43it/s]
Training 1/1 epoch (loss 1.5014): 55%|ββββββ | 345/625 [02:27<01:55, 2.43it/s]
Training 1/1 epoch (loss 1.5014): 55%|ββββββ | 346/625 [02:27<01:51, 2.51it/s]
Training 1/1 epoch (loss 1.3982): 55%|ββββββ | 346/625 [02:27<01:51, 2.51it/s]
Training 1/1 epoch (loss 1.3982): 56%|ββββββ | 347/625 [02:27<01:52, 2.46it/s]
Training 1/1 epoch (loss 1.4414): 56%|ββββββ | 347/625 [02:28<01:52, 2.46it/s]
Training 1/1 epoch (loss 1.4414): 56%|ββββββ | 348/625 [02:28<01:50, 2.51it/s]
Training 1/1 epoch (loss 1.4994): 56%|ββββββ | 348/625 [02:28<01:50, 2.51it/s]
Training 1/1 epoch (loss 1.4994): 56%|ββββββ | 349/625 [02:28<01:47, 2.56it/s]
Training 1/1 epoch (loss 1.4065): 56%|ββββββ | 349/625 [02:28<01:47, 2.56it/s]
Training 1/1 epoch (loss 1.4065): 56%|ββββββ | 350/625 [02:28<01:50, 2.48it/s]
Training 1/1 epoch (loss 1.5056): 56%|ββββββ | 350/625 [02:29<01:50, 2.48it/s]
Training 1/1 epoch (loss 1.5056): 56%|ββββββ | 351/625 [02:29<01:50, 2.48it/s]
Training 1/1 epoch (loss 1.4244): 56%|ββββββ | 351/625 [02:29<01:50, 2.48it/s]
Training 1/1 epoch (loss 1.4244): 56%|ββββββ | 352/625 [02:29<01:48, 2.51it/s]
Training 1/1 epoch (loss 1.4266): 56%|ββββββ | 352/625 [02:30<01:48, 2.51it/s]
Training 1/1 epoch (loss 1.4266): 56%|ββββββ | 353/625 [02:30<01:50, 2.46it/s]
Training 1/1 epoch (loss 1.4466): 56%|ββββββ | 353/625 [02:30<01:50, 2.46it/s]
Training 1/1 epoch (loss 1.4466): 57%|ββββββ | 354/625 [02:30<01:46, 2.54it/s]
Training 1/1 epoch (loss 1.3302): 57%|ββββββ | 354/625 [02:30<01:46, 2.54it/s]
Training 1/1 epoch (loss 1.3302): 57%|ββββββ | 355/625 [02:30<01:49, 2.46it/s]
Training 1/1 epoch (loss 1.4327): 57%|ββββββ | 355/625 [02:31<01:49, 2.46it/s]
Training 1/1 epoch (loss 1.4327): 57%|ββββββ | 356/625 [02:31<01:48, 2.48it/s]
Training 1/1 epoch (loss 1.4759): 57%|ββββββ | 356/625 [02:31<01:48, 2.48it/s]
Training 1/1 epoch (loss 1.4759): 57%|ββββββ | 357/625 [02:31<01:47, 2.50it/s]
Training 1/1 epoch (loss 1.3848): 57%|ββββββ | 357/625 [02:31<01:47, 2.50it/s]
Training 1/1 epoch (loss 1.3848): 57%|ββββββ | 358/625 [02:31<01:45, 2.53it/s]
Training 1/1 epoch (loss 1.5311): 57%|ββββββ | 358/625 [02:32<01:45, 2.53it/s]
Training 1/1 epoch (loss 1.5311): 57%|ββββββ | 359/625 [02:32<01:44, 2.55it/s]
Training 1/1 epoch (loss 1.4341): 57%|ββββββ | 359/625 [02:32<01:44, 2.55it/s]
Training 1/1 epoch (loss 1.4341): 58%|ββββββ | 360/625 [02:32<01:46, 2.48it/s]
Training 1/1 epoch (loss 1.4087): 58%|ββββββ | 360/625 [02:33<01:46, 2.48it/s]
Training 1/1 epoch (loss 1.4087): 58%|ββββββ | 361/625 [02:33<01:45, 2.51it/s]
Training 1/1 epoch (loss 1.4591): 58%|ββββββ | 361/625 [02:33<01:45, 2.51it/s]
Training 1/1 epoch (loss 1.4591): 58%|ββββββ | 362/625 [02:33<01:45, 2.50it/s]
Training 1/1 epoch (loss 1.5158): 58%|ββββββ | 362/625 [02:34<01:45, 2.50it/s]
Training 1/1 epoch (loss 1.5158): 58%|ββββββ | 363/625 [02:34<01:45, 2.49it/s]
Training 1/1 epoch (loss 1.5699): 58%|ββββββ | 363/625 [02:34<01:45, 2.49it/s]
Training 1/1 epoch (loss 1.5699): 58%|ββββββ | 364/625 [02:34<01:41, 2.56it/s]
Training 1/1 epoch (loss 1.4075): 58%|ββββββ | 364/625 [02:34<01:41, 2.56it/s]
Training 1/1 epoch (loss 1.4075): 58%|ββββββ | 365/625 [02:34<01:44, 2.49it/s]
Training 1/1 epoch (loss 1.4680): 58%|ββββββ | 365/625 [02:35<01:44, 2.49it/s]
Training 1/1 epoch (loss 1.4680): 59%|ββββββ | 366/625 [02:35<01:43, 2.50it/s]
Training 1/1 epoch (loss 1.4913): 59%|ββββββ | 366/625 [02:35<01:43, 2.50it/s]
Training 1/1 epoch (loss 1.4913): 59%|ββββββ | 367/625 [02:35<01:42, 2.51it/s]
Training 1/1 epoch (loss 1.3739): 59%|ββββββ | 367/625 [02:35<01:42, 2.51it/s]
Training 1/1 epoch (loss 1.3739): 59%|ββββββ | 368/625 [02:35<01:40, 2.56it/s]
Training 1/1 epoch (loss 1.4434): 59%|ββββββ | 368/625 [02:36<01:40, 2.56it/s]
Training 1/1 epoch (loss 1.4434): 59%|ββββββ | 369/625 [02:36<01:38, 2.59it/s]
Training 1/1 epoch (loss 1.4534): 59%|ββββββ | 369/625 [02:36<01:38, 2.59it/s]
Training 1/1 epoch (loss 1.4534): 59%|ββββββ | 370/625 [02:36<01:42, 2.50it/s]
Training 1/1 epoch (loss 1.5023): 59%|ββββββ | 370/625 [02:37<01:42, 2.50it/s]
Training 1/1 epoch (loss 1.5023): 59%|ββββββ | 371/625 [02:37<01:59, 2.13it/s]
Training 1/1 epoch (loss 1.5777): 59%|ββββββ | 371/625 [02:37<01:59, 2.13it/s]
Training 1/1 epoch (loss 1.5777): 60%|ββββββ | 372/625 [02:37<01:51, 2.26it/s]
Training 1/1 epoch (loss 1.4912): 60%|ββββββ | 372/625 [02:38<01:51, 2.26it/s]
Training 1/1 epoch (loss 1.4912): 60%|ββββββ | 373/625 [02:38<01:47, 2.34it/s]
Training 1/1 epoch (loss 1.4122): 60%|ββββββ | 373/625 [02:38<01:47, 2.34it/s]
Training 1/1 epoch (loss 1.4122): 60%|ββββββ | 374/625 [02:38<01:43, 2.42it/s]
Training 1/1 epoch (loss 1.5119): 60%|ββββββ | 374/625 [02:38<01:43, 2.42it/s]
Training 1/1 epoch (loss 1.5119): 60%|ββββββ | 375/625 [02:38<01:41, 2.47it/s]
Training 1/1 epoch (loss 1.3515): 60%|ββββββ | 375/625 [02:39<01:41, 2.47it/s]
Training 1/1 epoch (loss 1.3515): 60%|ββββββ | 376/625 [02:39<01:41, 2.45it/s]
Training 1/1 epoch (loss 1.3675): 60%|ββββββ | 376/625 [02:39<01:41, 2.45it/s]
Training 1/1 epoch (loss 1.3675): 60%|ββββββ | 377/625 [02:39<01:44, 2.37it/s]
Training 1/1 epoch (loss 1.3884): 60%|ββββββ | 377/625 [02:40<01:44, 2.37it/s]
Training 1/1 epoch (loss 1.3884): 60%|ββββββ | 378/625 [02:40<01:42, 2.42it/s]
Training 1/1 epoch (loss 1.4005): 60%|ββββββ | 378/625 [02:40<01:42, 2.42it/s]
Training 1/1 epoch (loss 1.4005): 61%|ββββββ | 379/625 [02:40<01:38, 2.51it/s]
Training 1/1 epoch (loss 1.4402): 61%|ββββββ | 379/625 [02:40<01:38, 2.51it/s]
Training 1/1 epoch (loss 1.4402): 61%|ββββββ | 380/625 [02:40<01:38, 2.48it/s]
Training 1/1 epoch (loss 1.4535): 61%|ββββββ | 380/625 [02:41<01:38, 2.48it/s]
Training 1/1 epoch (loss 1.4535): 61%|ββββββ | 381/625 [02:41<01:39, 2.46it/s]
Training 1/1 epoch (loss 1.5368): 61%|ββββββ | 381/625 [02:41<01:39, 2.46it/s]
Training 1/1 epoch (loss 1.5368): 61%|ββββββ | 382/625 [02:41<01:37, 2.48it/s]
Training 1/1 epoch (loss 1.4313): 61%|ββββββ | 382/625 [02:42<01:37, 2.48it/s]
Training 1/1 epoch (loss 1.4313): 61%|βββββββ | 383/625 [02:42<01:36, 2.51it/s]
Training 1/1 epoch (loss 1.4009): 61%|βββββββ | 383/625 [02:42<01:36, 2.51it/s]
Training 1/1 epoch (loss 1.4009): 61%|βββββββ | 384/625 [02:42<01:44, 2.31it/s]
Training 1/1 epoch (loss 1.3952): 61%|βββββββ | 384/625 [02:43<01:44, 2.31it/s]
Training 1/1 epoch (loss 1.3952): 62%|βββββββ | 385/625 [02:43<01:40, 2.38it/s]
Training 1/1 epoch (loss 1.4152): 62%|βββββββ | 385/625 [02:43<01:40, 2.38it/s]
Training 1/1 epoch (loss 1.4152): 62%|βββββββ | 386/625 [02:43<01:42, 2.33it/s]
Training 1/1 epoch (loss 1.5176): 62%|βββββββ | 386/625 [02:43<01:42, 2.33it/s]
Training 1/1 epoch (loss 1.5176): 62%|βββββββ | 387/625 [02:43<01:40, 2.37it/s]
Training 1/1 epoch (loss 1.4075): 62%|βββββββ | 387/625 [02:44<01:40, 2.37it/s]
Training 1/1 epoch (loss 1.4075): 62%|βββββββ | 388/625 [02:44<01:39, 2.39it/s]
Training 1/1 epoch (loss 1.4185): 62%|βββββββ | 388/625 [02:44<01:39, 2.39it/s]
Training 1/1 epoch (loss 1.4185): 62%|βββββββ | 389/625 [02:44<01:36, 2.45it/s]
Training 1/1 epoch (loss 1.4121): 62%|βββββββ | 389/625 [02:45<01:36, 2.45it/s]
Training 1/1 epoch (loss 1.4121): 62%|βββββββ | 390/625 [02:45<01:39, 2.35it/s]
Training 1/1 epoch (loss 1.3697): 62%|βββββββ | 390/625 [02:45<01:39, 2.35it/s]
Training 1/1 epoch (loss 1.3697): 63%|βββββββ | 391/625 [02:45<01:37, 2.40it/s]
Training 1/1 epoch (loss 1.4378): 63%|βββββββ | 391/625 [02:45<01:37, 2.40it/s]
Training 1/1 epoch (loss 1.4378): 63%|βββββββ | 392/625 [02:45<01:34, 2.46it/s]
Training 1/1 epoch (loss 1.5333): 63%|βββββββ | 392/625 [02:46<01:34, 2.46it/s]
Training 1/1 epoch (loss 1.5333): 63%|βββββββ | 393/625 [02:46<01:32, 2.51it/s]
Training 1/1 epoch (loss 1.3598): 63%|βββββββ | 393/625 [02:46<01:32, 2.51it/s]
Training 1/1 epoch (loss 1.3598): 63%|βββββββ | 394/625 [02:46<01:30, 2.54it/s]
Training 1/1 epoch (loss 1.5388): 63%|βββββββ | 394/625 [02:47<01:30, 2.54it/s]
Training 1/1 epoch (loss 1.5388): 63%|βββββββ | 395/625 [02:47<01:32, 2.49it/s]
Training 1/1 epoch (loss 1.4338): 63%|βββββββ | 395/625 [02:47<01:32, 2.49it/s]
Training 1/1 epoch (loss 1.4338): 63%|βββββββ | 396/625 [02:47<01:31, 2.52it/s]
Training 1/1 epoch (loss 1.4211): 63%|βββββββ | 396/625 [02:48<01:31, 2.52it/s]
Training 1/1 epoch (loss 1.4211): 64%|βββββββ | 397/625 [02:48<01:39, 2.30it/s]
Training 1/1 epoch (loss 1.4858): 64%|βββββββ | 397/625 [02:48<01:39, 2.30it/s]
Training 1/1 epoch (loss 1.4858): 64%|βββββββ | 398/625 [02:48<01:57, 1.94it/s]
Training 1/1 epoch (loss 1.5530): 64%|βββββββ | 398/625 [02:49<01:57, 1.94it/s]
Training 1/1 epoch (loss 1.5530): 64%|βββββββ | 399/625 [02:49<01:52, 2.01it/s]
Training 1/1 epoch (loss 1.4832): 64%|βββββββ | 399/625 [02:49<01:52, 2.01it/s]
Training 1/1 epoch (loss 1.4832): 64%|βββββββ | 400/625 [02:49<01:45, 2.13it/s]
Training 1/1 epoch (loss 1.4006): 64%|βββββββ | 400/625 [02:50<01:45, 2.13it/s]
Training 1/1 epoch (loss 1.4006): 64%|βββββββ | 401/625 [02:50<01:39, 2.26it/s]
Training 1/1 epoch (loss 1.4322): 64%|βββββββ | 401/625 [02:50<01:39, 2.26it/s]
Training 1/1 epoch (loss 1.4322): 64%|βββββββ | 402/625 [02:50<01:33, 2.38it/s]
Training 1/1 epoch (loss 1.4526): 64%|βββββββ | 402/625 [02:50<01:33, 2.38it/s]
Training 1/1 epoch (loss 1.4526): 64%|βββββββ | 403/625 [02:50<01:30, 2.45it/s]
Training 1/1 epoch (loss 1.4566): 64%|βββββββ | 403/625 [02:51<01:30, 2.45it/s]
Training 1/1 epoch (loss 1.4566): 65%|βββββββ | 404/625 [02:51<01:27, 2.51it/s]
Training 1/1 epoch (loss 1.4641): 65%|βββββββ | 404/625 [02:51<01:27, 2.51it/s]
Training 1/1 epoch (loss 1.4641): 65%|βββββββ | 405/625 [02:51<01:24, 2.59it/s]
Training 1/1 epoch (loss 1.4150): 65%|βββββββ | 405/625 [02:51<01:24, 2.59it/s]
Training 1/1 epoch (loss 1.4150): 65%|βββββββ | 406/625 [02:51<01:23, 2.62it/s]
Training 1/1 epoch (loss 1.4919): 65%|βββββββ | 406/625 [02:52<01:23, 2.62it/s]
Training 1/1 epoch (loss 1.4919): 65%|βββββββ | 407/625 [02:52<01:24, 2.58it/s]
Training 1/1 epoch (loss 1.4409): 65%|βββββββ | 407/625 [02:52<01:24, 2.58it/s]
Training 1/1 epoch (loss 1.4409): 65%|βββββββ | 408/625 [02:52<01:23, 2.61it/s]
Training 1/1 epoch (loss 1.3754): 65%|βββββββ | 408/625 [02:53<01:23, 2.61it/s]
Training 1/1 epoch (loss 1.3754): 65%|βββββββ | 409/625 [02:53<01:28, 2.44it/s]
Training 1/1 epoch (loss 1.3559): 65%|βββββββ | 409/625 [02:53<01:28, 2.44it/s]
Training 1/1 epoch (loss 1.3559): 66%|βββββββ | 410/625 [02:53<01:30, 2.37it/s]
Training 1/1 epoch (loss 1.4576): 66%|βββββββ | 410/625 [02:53<01:30, 2.37it/s]
Training 1/1 epoch (loss 1.4576): 66%|βββββββ | 411/625 [02:53<01:30, 2.37it/s]
Training 1/1 epoch (loss 1.4273): 66%|βββββββ | 411/625 [02:54<01:30, 2.37it/s]
Training 1/1 epoch (loss 1.4273): 66%|βββββββ | 412/625 [02:54<01:30, 2.36it/s]
Training 1/1 epoch (loss 1.3804): 66%|βββββββ | 412/625 [02:54<01:30, 2.36it/s]
Training 1/1 epoch (loss 1.3804): 66%|βββββββ | 413/625 [02:54<01:30, 2.33it/s]
Training 1/1 epoch (loss 1.3740): 66%|βββββββ | 413/625 [02:55<01:30, 2.33it/s]
Training 1/1 epoch (loss 1.3740): 66%|βββββββ | 414/625 [02:55<01:29, 2.36it/s]
Training 1/1 epoch (loss 1.4739): 66%|βββββββ | 414/625 [02:55<01:29, 2.36it/s]
Training 1/1 epoch (loss 1.4739): 66%|βββββββ | 415/625 [02:55<01:28, 2.37it/s]
Training 1/1 epoch (loss 1.5281): 66%|βββββββ | 415/625 [02:56<01:28, 2.37it/s]
Training 1/1 epoch (loss 1.5281): 67%|βββββββ | 416/625 [02:56<01:26, 2.42it/s]
Training 1/1 epoch (loss 1.4126): 67%|βββββββ | 416/625 [02:56<01:26, 2.42it/s]
Training 1/1 epoch (loss 1.4126): 67%|βββββββ | 417/625 [02:56<01:24, 2.46it/s]
Training 1/1 epoch (loss 1.5116): 67%|βββββββ | 417/625 [02:56<01:24, 2.46it/s]
Training 1/1 epoch (loss 1.5116): 67%|βββββββ | 418/625 [02:56<01:20, 2.56it/s]
Training 1/1 epoch (loss 1.4334): 67%|βββββββ | 418/625 [02:57<01:20, 2.56it/s]
Training 1/1 epoch (loss 1.4334): 67%|βββββββ | 419/625 [02:57<01:21, 2.52it/s]
Training 1/1 epoch (loss 1.3257): 67%|βββββββ | 419/625 [02:57<01:21, 2.52it/s]
Training 1/1 epoch (loss 1.3257): 67%|βββββββ | 420/625 [02:57<01:21, 2.50it/s]
Training 1/1 epoch (loss 1.4864): 67%|βββββββ | 420/625 [02:58<01:21, 2.50it/s]
Training 1/1 epoch (loss 1.4864): 67%|βββββββ | 421/625 [02:58<01:21, 2.52it/s]
Training 1/1 epoch (loss 1.4023): 67%|βββββββ | 421/625 [02:58<01:21, 2.52it/s]
Training 1/1 epoch (loss 1.4023): 68%|βββββββ | 422/625 [02:58<01:18, 2.59it/s]
Training 1/1 epoch (loss 1.3754): 68%|βββββββ | 422/625 [02:58<01:18, 2.59it/s]
Training 1/1 epoch (loss 1.3754): 68%|βββββββ | 423/625 [02:58<01:19, 2.53it/s]
Training 1/1 epoch (loss 1.4724): 68%|βββββββ | 423/625 [02:59<01:19, 2.53it/s]
Training 1/1 epoch (loss 1.4724): 68%|βββββββ | 424/625 [02:59<01:20, 2.50it/s]
Training 1/1 epoch (loss 1.4888): 68%|βββββββ | 424/625 [02:59<01:20, 2.50it/s]
Training 1/1 epoch (loss 1.4888): 68%|βββββββ | 425/625 [02:59<01:20, 2.48it/s]
Training 1/1 epoch (loss 1.5504): 68%|βββββββ | 425/625 [03:00<01:20, 2.48it/s]
Training 1/1 epoch (loss 1.5504): 68%|βββββββ | 426/625 [03:00<01:18, 2.52it/s]
Training 1/1 epoch (loss 1.5093): 68%|βββββββ | 426/625 [03:00<01:18, 2.52it/s]
Training 1/1 epoch (loss 1.5093): 68%|βββββββ | 427/625 [03:00<01:19, 2.49it/s]
Training 1/1 epoch (loss 1.3816): 68%|βββββββ | 427/625 [03:00<01:19, 2.49it/s]
Training 1/1 epoch (loss 1.3816): 68%|βββββββ | 428/625 [03:00<01:21, 2.42it/s]
Training 1/1 epoch (loss 1.4660): 68%|βββββββ | 428/625 [03:01<01:21, 2.42it/s]
Training 1/1 epoch (loss 1.4660): 69%|βββββββ | 429/625 [03:01<01:18, 2.49it/s]
Training 1/1 epoch (loss 1.3719): 69%|βββββββ | 429/625 [03:01<01:18, 2.49it/s]
Training 1/1 epoch (loss 1.3719): 69%|βββββββ | 430/625 [03:01<01:17, 2.51it/s]
Training 1/1 epoch (loss 1.4639): 69%|βββββββ | 430/625 [03:02<01:17, 2.51it/s]
Training 1/1 epoch (loss 1.4639): 69%|βββββββ | 431/625 [03:02<01:16, 2.55it/s]
Training 1/1 epoch (loss 1.4608): 69%|βββββββ | 431/625 [03:02<01:16, 2.55it/s]
Training 1/1 epoch (loss 1.4608): 69%|βββββββ | 432/625 [03:02<01:15, 2.55it/s]
Training 1/1 epoch (loss 1.2934): 69%|βββββββ | 432/625 [03:02<01:15, 2.55it/s]
Training 1/1 epoch (loss 1.2934): 69%|βββββββ | 433/625 [03:02<01:14, 2.57it/s]
Training 1/1 epoch (loss 1.4469): 69%|βββββββ | 433/625 [03:03<01:14, 2.57it/s]
Training 1/1 epoch (loss 1.4469): 69%|βββββββ | 434/625 [03:03<01:16, 2.51it/s]
Training 1/1 epoch (loss 1.5199): 69%|βββββββ | 434/625 [03:03<01:16, 2.51it/s]
Training 1/1 epoch (loss 1.5199): 70%|βββββββ | 435/625 [03:03<01:15, 2.50it/s]
Training 1/1 epoch (loss 1.5760): 70%|βββββββ | 435/625 [03:03<01:15, 2.50it/s]
Training 1/1 epoch (loss 1.5760): 70%|βββββββ | 436/625 [03:03<01:14, 2.55it/s]
Training 1/1 epoch (loss 1.5338): 70%|βββββββ | 436/625 [03:04<01:14, 2.55it/s]
Training 1/1 epoch (loss 1.5338): 70%|βββββββ | 437/625 [03:04<01:15, 2.50it/s]
Training 1/1 epoch (loss 1.4775): 70%|βββββββ | 437/625 [03:04<01:15, 2.50it/s]
Training 1/1 epoch (loss 1.4775): 70%|βββββββ | 438/625 [03:04<01:14, 2.50it/s]
Training 1/1 epoch (loss 1.4204): 70%|βββββββ | 438/625 [03:05<01:14, 2.50it/s]
Training 1/1 epoch (loss 1.4204): 70%|βββββββ | 439/625 [03:05<01:15, 2.46it/s]
Training 1/1 epoch (loss 1.4310): 70%|βββββββ | 439/625 [03:05<01:15, 2.46it/s]
Training 1/1 epoch (loss 1.4310): 70%|βββββββ | 440/625 [03:05<01:28, 2.09it/s]
Training 1/1 epoch (loss 1.4536): 70%|βββββββ | 440/625 [03:06<01:28, 2.09it/s]
Training 1/1 epoch (loss 1.4536): 71%|βββββββ | 441/625 [03:06<01:25, 2.16it/s]
Training 1/1 epoch (loss 1.4884): 71%|βββββββ | 441/625 [03:06<01:25, 2.16it/s]
Training 1/1 epoch (loss 1.4884): 71%|βββββββ | 442/625 [03:06<01:19, 2.32it/s]
Training 1/1 epoch (loss 1.4069): 71%|βββββββ | 442/625 [03:07<01:19, 2.32it/s]
Training 1/1 epoch (loss 1.4069): 71%|βββββββ | 443/625 [03:07<01:17, 2.36it/s]
Training 1/1 epoch (loss 1.4912): 71%|βββββββ | 443/625 [03:07<01:17, 2.36it/s]
Training 1/1 epoch (loss 1.4912): 71%|βββββββ | 444/625 [03:07<01:14, 2.42it/s]
Training 1/1 epoch (loss 1.3599): 71%|βββββββ | 444/625 [03:07<01:14, 2.42it/s]
Training 1/1 epoch (loss 1.3599): 71%|βββββββ | 445/625 [03:07<01:14, 2.41it/s]
Training 1/1 epoch (loss 1.4853): 71%|βββββββ | 445/625 [03:08<01:14, 2.41it/s]
Training 1/1 epoch (loss 1.4853): 71%|ββββββββ | 446/625 [03:08<01:12, 2.48it/s]
Training 1/1 epoch (loss 1.3558): 71%|ββββββββ | 446/625 [03:08<01:12, 2.48it/s]
Training 1/1 epoch (loss 1.3558): 72%|ββββββββ | 447/625 [03:08<01:10, 2.54it/s]
Training 1/1 epoch (loss 1.3797): 72%|ββββββββ | 447/625 [03:09<01:10, 2.54it/s]
Training 1/1 epoch (loss 1.3797): 72%|ββββββββ | 448/625 [03:09<01:10, 2.50it/s]
Training 1/1 epoch (loss 1.4886): 72%|ββββββββ | 448/625 [03:09<01:10, 2.50it/s]
Training 1/1 epoch (loss 1.4886): 72%|ββββββββ | 449/625 [03:09<01:10, 2.50it/s]
Training 1/1 epoch (loss 1.4747): 72%|ββββββββ | 449/625 [03:09<01:10, 2.50it/s]
Training 1/1 epoch (loss 1.4747): 72%|ββββββββ | 450/625 [03:09<01:11, 2.46it/s]
Training 1/1 epoch (loss 1.4749): 72%|ββββββββ | 450/625 [03:10<01:11, 2.46it/s]
Training 1/1 epoch (loss 1.4749): 72%|ββββββββ | 451/625 [03:10<01:10, 2.48it/s]
Training 1/1 epoch (loss 1.4114): 72%|ββββββββ | 451/625 [03:10<01:10, 2.48it/s]
Training 1/1 epoch (loss 1.4114): 72%|ββββββββ | 452/625 [03:10<01:10, 2.45it/s]
Training 1/1 epoch (loss 1.4276): 72%|ββββββββ | 452/625 [03:11<01:10, 2.45it/s]
Training 1/1 epoch (loss 1.4276): 72%|ββββββββ | 453/625 [03:11<01:12, 2.39it/s]
Training 1/1 epoch (loss 1.4681): 72%|ββββββββ | 453/625 [03:11<01:12, 2.39it/s]
Training 1/1 epoch (loss 1.4681): 73%|ββββββββ | 454/625 [03:11<01:14, 2.29it/s]
Training 1/1 epoch (loss 1.4184): 73%|ββββββββ | 454/625 [03:11<01:14, 2.29it/s]
Training 1/1 epoch (loss 1.4184): 73%|ββββββββ | 455/625 [03:11<01:11, 2.37it/s]
Training 1/1 epoch (loss 1.5204): 73%|ββββββββ | 455/625 [03:12<01:11, 2.37it/s]
Training 1/1 epoch (loss 1.5204): 73%|ββββββββ | 456/625 [03:12<01:10, 2.40it/s]
Training 1/1 epoch (loss 1.5075): 73%|ββββββββ | 456/625 [03:12<01:10, 2.40it/s]
Training 1/1 epoch (loss 1.5075): 73%|ββββββββ | 457/625 [03:12<01:07, 2.47it/s]
Training 1/1 epoch (loss 1.3734): 73%|ββββββββ | 457/625 [03:13<01:07, 2.47it/s]
Training 1/1 epoch (loss 1.3734): 73%|ββββββββ | 458/625 [03:13<01:06, 2.49it/s]
Training 1/1 epoch (loss 1.4479): 73%|ββββββββ | 458/625 [03:13<01:06, 2.49it/s]
Training 1/1 epoch (loss 1.4479): 73%|ββββββββ | 459/625 [03:13<01:09, 2.38it/s]
Training 1/1 epoch (loss 1.4663): 73%|ββββββββ | 459/625 [03:14<01:09, 2.38it/s]
Training 1/1 epoch (loss 1.4663): 74%|ββββββββ | 460/625 [03:14<01:10, 2.33it/s]
Training 1/1 epoch (loss 1.3602): 74%|ββββββββ | 460/625 [03:14<01:10, 2.33it/s]
Training 1/1 epoch (loss 1.3602): 74%|ββββββββ | 461/625 [03:14<01:07, 2.42it/s]
Training 1/1 epoch (loss 1.4770): 74%|ββββββββ | 461/625 [03:14<01:07, 2.42it/s]
Training 1/1 epoch (loss 1.4770): 74%|ββββββββ | 462/625 [03:14<01:09, 2.34it/s]
Training 1/1 epoch (loss 1.4085): 74%|ββββββββ | 462/625 [03:15<01:09, 2.34it/s]
Training 1/1 epoch (loss 1.4085): 74%|ββββββββ | 463/625 [03:15<01:10, 2.30it/s]
Training 1/1 epoch (loss 1.4743): 74%|ββββββββ | 463/625 [03:15<01:10, 2.30it/s]
Training 1/1 epoch (loss 1.4743): 74%|ββββββββ | 464/625 [03:15<01:09, 2.32it/s]
Training 1/1 epoch (loss 1.4318): 74%|ββββββββ | 464/625 [03:16<01:09, 2.32it/s]
Training 1/1 epoch (loss 1.4318): 74%|ββββββββ | 465/625 [03:16<01:06, 2.42it/s]
Training 1/1 epoch (loss 1.4451): 74%|ββββββββ | 465/625 [03:16<01:06, 2.42it/s]
Training 1/1 epoch (loss 1.4451): 75%|ββββββββ | 466/625 [03:16<01:04, 2.47it/s]
Training 1/1 epoch (loss 1.3253): 75%|ββββββββ | 466/625 [03:16<01:04, 2.47it/s]
Training 1/1 epoch (loss 1.3253): 75%|ββββββββ | 467/625 [03:16<01:04, 2.46it/s]
Training 1/1 epoch (loss 1.5539): 75%|ββββββββ | 467/625 [03:17<01:04, 2.46it/s]
Training 1/1 epoch (loss 1.5539): 75%|ββββββββ | 468/625 [03:17<01:03, 2.46it/s]
Training 1/1 epoch (loss 1.3874): 75%|ββββββββ | 468/625 [03:17<01:03, 2.46it/s]
Training 1/1 epoch (loss 1.3874): 75%|ββββββββ | 469/625 [03:17<01:01, 2.53it/s]
Training 1/1 epoch (loss 1.5439): 75%|ββββββββ | 469/625 [03:18<01:01, 2.53it/s]
Training 1/1 epoch (loss 1.5439): 75%|ββββββββ | 470/625 [03:18<01:15, 2.06it/s]
Training 1/1 epoch (loss 1.3792): 75%|ββββββββ | 470/625 [03:18<01:15, 2.06it/s]
Training 1/1 epoch (loss 1.3792): 75%|ββββββββ | 471/625 [03:18<01:13, 2.08it/s]
Training 1/1 epoch (loss 1.6465): 75%|ββββββββ | 471/625 [03:19<01:13, 2.08it/s]
Training 1/1 epoch (loss 1.6465): 76%|ββββββββ | 472/625 [03:19<01:10, 2.18it/s]
Training 1/1 epoch (loss 1.4604): 76%|ββββββββ | 472/625 [03:19<01:10, 2.18it/s]
Training 1/1 epoch (loss 1.4604): 76%|ββββββββ | 473/625 [03:19<01:07, 2.26it/s]
Training 1/1 epoch (loss 1.3650): 76%|ββββββββ | 473/625 [03:20<01:07, 2.26it/s]
Training 1/1 epoch (loss 1.3650): 76%|ββββββββ | 474/625 [03:20<01:03, 2.37it/s]
Training 1/1 epoch (loss 1.3676): 76%|ββββββββ | 474/625 [03:20<01:03, 2.37it/s]
Training 1/1 epoch (loss 1.3676): 76%|ββββββββ | 475/625 [03:20<01:03, 2.36it/s]
Training 1/1 epoch (loss 1.4405): 76%|ββββββββ | 475/625 [03:20<01:03, 2.36it/s]
Training 1/1 epoch (loss 1.4405): 76%|ββββββββ | 476/625 [03:20<01:01, 2.41it/s]
Training 1/1 epoch (loss 1.3083): 76%|ββββββββ | 476/625 [03:21<01:01, 2.41it/s]
Training 1/1 epoch (loss 1.3083): 76%|ββββββββ | 477/625 [03:21<01:00, 2.45it/s]
Training 1/1 epoch (loss 1.5202): 76%|ββββββββ | 477/625 [03:21<01:00, 2.45it/s]
Training 1/1 epoch (loss 1.5202): 76%|ββββββββ | 478/625 [03:21<00:58, 2.49it/s]
Training 1/1 epoch (loss 1.3294): 76%|ββββββββ | 478/625 [03:22<00:58, 2.49it/s]
Training 1/1 epoch (loss 1.3294): 77%|ββββββββ | 479/625 [03:22<00:57, 2.54it/s]
Training 1/1 epoch (loss 1.4787): 77%|ββββββββ | 479/625 [03:22<00:57, 2.54it/s]
Training 1/1 epoch (loss 1.4787): 77%|ββββββββ | 480/625 [03:22<00:57, 2.51it/s]
Training 1/1 epoch (loss 1.5193): 77%|ββββββββ | 480/625 [03:22<00:57, 2.51it/s]
Training 1/1 epoch (loss 1.5193): 77%|ββββββββ | 481/625 [03:22<00:57, 2.52it/s]
Training 1/1 epoch (loss 1.3382): 77%|ββββββββ | 481/625 [03:23<00:57, 2.52it/s]
Training 1/1 epoch (loss 1.3382): 77%|ββββββββ | 482/625 [03:23<00:55, 2.57it/s]
Training 1/1 epoch (loss 1.4900): 77%|ββββββββ | 482/625 [03:23<00:55, 2.57it/s]
Training 1/1 epoch (loss 1.4900): 77%|ββββββββ | 483/625 [03:23<00:55, 2.54it/s]
Training 1/1 epoch (loss 1.4388): 77%|ββββββββ | 483/625 [03:24<00:55, 2.54it/s]
Training 1/1 epoch (loss 1.4388): 77%|ββββββββ | 484/625 [03:24<01:01, 2.30it/s]
Training 1/1 epoch (loss 1.4276): 77%|ββββββββ | 484/625 [03:24<01:01, 2.30it/s]
Training 1/1 epoch (loss 1.4276): 78%|ββββββββ | 485/625 [03:24<01:00, 2.33it/s]
Training 1/1 epoch (loss 1.3803): 78%|ββββββββ | 485/625 [03:24<01:00, 2.33it/s]
Training 1/1 epoch (loss 1.3803): 78%|ββββββββ | 486/625 [03:24<00:57, 2.42it/s]
Training 1/1 epoch (loss 1.5822): 78%|ββββββββ | 486/625 [03:25<00:57, 2.42it/s]
Training 1/1 epoch (loss 1.5822): 78%|ββββββββ | 487/625 [03:25<00:57, 2.41it/s]
Training 1/1 epoch (loss 1.3427): 78%|ββββββββ | 487/625 [03:25<00:57, 2.41it/s]
Training 1/1 epoch (loss 1.3427): 78%|ββββββββ | 488/625 [03:25<00:57, 2.39it/s]
Training 1/1 epoch (loss 1.3959): 78%|ββββββββ | 488/625 [03:26<00:57, 2.39it/s]
Training 1/1 epoch (loss 1.3959): 78%|ββββββββ | 489/625 [03:26<00:55, 2.47it/s]
Training 1/1 epoch (loss 1.4058): 78%|ββββββββ | 489/625 [03:26<00:55, 2.47it/s]
Training 1/1 epoch (loss 1.4058): 78%|ββββββββ | 490/625 [03:26<00:54, 2.48it/s]
Training 1/1 epoch (loss 1.2734): 78%|ββββββββ | 490/625 [03:26<00:54, 2.48it/s]
Training 1/1 epoch (loss 1.2734): 79%|ββββββββ | 491/625 [03:26<00:52, 2.55it/s]
Training 1/1 epoch (loss 1.3950): 79%|ββββββββ | 491/625 [03:27<00:52, 2.55it/s]
Training 1/1 epoch (loss 1.3950): 79%|ββββββββ | 492/625 [03:27<00:51, 2.57it/s]
Training 1/1 epoch (loss 1.3457): 79%|ββββββββ | 492/625 [03:27<00:51, 2.57it/s]
Training 1/1 epoch (loss 1.3457): 79%|ββββββββ | 493/625 [03:27<00:53, 2.46it/s]
Training 1/1 epoch (loss 1.5260): 79%|ββββββββ | 493/625 [03:28<00:53, 2.46it/s]
Training 1/1 epoch (loss 1.5260): 79%|ββββββββ | 494/625 [03:28<00:51, 2.53it/s]
Training 1/1 epoch (loss 1.3710): 79%|ββββββββ | 494/625 [03:28<00:51, 2.53it/s]
Training 1/1 epoch (loss 1.3710): 79%|ββββββββ | 495/625 [03:28<00:50, 2.60it/s]
Training 1/1 epoch (loss 1.4241): 79%|ββββββββ | 495/625 [03:28<00:50, 2.60it/s]
Training 1/1 epoch (loss 1.4241): 79%|ββββββββ | 496/625 [03:28<00:50, 2.55it/s]
Training 1/1 epoch (loss 1.3364): 79%|ββββββββ | 496/625 [03:29<00:50, 2.55it/s]
Training 1/1 epoch (loss 1.3364): 80%|ββββββββ | 497/625 [03:29<00:50, 2.53it/s]
Training 1/1 epoch (loss 1.5642): 80%|ββββββββ | 497/625 [03:29<00:50, 2.53it/s]
Training 1/1 epoch (loss 1.5642): 80%|ββββββββ | 498/625 [03:29<00:52, 2.42it/s]
Training 1/1 epoch (loss 1.4242): 80%|ββββββββ | 498/625 [03:30<00:52, 2.42it/s]
Training 1/1 epoch (loss 1.4242): 80%|ββββββββ | 499/625 [03:30<00:52, 2.42it/s]
Training 1/1 epoch (loss 1.5848): 80%|ββββββββ | 499/625 [03:30<00:52, 2.42it/s]
Training 1/1 epoch (loss 1.5848): 80%|ββββββββ | 500/625 [03:30<00:51, 2.43it/s]
Training 1/1 epoch (loss 1.3818): 80%|ββββββββ | 500/625 [03:30<00:51, 2.43it/s]
Training 1/1 epoch (loss 1.3818): 80%|ββββββββ | 501/625 [03:30<00:50, 2.44it/s]
Training 1/1 epoch (loss 1.3933): 80%|ββββββββ | 501/625 [03:31<00:50, 2.44it/s]
Training 1/1 epoch (loss 1.3933): 80%|ββββββββ | 502/625 [03:31<00:50, 2.45it/s]
Training 1/1 epoch (loss 1.3367): 80%|ββββββββ | 502/625 [03:31<00:50, 2.45it/s]
Training 1/1 epoch (loss 1.3367): 80%|ββββββββ | 503/625 [03:31<00:49, 2.47it/s]
Training 1/1 epoch (loss 1.3396): 80%|ββββββββ | 503/625 [03:32<00:49, 2.47it/s]
Training 1/1 epoch (loss 1.3396): 81%|ββββββββ | 504/625 [03:32<00:48, 2.50it/s]
Training 1/1 epoch (loss 1.5486): 81%|ββββββββ | 504/625 [03:32<00:48, 2.50it/s]
Training 1/1 epoch (loss 1.5486): 81%|ββββββββ | 505/625 [03:32<00:47, 2.51it/s]
Training 1/1 epoch (loss 1.4059): 81%|ββββββββ | 505/625 [03:32<00:47, 2.51it/s]
Training 1/1 epoch (loss 1.4059): 81%|ββββββββ | 506/625 [03:32<00:46, 2.56it/s]
Training 1/1 epoch (loss 1.4491): 81%|ββββββββ | 506/625 [03:33<00:46, 2.56it/s]
Training 1/1 epoch (loss 1.4491): 81%|ββββββββ | 507/625 [03:33<00:45, 2.57it/s]
Training 1/1 epoch (loss 1.3663): 81%|ββββββββ | 507/625 [03:33<00:45, 2.57it/s]
Training 1/1 epoch (loss 1.3663): 81%|βββββββββ | 508/625 [03:33<00:45, 2.56it/s]
Training 1/1 epoch (loss 1.5077): 81%|βββββββββ | 508/625 [03:34<00:45, 2.56it/s]
Training 1/1 epoch (loss 1.5077): 81%|βββββββββ | 509/625 [03:34<00:47, 2.43it/s]
Training 1/1 epoch (loss 1.4360): 81%|βββββββββ | 509/625 [03:34<00:47, 2.43it/s]
Training 1/1 epoch (loss 1.4360): 82%|βββββββββ | 510/625 [03:34<00:46, 2.47it/s]
Training 1/1 epoch (loss 1.3999): 82%|βββββββββ | 510/625 [03:34<00:46, 2.47it/s]
Training 1/1 epoch (loss 1.3999): 82%|βββββββββ | 511/625 [03:34<00:45, 2.52it/s]
Training 1/1 epoch (loss 1.3976): 82%|βββββββββ | 511/625 [03:35<00:45, 2.52it/s]
Training 1/1 epoch (loss 1.3976): 82%|βββββββββ | 512/625 [03:35<00:45, 2.46it/s]
Training 1/1 epoch (loss 1.5432): 82%|βββββββββ | 512/625 [03:35<00:45, 2.46it/s]
Training 1/1 epoch (loss 1.5432): 82%|βββββββββ | 513/625 [03:35<00:45, 2.45it/s]
Training 1/1 epoch (loss 1.5092): 82%|βββββββββ | 513/625 [03:36<00:45, 2.45it/s]
Training 1/1 epoch (loss 1.5092): 82%|βββββββββ | 514/625 [03:36<00:46, 2.40it/s]
Training 1/1 epoch (loss 1.4325): 82%|βββββββββ | 514/625 [03:36<00:46, 2.40it/s]
Training 1/1 epoch (loss 1.4325): 82%|βββββββββ | 515/625 [03:36<00:45, 2.44it/s]
Training 1/1 epoch (loss 1.4666): 82%|βββββββββ | 515/625 [03:37<00:45, 2.44it/s]
Training 1/1 epoch (loss 1.4666): 83%|βββββββββ | 516/625 [03:37<00:44, 2.44it/s]
Training 1/1 epoch (loss 1.3580): 83%|βββββββββ | 516/625 [03:37<00:44, 2.44it/s]
Training 1/1 epoch (loss 1.3580): 83%|βββββββββ | 517/625 [03:37<00:44, 2.44it/s]
Training 1/1 epoch (loss 1.4570): 83%|βββββββββ | 517/625 [03:37<00:44, 2.44it/s]
Training 1/1 epoch (loss 1.4570): 83%|βββββββββ | 518/625 [03:37<00:43, 2.45it/s]
Training 1/1 epoch (loss 1.5211): 83%|βββββββββ | 518/625 [03:38<00:43, 2.45it/s]
Training 1/1 epoch (loss 1.5211): 83%|βββββββββ | 519/625 [03:38<00:41, 2.53it/s]
Training 1/1 epoch (loss 1.4083): 83%|βββββββββ | 519/625 [03:38<00:41, 2.53it/s]
Training 1/1 epoch (loss 1.4083): 83%|βββββββββ | 520/625 [03:38<00:41, 2.52it/s]
Training 1/1 epoch (loss 1.5544): 83%|βββββββββ | 520/625 [03:38<00:41, 2.52it/s]
Training 1/1 epoch (loss 1.5544): 83%|βββββββββ | 521/625 [03:38<00:41, 2.52it/s]
Training 1/1 epoch (loss 1.2922): 83%|βββββββββ | 521/625 [03:39<00:41, 2.52it/s]
Training 1/1 epoch (loss 1.2922): 84%|βββββββββ | 522/625 [03:39<00:40, 2.54it/s]
Training 1/1 epoch (loss 1.5164): 84%|βββββββββ | 522/625 [03:39<00:40, 2.54it/s]
Training 1/1 epoch (loss 1.5164): 84%|βββββββββ | 523/625 [03:39<00:41, 2.48it/s]
Training 1/1 epoch (loss 1.3083): 84%|βββββββββ | 523/625 [03:40<00:41, 2.48it/s]
Training 1/1 epoch (loss 1.3083): 84%|βββββββββ | 524/625 [03:40<00:39, 2.56it/s]
Training 1/1 epoch (loss 1.4258): 84%|βββββββββ | 524/625 [03:40<00:39, 2.56it/s]
Training 1/1 epoch (loss 1.4258): 84%|βββββββββ | 525/625 [03:40<00:39, 2.56it/s]
Training 1/1 epoch (loss 1.5174): 84%|βββββββββ | 525/625 [03:40<00:39, 2.56it/s]
Training 1/1 epoch (loss 1.5174): 84%|βββββββββ | 526/625 [03:40<00:39, 2.53it/s]
Training 1/1 epoch (loss 1.4362): 84%|βββββββββ | 526/625 [03:41<00:39, 2.53it/s]
Training 1/1 epoch (loss 1.4362): 84%|βββββββββ | 527/625 [03:41<00:40, 2.42it/s]
Training 1/1 epoch (loss 1.4547): 84%|βββββββββ | 527/625 [03:41<00:40, 2.42it/s]
Training 1/1 epoch (loss 1.4547): 84%|βββββββββ | 528/625 [03:41<00:40, 2.39it/s]
Training 1/1 epoch (loss 1.5123): 84%|βββββββββ | 528/625 [03:42<00:40, 2.39it/s]
Training 1/1 epoch (loss 1.5123): 85%|βββββββββ | 529/625 [03:42<00:39, 2.45it/s]
Training 1/1 epoch (loss 1.4899): 85%|βββββββββ | 529/625 [03:42<00:39, 2.45it/s]
Training 1/1 epoch (loss 1.4899): 85%|βββββββββ | 530/625 [03:42<00:38, 2.49it/s]
Training 1/1 epoch (loss 1.3235): 85%|βββββββββ | 530/625 [03:42<00:38, 2.49it/s]
Training 1/1 epoch (loss 1.3235): 85%|βββββββββ | 531/625 [03:42<00:36, 2.55it/s]
Training 1/1 epoch (loss 1.4719): 85%|βββββββββ | 531/625 [03:43<00:36, 2.55it/s]
Training 1/1 epoch (loss 1.4719): 85%|βββββββββ | 532/625 [03:43<00:36, 2.57it/s]
Training 1/1 epoch (loss 1.5149): 85%|βββββββββ | 532/625 [03:43<00:36, 2.57it/s]
Training 1/1 epoch (loss 1.5149): 85%|βββββββββ | 533/625 [03:43<00:36, 2.52it/s]
Training 1/1 epoch (loss 1.5833): 85%|βββββββββ | 533/625 [03:44<00:36, 2.52it/s]
Training 1/1 epoch (loss 1.5833): 85%|βββββββββ | 534/625 [03:44<00:35, 2.55it/s]
Training 1/1 epoch (loss 1.4528): 85%|βββββββββ | 534/625 [03:44<00:35, 2.55it/s]
Training 1/1 epoch (loss 1.4528): 86%|βββββββββ | 535/625 [03:44<00:36, 2.48it/s]
Training 1/1 epoch (loss 1.4263): 86%|βββββββββ | 535/625 [03:45<00:36, 2.48it/s]
Training 1/1 epoch (loss 1.4263): 86%|βββββββββ | 536/625 [03:45<00:36, 2.45it/s]
Training 1/1 epoch (loss 1.4912): 86%|βββββββββ | 536/625 [03:45<00:36, 2.45it/s]
Training 1/1 epoch (loss 1.4912): 86%|βββββββββ | 537/625 [03:45<00:35, 2.49it/s]
Training 1/1 epoch (loss 1.4541): 86%|βββββββββ | 537/625 [03:45<00:35, 2.49it/s]
Training 1/1 epoch (loss 1.4541): 86%|βββββββββ | 538/625 [03:45<00:35, 2.48it/s]
Training 1/1 epoch (loss 1.4729): 86%|βββββββββ | 538/625 [03:46<00:35, 2.48it/s]
Training 1/1 epoch (loss 1.4729): 86%|βββββββββ | 539/625 [03:46<00:34, 2.52it/s]
Training 1/1 epoch (loss 1.4355): 86%|βββββββββ | 539/625 [03:46<00:34, 2.52it/s]
Training 1/1 epoch (loss 1.4355): 86%|βββββββββ | 540/625 [03:46<00:33, 2.53it/s]
Training 1/1 epoch (loss 1.4432): 86%|βββββββββ | 540/625 [03:46<00:33, 2.53it/s]
Training 1/1 epoch (loss 1.4432): 87%|βββββββββ | 541/625 [03:46<00:32, 2.55it/s]
Training 1/1 epoch (loss 1.5283): 87%|βββββββββ | 541/625 [03:47<00:32, 2.55it/s]
Training 1/1 epoch (loss 1.5283): 87%|βββββββββ | 542/625 [03:47<00:34, 2.44it/s]
Training 1/1 epoch (loss 1.3228): 87%|βββββββββ | 542/625 [03:47<00:34, 2.44it/s]
Training 1/1 epoch (loss 1.3228): 87%|βββββββββ | 543/625 [03:47<00:33, 2.48it/s]
Training 1/1 epoch (loss 1.4075): 87%|βββββββββ | 543/625 [03:48<00:33, 2.48it/s]
Training 1/1 epoch (loss 1.4075): 87%|βββββββββ | 544/625 [03:48<00:37, 2.14it/s]
Training 1/1 epoch (loss 1.5155): 87%|βββββββββ | 544/625 [03:48<00:37, 2.14it/s]
Training 1/1 epoch (loss 1.5155): 87%|βββββββββ | 545/625 [03:48<00:37, 2.16it/s]
Training 1/1 epoch (loss 1.4053): 87%|βββββββββ | 545/625 [03:49<00:37, 2.16it/s]
Training 1/1 epoch (loss 1.4053): 87%|βββββββββ | 546/625 [03:49<00:34, 2.28it/s]
Training 1/1 epoch (loss 1.5215): 87%|βββββββββ | 546/625 [03:49<00:34, 2.28it/s]
Training 1/1 epoch (loss 1.5215): 88%|βββββββββ | 547/625 [03:49<00:32, 2.37it/s]
Training 1/1 epoch (loss 1.3679): 88%|βββββββββ | 547/625 [03:50<00:32, 2.37it/s]
Training 1/1 epoch (loss 1.3679): 88%|βββββββββ | 548/625 [03:50<00:31, 2.43it/s]
Training 1/1 epoch (loss 1.3995): 88%|βββββββββ | 548/625 [03:50<00:31, 2.43it/s]
Training 1/1 epoch (loss 1.3995): 88%|βββββββββ | 549/625 [03:50<00:30, 2.53it/s]
Training 1/1 epoch (loss 1.4264): 88%|βββββββββ | 549/625 [03:50<00:30, 2.53it/s]
Training 1/1 epoch (loss 1.4264): 88%|βββββββββ | 550/625 [03:50<00:29, 2.52it/s]
Training 1/1 epoch (loss 1.4928): 88%|βββββββββ | 550/625 [03:51<00:29, 2.52it/s]
Training 1/1 epoch (loss 1.4928): 88%|βββββββββ | 551/625 [03:51<00:29, 2.50it/s]
Training 1/1 epoch (loss 1.3735): 88%|βββββββββ | 551/625 [03:51<00:29, 2.50it/s]
Training 1/1 epoch (loss 1.3735): 88%|βββββββββ | 552/625 [03:51<00:28, 2.52it/s]
Training 1/1 epoch (loss 1.3875): 88%|βββββββββ | 552/625 [03:52<00:28, 2.52it/s]
Training 1/1 epoch (loss 1.3875): 88%|βββββββββ | 553/625 [03:52<00:29, 2.44it/s]
Training 1/1 epoch (loss 1.4190): 88%|βββββββββ | 553/625 [03:52<00:29, 2.44it/s]
Training 1/1 epoch (loss 1.4190): 89%|βββββββββ | 554/625 [03:52<00:27, 2.54it/s]
Training 1/1 epoch (loss 1.4495): 89%|βββββββββ | 554/625 [03:52<00:27, 2.54it/s]
Training 1/1 epoch (loss 1.4495): 89%|βββββββββ | 555/625 [03:52<00:27, 2.55it/s]
Training 1/1 epoch (loss 1.4234): 89%|βββββββββ | 555/625 [03:53<00:27, 2.55it/s]
Training 1/1 epoch (loss 1.4234): 89%|βββββββββ | 556/625 [03:53<00:27, 2.47it/s]
Training 1/1 epoch (loss 1.3825): 89%|βββββββββ | 556/625 [03:53<00:27, 2.47it/s]
Training 1/1 epoch (loss 1.3825): 89%|βββββββββ | 557/625 [03:53<00:27, 2.47it/s]
Training 1/1 epoch (loss 1.4836): 89%|βββββββββ | 557/625 [03:54<00:27, 2.47it/s]
Training 1/1 epoch (loss 1.4836): 89%|βββββββββ | 558/625 [03:54<00:27, 2.45it/s]
Training 1/1 epoch (loss 1.4554): 89%|βββββββββ | 558/625 [03:54<00:27, 2.45it/s]
Training 1/1 epoch (loss 1.4554): 89%|βββββββββ | 559/625 [03:54<00:27, 2.42it/s]
Training 1/1 epoch (loss 1.4124): 89%|βββββββββ | 559/625 [03:54<00:27, 2.42it/s]
Training 1/1 epoch (loss 1.4124): 90%|βββββββββ | 560/625 [03:54<00:27, 2.41it/s]
Training 1/1 epoch (loss 1.4554): 90%|βββββββββ | 560/625 [03:55<00:27, 2.41it/s]
Training 1/1 epoch (loss 1.4554): 90%|βββββββββ | 561/625 [03:55<00:26, 2.42it/s]
Training 1/1 epoch (loss 1.4027): 90%|βββββββββ | 561/625 [03:55<00:26, 2.42it/s]
Training 1/1 epoch (loss 1.4027): 90%|βββββββββ | 562/625 [03:55<00:25, 2.44it/s]
Training 1/1 epoch (loss 1.3814): 90%|βββββββββ | 562/625 [03:56<00:25, 2.44it/s]
Training 1/1 epoch (loss 1.3814): 90%|βββββββββ | 563/625 [03:56<00:24, 2.51it/s]
Training 1/1 epoch (loss 1.4441): 90%|βββββββββ | 563/625 [03:56<00:24, 2.51it/s]
Training 1/1 epoch (loss 1.4441): 90%|βββββββββ | 564/625 [03:56<00:23, 2.57it/s]
Training 1/1 epoch (loss 1.4297): 90%|βββββββββ | 564/625 [03:56<00:23, 2.57it/s]
Training 1/1 epoch (loss 1.4297): 90%|βββββββββ | 565/625 [03:56<00:22, 2.61it/s]
Training 1/1 epoch (loss 1.4105): 90%|βββββββββ | 565/625 [03:57<00:22, 2.61it/s]
Training 1/1 epoch (loss 1.4105): 91%|βββββββββ | 566/625 [03:57<00:22, 2.58it/s]
Training 1/1 epoch (loss 1.3665): 91%|βββββββββ | 566/625 [03:57<00:22, 2.58it/s]
Training 1/1 epoch (loss 1.3665): 91%|βββββββββ | 567/625 [03:57<00:22, 2.52it/s]
Training 1/1 epoch (loss 1.4789): 91%|βββββββββ | 567/625 [03:58<00:22, 2.52it/s]
Training 1/1 epoch (loss 1.4789): 91%|βββββββββ | 568/625 [03:58<00:22, 2.53it/s]
Training 1/1 epoch (loss 1.4045): 91%|βββββββββ | 568/625 [03:58<00:22, 2.53it/s]
Training 1/1 epoch (loss 1.4045): 91%|βββββββββ | 569/625 [03:58<00:21, 2.60it/s]
Training 1/1 epoch (loss 1.4585): 91%|βββββββββ | 569/625 [03:58<00:21, 2.60it/s]
Training 1/1 epoch (loss 1.4585): 91%|βββββββββ | 570/625 [03:58<00:21, 2.53it/s]
Training 1/1 epoch (loss 1.4837): 91%|βββββββββ | 570/625 [03:59<00:21, 2.53it/s]
Training 1/1 epoch (loss 1.4837): 91%|ββββββββββ| 571/625 [03:59<00:21, 2.47it/s]
Training 1/1 epoch (loss 1.5218): 91%|ββββββββββ| 571/625 [03:59<00:21, 2.47it/s]
Training 1/1 epoch (loss 1.5218): 92%|ββββββββββ| 572/625 [03:59<00:21, 2.44it/s]
Training 1/1 epoch (loss 1.3820): 92%|ββββββββββ| 572/625 [04:00<00:21, 2.44it/s]
Training 1/1 epoch (loss 1.3820): 92%|ββββββββββ| 573/625 [04:00<00:21, 2.42it/s]
Training 1/1 epoch (loss 1.3713): 92%|ββββββββββ| 573/625 [04:00<00:21, 2.42it/s]
Training 1/1 epoch (loss 1.3713): 92%|ββββββββββ| 574/625 [04:00<00:20, 2.50it/s]
Training 1/1 epoch (loss 1.4556): 92%|ββββββββββ| 574/625 [04:00<00:20, 2.50it/s]
Training 1/1 epoch (loss 1.4556): 92%|ββββββββββ| 575/625 [04:00<00:20, 2.44it/s]
Training 1/1 epoch (loss 1.4271): 92%|ββββββββββ| 575/625 [04:01<00:20, 2.44it/s]
Training 1/1 epoch (loss 1.4271): 92%|ββββββββββ| 576/625 [04:01<00:20, 2.42it/s]
Training 1/1 epoch (loss 1.4275): 92%|ββββββββββ| 576/625 [04:01<00:20, 2.42it/s]
Training 1/1 epoch (loss 1.4275): 92%|ββββββββββ| 577/625 [04:01<00:19, 2.45it/s]
Training 1/1 epoch (loss 1.4948): 92%|ββββββββββ| 577/625 [04:02<00:19, 2.45it/s]
Training 1/1 epoch (loss 1.4948): 92%|ββββββββββ| 578/625 [04:02<00:19, 2.45it/s]
Training 1/1 epoch (loss 1.4204): 92%|ββββββββββ| 578/625 [04:02<00:19, 2.45it/s]
Training 1/1 epoch (loss 1.4204): 93%|ββββββββββ| 579/625 [04:02<00:18, 2.51it/s]
Training 1/1 epoch (loss 1.4191): 93%|ββββββββββ| 579/625 [04:02<00:18, 2.51it/s]
Training 1/1 epoch (loss 1.4191): 93%|ββββββββββ| 580/625 [04:02<00:18, 2.47it/s]
Training 1/1 epoch (loss 1.3441): 93%|ββββββββββ| 580/625 [04:03<00:18, 2.47it/s]
Training 1/1 epoch (loss 1.3441): 93%|ββββββββββ| 581/625 [04:03<00:17, 2.51it/s]
Training 1/1 epoch (loss 1.4904): 93%|ββββββββββ| 581/625 [04:03<00:17, 2.51it/s]
Training 1/1 epoch (loss 1.4904): 93%|ββββββββββ| 582/625 [04:03<00:17, 2.50it/s]
Training 1/1 epoch (loss 1.4579): 93%|ββββββββββ| 582/625 [04:04<00:17, 2.50it/s]
Training 1/1 epoch (loss 1.4579): 93%|ββββββββββ| 583/625 [04:04<00:17, 2.44it/s]
Training 1/1 epoch (loss 1.4415): 93%|ββββββββββ| 583/625 [04:04<00:17, 2.44it/s]
Training 1/1 epoch (loss 1.4415): 93%|ββββββββββ| 584/625 [04:04<00:17, 2.39it/s]
Training 1/1 epoch (loss 1.4402): 93%|ββββββββββ| 584/625 [04:05<00:17, 2.39it/s]
Training 1/1 epoch (loss 1.4402): 94%|ββββββββββ| 585/625 [04:05<00:17, 2.30it/s]
Training 1/1 epoch (loss 1.4864): 94%|ββββββββββ| 585/625 [04:05<00:17, 2.30it/s]
Training 1/1 epoch (loss 1.4864): 94%|ββββββββββ| 586/625 [04:05<00:16, 2.35it/s]
Training 1/1 epoch (loss 1.4092): 94%|ββββββββββ| 586/625 [04:05<00:16, 2.35it/s]
Training 1/1 epoch (loss 1.4092): 94%|ββββββββββ| 587/625 [04:05<00:15, 2.44it/s]
Training 1/1 epoch (loss 1.5164): 94%|ββββββββββ| 587/625 [04:06<00:15, 2.44it/s]
Training 1/1 epoch (loss 1.5164): 94%|ββββββββββ| 588/625 [04:06<00:14, 2.48it/s]
Training 1/1 epoch (loss 1.3503): 94%|ββββββββββ| 588/625 [04:06<00:14, 2.48it/s]
Training 1/1 epoch (loss 1.3503): 94%|ββββββββββ| 589/625 [04:06<00:14, 2.53it/s]
Training 1/1 epoch (loss 1.3646): 94%|ββββββββββ| 589/625 [04:06<00:14, 2.53it/s]
Training 1/1 epoch (loss 1.3646): 94%|ββββββββββ| 590/625 [04:06<00:13, 2.61it/s]
Training 1/1 epoch (loss 1.3475): 94%|ββββββββββ| 590/625 [04:07<00:13, 2.61it/s]
Training 1/1 epoch (loss 1.3475): 95%|ββββββββββ| 591/625 [04:07<00:13, 2.58it/s]
Training 1/1 epoch (loss 1.4025): 95%|ββββββββββ| 591/625 [04:07<00:13, 2.58it/s]
Training 1/1 epoch (loss 1.4025): 95%|ββββββββββ| 592/625 [04:07<00:13, 2.41it/s]
Training 1/1 epoch (loss 1.4224): 95%|ββββββββββ| 592/625 [04:08<00:13, 2.41it/s]
Training 1/1 epoch (loss 1.4224): 95%|ββββββββββ| 593/625 [04:08<00:12, 2.51it/s]
Training 1/1 epoch (loss 1.3361): 95%|ββββββββββ| 593/625 [04:08<00:12, 2.51it/s]
Training 1/1 epoch (loss 1.3361): 95%|ββββββββββ| 594/625 [04:08<00:12, 2.51it/s]
Training 1/1 epoch (loss 1.4601): 95%|ββββββββββ| 594/625 [04:08<00:12, 2.51it/s]
Training 1/1 epoch (loss 1.4601): 95%|ββββββββββ| 595/625 [04:08<00:11, 2.59it/s]
Training 1/1 epoch (loss 1.4331): 95%|ββββββββββ| 595/625 [04:09<00:11, 2.59it/s]
Training 1/1 epoch (loss 1.4331): 95%|ββββββββββ| 596/625 [04:09<00:11, 2.60it/s]
Training 1/1 epoch (loss 1.2932): 95%|ββββββββββ| 596/625 [04:09<00:11, 2.60it/s]
Training 1/1 epoch (loss 1.2932): 96%|ββββββββββ| 597/625 [04:09<00:10, 2.60it/s]
Training 1/1 epoch (loss 1.3651): 96%|ββββββββββ| 597/625 [04:10<00:10, 2.60it/s]
Training 1/1 epoch (loss 1.3651): 96%|ββββββββββ| 598/625 [04:10<00:10, 2.63it/s]
Training 1/1 epoch (loss 1.4434): 96%|ββββββββββ| 598/625 [04:10<00:10, 2.63it/s]
Training 1/1 epoch (loss 1.4434): 96%|ββββββββββ| 599/625 [04:10<00:09, 2.61it/s]
Training 1/1 epoch (loss 1.3627): 96%|ββββββββββ| 599/625 [04:10<00:09, 2.61it/s]
Training 1/1 epoch (loss 1.3627): 96%|ββββββββββ| 600/625 [04:10<00:10, 2.50it/s]
Training 1/1 epoch (loss 1.5082): 96%|ββββββββββ| 600/625 [04:11<00:10, 2.50it/s]
Training 1/1 epoch (loss 1.5082): 96%|ββββββββββ| 601/625 [04:11<00:09, 2.48it/s]
Training 1/1 epoch (loss 1.4456): 96%|ββββββββββ| 601/625 [04:11<00:09, 2.48it/s]
Training 1/1 epoch (loss 1.4456): 96%|ββββββββββ| 602/625 [04:11<00:09, 2.51it/s]
Training 1/1 epoch (loss 1.3699): 96%|ββββββββββ| 602/625 [04:12<00:09, 2.51it/s]
Training 1/1 epoch (loss 1.3699): 96%|ββββββββββ| 603/625 [04:12<00:08, 2.55it/s]
Training 1/1 epoch (loss 1.4074): 96%|ββββββββββ| 603/625 [04:12<00:08, 2.55it/s]
Training 1/1 epoch (loss 1.4074): 97%|ββββββββββ| 604/625 [04:12<00:08, 2.61it/s]
Training 1/1 epoch (loss 1.4863): 97%|ββββββββββ| 604/625 [04:12<00:08, 2.61it/s]
Training 1/1 epoch (loss 1.4863): 97%|ββββββββββ| 605/625 [04:12<00:07, 2.61it/s]
Training 1/1 epoch (loss 1.4012): 97%|ββββββββββ| 605/625 [04:13<00:07, 2.61it/s]
Training 1/1 epoch (loss 1.4012): 97%|ββββββββββ| 606/625 [04:13<00:07, 2.56it/s]
Training 1/1 epoch (loss 1.4381): 97%|ββββββββββ| 606/625 [04:13<00:07, 2.56it/s]
Training 1/1 epoch (loss 1.4381): 97%|ββββββββββ| 607/625 [04:13<00:07, 2.44it/s]
Training 1/1 epoch (loss 1.5144): 97%|ββββββββββ| 607/625 [04:14<00:07, 2.44it/s]
Training 1/1 epoch (loss 1.5144): 97%|ββββββββββ| 608/625 [04:14<00:07, 2.43it/s]
Training 1/1 epoch (loss 1.4032): 97%|ββββββββββ| 608/625 [04:14<00:07, 2.43it/s]
Training 1/1 epoch (loss 1.4032): 97%|ββββββββββ| 609/625 [04:14<00:06, 2.47it/s]
Training 1/1 epoch (loss 1.3838): 97%|ββββββββββ| 609/625 [04:14<00:06, 2.47it/s]
Training 1/1 epoch (loss 1.3838): 98%|ββββββββββ| 610/625 [04:14<00:06, 2.40it/s]
Training 1/1 epoch (loss 1.4635): 98%|ββββββββββ| 610/625 [04:15<00:06, 2.40it/s]
Training 1/1 epoch (loss 1.4635): 98%|ββββββββββ| 611/625 [04:15<00:05, 2.38it/s]
Training 1/1 epoch (loss 1.4380): 98%|ββββββββββ| 611/625 [04:15<00:05, 2.38it/s]
Training 1/1 epoch (loss 1.4380): 98%|ββββββββββ| 612/625 [04:15<00:05, 2.38it/s]
Training 1/1 epoch (loss 1.4820): 98%|ββββββββββ| 612/625 [04:16<00:05, 2.38it/s]
Training 1/1 epoch (loss 1.4820): 98%|ββββββββββ| 613/625 [04:16<00:04, 2.42it/s]
Training 1/1 epoch (loss 1.3530): 98%|ββββββββββ| 613/625 [04:16<00:04, 2.42it/s]
Training 1/1 epoch (loss 1.3530): 98%|ββββββββββ| 614/625 [04:16<00:04, 2.34it/s]
Training 1/1 epoch (loss 1.3178): 98%|ββββββββββ| 614/625 [04:16<00:04, 2.34it/s]
Training 1/1 epoch (loss 1.3178): 98%|ββββββββββ| 615/625 [04:16<00:04, 2.43it/s]
Training 1/1 epoch (loss 1.4000): 98%|ββββββββββ| 615/625 [04:17<00:04, 2.43it/s]
Training 1/1 epoch (loss 1.4000): 99%|ββββββββββ| 616/625 [04:17<00:03, 2.45it/s]
Training 1/1 epoch (loss 1.4462): 99%|ββββββββββ| 616/625 [04:17<00:03, 2.45it/s]
Training 1/1 epoch (loss 1.4462): 99%|ββββββββββ| 617/625 [04:17<00:03, 2.39it/s]
Training 1/1 epoch (loss 1.4736): 99%|ββββββββββ| 617/625 [04:18<00:03, 2.39it/s]
Training 1/1 epoch (loss 1.4736): 99%|ββββββββββ| 618/625 [04:18<00:03, 1.93it/s]
Training 1/1 epoch (loss 1.3866): 99%|ββββββββββ| 618/625 [04:19<00:03, 1.93it/s]
Training 1/1 epoch (loss 1.3866): 99%|ββββββββββ| 619/625 [04:19<00:02, 2.02it/s]
Training 1/1 epoch (loss 1.3863): 99%|ββββββββββ| 619/625 [04:19<00:02, 2.02it/s]
Training 1/1 epoch (loss 1.3863): 99%|ββββββββββ| 620/625 [04:19<00:02, 2.12it/s]
Training 1/1 epoch (loss 1.4680): 99%|ββββββββββ| 620/625 [04:19<00:02, 2.12it/s]
Training 1/1 epoch (loss 1.4680): 99%|ββββββββββ| 621/625 [04:19<00:01, 2.25it/s]
Training 1/1 epoch (loss 1.4386): 99%|ββββββββββ| 621/625 [04:20<00:01, 2.25it/s]
Training 1/1 epoch (loss 1.4386): 100%|ββββββββββ| 622/625 [04:20<00:01, 2.33it/s]
Training 1/1 epoch (loss 1.4419): 100%|ββββββββββ| 622/625 [04:20<00:01, 2.33it/s]
Training 1/1 epoch (loss 1.4419): 100%|ββββββββββ| 623/625 [04:20<00:00, 2.44it/s]
Training 1/1 epoch (loss 1.4414): 100%|ββββββββββ| 623/625 [04:20<00:00, 2.44it/s]
Training 1/1 epoch (loss 1.4414): 100%|ββββββββββ| 624/625 [04:20<00:00, 2.43it/s]
Training 1/1 epoch (loss 1.3255): 100%|ββββββββββ| 624/625 [04:21<00:00, 2.43it/s]
Training 1/1 epoch (loss 1.3255): 100%|ββββββββββ| 625/625 [04:21<00:00, 2.45it/s]
Training 1/1 epoch (loss 1.3255): 100%|ββββββββββ| 625/625 [04:21<00:00, 2.39it/s] |