Training 1/1 epoch (loss 4.4668): 0%| | 0/625 [00:05<?, ?it/s]
Training 1/1 epoch (loss 4.4668): 0%| | 1/625 [00:05<1:00:05, 5.78s/it]
Training 1/1 epoch (loss 4.6779): 0%| | 1/625 [00:07<1:00:05, 5.78s/it]
Training 1/1 epoch (loss 4.6779): 0%| | 2/625 [00:07<34:37, 3.33s/it]
Training 1/1 epoch (loss 4.5013): 0%| | 2/625 [00:07<34:37, 3.33s/it]
Training 1/1 epoch (loss 4.5013): 0%| | 3/625 [00:07<20:39, 1.99s/it]
Training 1/1 epoch (loss 4.5720): 0%| | 3/625 [00:08<20:39, 1.99s/it]
Training 1/1 epoch (loss 4.5720): 1%| | 4/625 [00:08<13:57, 1.35s/it]
Training 1/1 epoch (loss 4.7567): 1%| | 4/625 [00:08<13:57, 1.35s/it]
Training 1/1 epoch (loss 4.7567): 1%| | 5/625 [00:08<10:17, 1.00it/s]
Training 1/1 epoch (loss 4.7881): 1%| | 5/625 [00:08<10:17, 1.00it/s]
Training 1/1 epoch (loss 4.7881): 1%| | 6/625 [00:08<07:54, 1.31it/s]
Training 1/1 epoch (loss 4.7019): 1%| | 6/625 [00:09<07:54, 1.31it/s]
Training 1/1 epoch (loss 4.7019): 1%| | 7/625 [00:09<06:26, 1.60it/s]
Training 1/1 epoch (loss 4.8541): 1%| | 7/625 [00:09<06:26, 1.60it/s]
Training 1/1 epoch (loss 4.8541): 1%|β | 8/625 [00:09<05:51, 1.76it/s]
Training 1/1 epoch (loss 4.4458): 1%|β | 8/625 [00:09<05:51, 1.76it/s]
Training 1/1 epoch (loss 4.4458): 1%|β | 9/625 [00:09<05:09, 1.99it/s]
Training 1/1 epoch (loss 4.7437): 1%|β | 9/625 [00:10<05:09, 1.99it/s]
Training 1/1 epoch (loss 4.7437): 2%|β | 10/625 [00:10<04:43, 2.17it/s]
Training 1/1 epoch (loss 4.4261): 2%|β | 10/625 [00:10<04:43, 2.17it/s]
Training 1/1 epoch (loss 4.4261): 2%|β | 11/625 [00:10<04:17, 2.38it/s]
Training 1/1 epoch (loss 4.5333): 2%|β | 11/625 [00:11<04:17, 2.38it/s]
Training 1/1 epoch (loss 4.5333): 2%|β | 12/625 [00:11<04:01, 2.54it/s]
Training 1/1 epoch (loss 4.5288): 2%|β | 12/625 [00:11<04:01, 2.54it/s]
Training 1/1 epoch (loss 4.5288): 2%|β | 13/625 [00:11<03:43, 2.74it/s]
Training 1/1 epoch (loss 4.9279): 2%|β | 13/625 [00:11<03:43, 2.74it/s]
Training 1/1 epoch (loss 4.9279): 2%|β | 14/625 [00:11<03:37, 2.81it/s]
Training 1/1 epoch (loss 4.4261): 2%|β | 14/625 [00:11<03:37, 2.81it/s]
Training 1/1 epoch (loss 4.4261): 2%|β | 15/625 [00:11<03:30, 2.89it/s]
Training 1/1 epoch (loss 4.7965): 2%|β | 15/625 [00:12<03:30, 2.89it/s]
Training 1/1 epoch (loss 4.7965): 3%|β | 16/625 [00:12<03:39, 2.77it/s]
Training 1/1 epoch (loss 4.4697): 3%|β | 16/625 [00:12<03:39, 2.77it/s]
Training 1/1 epoch (loss 4.4697): 3%|β | 17/625 [00:12<03:36, 2.81it/s]
Training 1/1 epoch (loss 4.3186): 3%|β | 17/625 [00:13<03:36, 2.81it/s]
Training 1/1 epoch (loss 4.3186): 3%|β | 18/625 [00:13<03:35, 2.81it/s]
Training 1/1 epoch (loss 4.6209): 3%|β | 18/625 [00:13<03:35, 2.81it/s]
Training 1/1 epoch (loss 4.6209): 3%|β | 19/625 [00:13<03:29, 2.89it/s]
Training 1/1 epoch (loss 4.6410): 3%|β | 19/625 [00:13<03:29, 2.89it/s]
Training 1/1 epoch (loss 4.6410): 3%|β | 20/625 [00:13<03:22, 2.99it/s]
Training 1/1 epoch (loss 4.3288): 3%|β | 20/625 [00:14<03:22, 2.99it/s]
Training 1/1 epoch (loss 4.3288): 3%|β | 21/625 [00:14<03:21, 3.00it/s]
Training 1/1 epoch (loss 4.4718): 3%|β | 21/625 [00:14<03:21, 3.00it/s]
Training 1/1 epoch (loss 4.4718): 4%|β | 22/625 [00:14<03:24, 2.95it/s]
Training 1/1 epoch (loss 4.4318): 4%|β | 22/625 [00:14<03:24, 2.95it/s]
Training 1/1 epoch (loss 4.4318): 4%|β | 23/625 [00:14<03:24, 2.94it/s]
Training 1/1 epoch (loss 4.2327): 4%|β | 23/625 [00:15<03:24, 2.94it/s]
Training 1/1 epoch (loss 4.2327): 4%|β | 24/625 [00:15<03:30, 2.85it/s]
Training 1/1 epoch (loss 4.0163): 4%|β | 24/625 [00:15<03:30, 2.85it/s]
Training 1/1 epoch (loss 4.0163): 4%|β | 25/625 [00:15<03:33, 2.81it/s]
Training 1/1 epoch (loss 4.5317): 4%|β | 25/625 [00:15<03:33, 2.81it/s]
Training 1/1 epoch (loss 4.5317): 4%|β | 26/625 [00:15<03:26, 2.90it/s]
Training 1/1 epoch (loss 3.7876): 4%|β | 26/625 [00:16<03:26, 2.90it/s]
Training 1/1 epoch (loss 3.7876): 4%|β | 27/625 [00:16<03:28, 2.87it/s]
Training 1/1 epoch (loss 4.4582): 4%|β | 27/625 [00:16<03:28, 2.87it/s]
Training 1/1 epoch (loss 4.4582): 4%|β | 28/625 [00:16<03:36, 2.76it/s]
Training 1/1 epoch (loss 4.1466): 4%|β | 28/625 [00:16<03:36, 2.76it/s]
Training 1/1 epoch (loss 4.1466): 5%|β | 29/625 [00:16<03:28, 2.85it/s]
Training 1/1 epoch (loss 4.3357): 5%|β | 29/625 [00:17<03:28, 2.85it/s]
Training 1/1 epoch (loss 4.3357): 5%|β | 30/625 [00:17<03:23, 2.92it/s]
Training 1/1 epoch (loss 4.0999): 5%|β | 30/625 [00:17<03:23, 2.92it/s]
Training 1/1 epoch (loss 4.0999): 5%|β | 31/625 [00:17<03:20, 2.96it/s]
Training 1/1 epoch (loss 4.2150): 5%|β | 31/625 [00:17<03:20, 2.96it/s]
Training 1/1 epoch (loss 4.2150): 5%|β | 32/625 [00:17<03:30, 2.81it/s]
Training 1/1 epoch (loss 3.9162): 5%|β | 32/625 [00:18<03:30, 2.81it/s]
Training 1/1 epoch (loss 3.9162): 5%|β | 33/625 [00:18<03:54, 2.52it/s]
Training 1/1 epoch (loss 3.7561): 5%|β | 33/625 [00:18<03:54, 2.52it/s]
Training 1/1 epoch (loss 3.7561): 5%|β | 34/625 [00:18<03:47, 2.59it/s]
Training 1/1 epoch (loss 3.9071): 5%|β | 34/625 [00:19<03:47, 2.59it/s]
Training 1/1 epoch (loss 3.9071): 6%|β | 35/625 [00:19<03:36, 2.72it/s]
Training 1/1 epoch (loss 3.7704): 6%|β | 35/625 [00:19<03:36, 2.72it/s]
Training 1/1 epoch (loss 3.7704): 6%|β | 36/625 [00:19<03:25, 2.86it/s]
Training 1/1 epoch (loss 4.2088): 6%|β | 36/625 [00:19<03:25, 2.86it/s]
Training 1/1 epoch (loss 4.2088): 6%|β | 37/625 [00:19<03:24, 2.88it/s]
Training 1/1 epoch (loss 3.7491): 6%|β | 37/625 [00:20<03:24, 2.88it/s]
Training 1/1 epoch (loss 3.7491): 6%|β | 38/625 [00:20<03:15, 3.01it/s]
Training 1/1 epoch (loss 4.1078): 6%|β | 38/625 [00:20<03:15, 3.01it/s]
Training 1/1 epoch (loss 4.1078): 6%|β | 39/625 [00:20<03:22, 2.89it/s]
Training 1/1 epoch (loss 3.8806): 6%|β | 39/625 [00:20<03:22, 2.89it/s]
Training 1/1 epoch (loss 3.8806): 6%|β | 40/625 [00:20<03:28, 2.81it/s]
Training 1/1 epoch (loss 3.9669): 6%|β | 40/625 [00:21<03:28, 2.81it/s]
Training 1/1 epoch (loss 3.9669): 7%|β | 41/625 [00:21<03:26, 2.83it/s]
Training 1/1 epoch (loss 3.9035): 7%|β | 41/625 [00:21<03:26, 2.83it/s]
Training 1/1 epoch (loss 3.9035): 7%|β | 42/625 [00:21<03:17, 2.96it/s]
Training 1/1 epoch (loss 3.9459): 7%|β | 42/625 [00:21<03:17, 2.96it/s]
Training 1/1 epoch (loss 3.9459): 7%|β | 43/625 [00:21<03:13, 3.01it/s]
Training 1/1 epoch (loss 3.8424): 7%|β | 43/625 [00:22<03:13, 3.01it/s]
Training 1/1 epoch (loss 3.8424): 7%|β | 44/625 [00:22<03:14, 2.99it/s]
Training 1/1 epoch (loss 4.0493): 7%|β | 44/625 [00:22<03:14, 2.99it/s]
Training 1/1 epoch (loss 4.0493): 7%|β | 45/625 [00:22<03:17, 2.94it/s]
Training 1/1 epoch (loss 3.7841): 7%|β | 45/625 [00:22<03:17, 2.94it/s]
Training 1/1 epoch (loss 3.7841): 7%|β | 46/625 [00:22<03:15, 2.96it/s]
Training 1/1 epoch (loss 3.8318): 7%|β | 46/625 [00:23<03:15, 2.96it/s]
Training 1/1 epoch (loss 3.8318): 8%|β | 47/625 [00:23<03:17, 2.92it/s]
Training 1/1 epoch (loss 3.7664): 8%|β | 47/625 [00:23<03:17, 2.92it/s]
Training 1/1 epoch (loss 3.7664): 8%|β | 48/625 [00:23<04:03, 2.37it/s]
Training 1/1 epoch (loss 3.9139): 8%|β | 48/625 [00:24<04:03, 2.37it/s]
Training 1/1 epoch (loss 3.9139): 8%|β | 49/625 [00:24<04:38, 2.07it/s]
Training 1/1 epoch (loss 4.1815): 8%|β | 49/625 [00:24<04:38, 2.07it/s]
Training 1/1 epoch (loss 4.1815): 8%|β | 50/625 [00:24<04:07, 2.33it/s]
Training 1/1 epoch (loss 3.8383): 8%|β | 50/625 [00:25<04:07, 2.33it/s]
Training 1/1 epoch (loss 3.8383): 8%|β | 51/625 [00:25<03:48, 2.51it/s]
Training 1/1 epoch (loss 3.7954): 8%|β | 51/625 [00:25<03:48, 2.51it/s]
Training 1/1 epoch (loss 3.7954): 8%|β | 52/625 [00:25<03:45, 2.54it/s]
Training 1/1 epoch (loss 3.7260): 8%|β | 52/625 [00:25<03:45, 2.54it/s]
Training 1/1 epoch (loss 3.7260): 8%|β | 53/625 [00:25<03:38, 2.62it/s]
Training 1/1 epoch (loss 3.6432): 8%|β | 53/625 [00:26<03:38, 2.62it/s]
Training 1/1 epoch (loss 3.6432): 9%|β | 54/625 [00:26<03:29, 2.72it/s]
Training 1/1 epoch (loss 3.9700): 9%|β | 54/625 [00:26<03:29, 2.72it/s]
Training 1/1 epoch (loss 3.9700): 9%|β | 55/625 [00:26<03:33, 2.67it/s]
Training 1/1 epoch (loss 3.9824): 9%|β | 55/625 [00:26<03:33, 2.67it/s]
Training 1/1 epoch (loss 3.9824): 9%|β | 56/625 [00:26<03:34, 2.66it/s]
Training 1/1 epoch (loss 3.5208): 9%|β | 56/625 [00:27<03:34, 2.66it/s]
Training 1/1 epoch (loss 3.5208): 9%|β | 57/625 [00:27<03:27, 2.74it/s]
Training 1/1 epoch (loss 3.4617): 9%|β | 57/625 [00:27<03:27, 2.74it/s]
Training 1/1 epoch (loss 3.4617): 9%|β | 58/625 [00:27<03:18, 2.86it/s]
Training 1/1 epoch (loss 3.4368): 9%|β | 58/625 [00:27<03:18, 2.86it/s]
Training 1/1 epoch (loss 3.4368): 9%|β | 59/625 [00:27<03:17, 2.87it/s]
Training 1/1 epoch (loss 3.3069): 9%|β | 59/625 [00:28<03:17, 2.87it/s]
Training 1/1 epoch (loss 3.3069): 10%|β | 60/625 [00:28<03:22, 2.79it/s]
Training 1/1 epoch (loss 3.4806): 10%|β | 60/625 [00:28<03:22, 2.79it/s]
Training 1/1 epoch (loss 3.4806): 10%|β | 61/625 [00:28<03:24, 2.76it/s]
Training 1/1 epoch (loss 3.5428): 10%|β | 61/625 [00:28<03:24, 2.76it/s]
Training 1/1 epoch (loss 3.5428): 10%|β | 62/625 [00:28<03:26, 2.73it/s]
Training 1/1 epoch (loss 3.4872): 10%|β | 62/625 [00:29<03:26, 2.73it/s]
Training 1/1 epoch (loss 3.4872): 10%|β | 63/625 [00:29<03:22, 2.78it/s]
Training 1/1 epoch (loss 3.4505): 10%|β | 63/625 [00:29<03:22, 2.78it/s]
Training 1/1 epoch (loss 3.4505): 10%|β | 64/625 [00:29<03:20, 2.80it/s]
Training 1/1 epoch (loss 3.2411): 10%|β | 64/625 [00:30<03:20, 2.80it/s]
Training 1/1 epoch (loss 3.2411): 10%|β | 65/625 [00:30<03:19, 2.81it/s]
Training 1/1 epoch (loss 3.5053): 10%|β | 65/625 [00:30<03:19, 2.81it/s]
Training 1/1 epoch (loss 3.5053): 11%|β | 66/625 [00:30<03:24, 2.74it/s]
Training 1/1 epoch (loss 3.4040): 11%|β | 66/625 [00:30<03:24, 2.74it/s]
Training 1/1 epoch (loss 3.4040): 11%|β | 67/625 [00:30<03:17, 2.83it/s]
Training 1/1 epoch (loss 3.1889): 11%|β | 67/625 [00:31<03:17, 2.83it/s]
Training 1/1 epoch (loss 3.1889): 11%|β | 68/625 [00:31<03:11, 2.91it/s]
Training 1/1 epoch (loss 3.4726): 11%|β | 68/625 [00:31<03:11, 2.91it/s]
Training 1/1 epoch (loss 3.4726): 11%|β | 69/625 [00:31<03:10, 2.93it/s]
Training 1/1 epoch (loss 3.2033): 11%|β | 69/625 [00:31<03:10, 2.93it/s]
Training 1/1 epoch (loss 3.2033): 11%|β | 70/625 [00:31<03:02, 3.03it/s]
Training 1/1 epoch (loss 3.3759): 11%|β | 70/625 [00:32<03:02, 3.03it/s]
Training 1/1 epoch (loss 3.3759): 11%|ββ | 71/625 [00:32<02:58, 3.10it/s]
Training 1/1 epoch (loss 3.2108): 11%|ββ | 71/625 [00:32<02:58, 3.10it/s]
Training 1/1 epoch (loss 3.2108): 12%|ββ | 72/625 [00:32<03:06, 2.97it/s]
Training 1/1 epoch (loss 3.2089): 12%|ββ | 72/625 [00:32<03:06, 2.97it/s]
Training 1/1 epoch (loss 3.2089): 12%|ββ | 73/625 [00:32<03:07, 2.95it/s]
Training 1/1 epoch (loss 3.5316): 12%|ββ | 73/625 [00:33<03:07, 2.95it/s]
Training 1/1 epoch (loss 3.5316): 12%|ββ | 74/625 [00:33<03:10, 2.89it/s]
Training 1/1 epoch (loss 3.0627): 12%|ββ | 74/625 [00:33<03:10, 2.89it/s]
Training 1/1 epoch (loss 3.0627): 12%|ββ | 75/625 [00:33<03:04, 2.98it/s]
Training 1/1 epoch (loss 3.1479): 12%|ββ | 75/625 [00:33<03:04, 2.98it/s]
Training 1/1 epoch (loss 3.1479): 12%|ββ | 76/625 [00:33<02:59, 3.06it/s]
Training 1/1 epoch (loss 3.2397): 12%|ββ | 76/625 [00:34<02:59, 3.06it/s]
Training 1/1 epoch (loss 3.2397): 12%|ββ | 77/625 [00:34<02:57, 3.08it/s]
Training 1/1 epoch (loss 3.2617): 12%|ββ | 77/625 [00:34<02:57, 3.08it/s]
Training 1/1 epoch (loss 3.2617): 12%|ββ | 78/625 [00:34<03:16, 2.78it/s]
Training 1/1 epoch (loss 3.4527): 12%|ββ | 78/625 [00:34<03:16, 2.78it/s]
Training 1/1 epoch (loss 3.4527): 13%|ββ | 79/625 [00:34<03:12, 2.84it/s]
Training 1/1 epoch (loss 3.2472): 13%|ββ | 79/625 [00:35<03:12, 2.84it/s]
Training 1/1 epoch (loss 3.2472): 13%|ββ | 80/625 [00:35<03:08, 2.88it/s]
Training 1/1 epoch (loss 3.2342): 13%|ββ | 80/625 [00:35<03:08, 2.88it/s]
Training 1/1 epoch (loss 3.2342): 13%|ββ | 81/625 [00:35<03:10, 2.86it/s]
Training 1/1 epoch (loss 3.2009): 13%|ββ | 81/625 [00:35<03:10, 2.86it/s]
Training 1/1 epoch (loss 3.2009): 13%|ββ | 82/625 [00:35<03:06, 2.92it/s]
Training 1/1 epoch (loss 3.1946): 13%|ββ | 82/625 [00:36<03:06, 2.92it/s]
Training 1/1 epoch (loss 3.1946): 13%|ββ | 83/625 [00:36<03:01, 2.99it/s]
Training 1/1 epoch (loss 2.9060): 13%|ββ | 83/625 [00:36<03:01, 2.99it/s]
Training 1/1 epoch (loss 2.9060): 13%|ββ | 84/625 [00:36<03:05, 2.92it/s]
Training 1/1 epoch (loss 3.2977): 13%|ββ | 84/625 [00:36<03:05, 2.92it/s]
Training 1/1 epoch (loss 3.2977): 14%|ββ | 85/625 [00:36<03:02, 2.95it/s]
Training 1/1 epoch (loss 3.1080): 14%|ββ | 85/625 [00:37<03:02, 2.95it/s]
Training 1/1 epoch (loss 3.1080): 14%|ββ | 86/625 [00:37<03:00, 2.99it/s]
Training 1/1 epoch (loss 3.2930): 14%|ββ | 86/625 [00:37<03:00, 2.99it/s]
Training 1/1 epoch (loss 3.2930): 14%|ββ | 87/625 [00:37<02:58, 3.02it/s]
Training 1/1 epoch (loss 3.3207): 14%|ββ | 87/625 [00:37<02:58, 3.02it/s]
Training 1/1 epoch (loss 3.3207): 14%|ββ | 88/625 [00:37<03:06, 2.87it/s]
Training 1/1 epoch (loss 3.3501): 14%|ββ | 88/625 [00:38<03:06, 2.87it/s]
Training 1/1 epoch (loss 3.3501): 14%|ββ | 89/625 [00:38<03:07, 2.85it/s]
Training 1/1 epoch (loss 3.3579): 14%|ββ | 89/625 [00:38<03:07, 2.85it/s]
Training 1/1 epoch (loss 3.3579): 14%|ββ | 90/625 [00:38<03:08, 2.84it/s]
Training 1/1 epoch (loss 2.9940): 14%|ββ | 90/625 [00:38<03:08, 2.84it/s]
Training 1/1 epoch (loss 2.9940): 15%|ββ | 91/625 [00:38<03:18, 2.69it/s]
Training 1/1 epoch (loss 3.1116): 15%|ββ | 91/625 [00:39<03:18, 2.69it/s]
Training 1/1 epoch (loss 3.1116): 15%|ββ | 92/625 [00:39<03:09, 2.81it/s]
Training 1/1 epoch (loss 3.1253): 15%|ββ | 92/625 [00:39<03:09, 2.81it/s]
Training 1/1 epoch (loss 3.1253): 15%|ββ | 93/625 [00:39<03:09, 2.81it/s]
Training 1/1 epoch (loss 3.1985): 15%|ββ | 93/625 [00:40<03:09, 2.81it/s]
Training 1/1 epoch (loss 3.1985): 15%|ββ | 94/625 [00:40<03:11, 2.77it/s]
Training 1/1 epoch (loss 3.2941): 15%|ββ | 94/625 [00:40<03:11, 2.77it/s]
Training 1/1 epoch (loss 3.2941): 15%|ββ | 95/625 [00:40<03:13, 2.74it/s]
Training 1/1 epoch (loss 3.0701): 15%|ββ | 95/625 [00:40<03:13, 2.74it/s]
Training 1/1 epoch (loss 3.0701): 15%|ββ | 96/625 [00:40<03:09, 2.80it/s]
Training 1/1 epoch (loss 3.0740): 15%|ββ | 96/625 [00:41<03:09, 2.80it/s]
Training 1/1 epoch (loss 3.0740): 16%|ββ | 97/625 [00:41<03:06, 2.83it/s]
Training 1/1 epoch (loss 3.0261): 16%|ββ | 97/625 [00:41<03:06, 2.83it/s]
Training 1/1 epoch (loss 3.0261): 16%|ββ | 98/625 [00:41<03:03, 2.87it/s]
Training 1/1 epoch (loss 3.2906): 16%|ββ | 98/625 [00:41<03:03, 2.87it/s]
Training 1/1 epoch (loss 3.2906): 16%|ββ | 99/625 [00:41<02:56, 2.98it/s]
Training 1/1 epoch (loss 3.1607): 16%|ββ | 99/625 [00:42<02:56, 2.98it/s]
Training 1/1 epoch (loss 3.1607): 16%|ββ | 100/625 [00:42<03:04, 2.84it/s]
Training 1/1 epoch (loss 3.0722): 16%|ββ | 100/625 [00:42<03:04, 2.84it/s]
Training 1/1 epoch (loss 3.0722): 16%|ββ | 101/625 [00:42<03:03, 2.85it/s]
Training 1/1 epoch (loss 3.1539): 16%|ββ | 101/625 [00:42<03:03, 2.85it/s]
Training 1/1 epoch (loss 3.1539): 16%|ββ | 102/625 [00:42<02:57, 2.95it/s]
Training 1/1 epoch (loss 3.1304): 16%|ββ | 102/625 [00:43<02:57, 2.95it/s]
Training 1/1 epoch (loss 3.1304): 16%|ββ | 103/625 [00:43<02:58, 2.92it/s]
Training 1/1 epoch (loss 3.0015): 16%|ββ | 103/625 [00:43<02:58, 2.92it/s]
Training 1/1 epoch (loss 3.0015): 17%|ββ | 104/625 [00:43<02:55, 2.96it/s]
Training 1/1 epoch (loss 3.0181): 17%|ββ | 104/625 [00:43<02:55, 2.96it/s]
Training 1/1 epoch (loss 3.0181): 17%|ββ | 105/625 [00:43<02:57, 2.94it/s]
Training 1/1 epoch (loss 2.9742): 17%|ββ | 105/625 [00:44<02:57, 2.94it/s]
Training 1/1 epoch (loss 2.9742): 17%|ββ | 106/625 [00:44<02:54, 2.98it/s]
Training 1/1 epoch (loss 3.4916): 17%|ββ | 106/625 [00:44<02:54, 2.98it/s]
Training 1/1 epoch (loss 3.4916): 17%|ββ | 107/625 [00:44<02:56, 2.93it/s]
Training 1/1 epoch (loss 3.1583): 17%|ββ | 107/625 [00:44<02:56, 2.93it/s]
Training 1/1 epoch (loss 3.1583): 17%|ββ | 108/625 [00:44<02:50, 3.03it/s]
Training 1/1 epoch (loss 3.2220): 17%|ββ | 108/625 [00:45<02:50, 3.03it/s]
Training 1/1 epoch (loss 3.2220): 17%|ββ | 109/625 [00:45<02:58, 2.89it/s]
Training 1/1 epoch (loss 3.2727): 17%|ββ | 109/625 [00:45<02:58, 2.89it/s]
Training 1/1 epoch (loss 3.2727): 18%|ββ | 110/625 [00:45<02:56, 2.92it/s]
Training 1/1 epoch (loss 3.2626): 18%|ββ | 110/625 [00:45<02:56, 2.92it/s]
Training 1/1 epoch (loss 3.2626): 18%|ββ | 111/625 [00:45<02:56, 2.92it/s]
Training 1/1 epoch (loss 3.1803): 18%|ββ | 111/625 [00:46<02:56, 2.92it/s]
Training 1/1 epoch (loss 3.1803): 18%|ββ | 112/625 [00:46<03:00, 2.84it/s]
Training 1/1 epoch (loss 2.8998): 18%|ββ | 112/625 [00:46<03:00, 2.84it/s]
Training 1/1 epoch (loss 2.8998): 18%|ββ | 113/625 [00:46<03:04, 2.78it/s]
Training 1/1 epoch (loss 3.0520): 18%|ββ | 113/625 [00:46<03:04, 2.78it/s]
Training 1/1 epoch (loss 3.0520): 18%|ββ | 114/625 [00:46<02:59, 2.84it/s]
Training 1/1 epoch (loss 3.1207): 18%|ββ | 114/625 [00:47<02:59, 2.84it/s]
Training 1/1 epoch (loss 3.1207): 18%|ββ | 115/625 [00:47<02:59, 2.83it/s]
Training 1/1 epoch (loss 3.2280): 18%|ββ | 115/625 [00:47<02:59, 2.83it/s]
Training 1/1 epoch (loss 3.2280): 19%|ββ | 116/625 [00:47<02:54, 2.91it/s]
Training 1/1 epoch (loss 3.1190): 19%|ββ | 116/625 [00:47<02:54, 2.91it/s]
Training 1/1 epoch (loss 3.1190): 19%|ββ | 117/625 [00:47<02:58, 2.85it/s]
Training 1/1 epoch (loss 2.7914): 19%|ββ | 117/625 [00:48<02:58, 2.85it/s]
Training 1/1 epoch (loss 2.7914): 19%|ββ | 118/625 [00:48<02:55, 2.89it/s]
Training 1/1 epoch (loss 2.7535): 19%|ββ | 118/625 [00:48<02:55, 2.89it/s]
Training 1/1 epoch (loss 2.7535): 19%|ββ | 119/625 [00:48<02:54, 2.91it/s]
Training 1/1 epoch (loss 2.8812): 19%|ββ | 119/625 [00:49<02:54, 2.91it/s]
Training 1/1 epoch (loss 2.8812): 19%|ββ | 120/625 [00:49<02:57, 2.84it/s]
Training 1/1 epoch (loss 2.8459): 19%|ββ | 120/625 [00:49<02:57, 2.84it/s]
Training 1/1 epoch (loss 2.8459): 19%|ββ | 121/625 [00:49<02:52, 2.93it/s]
Training 1/1 epoch (loss 2.8998): 19%|ββ | 121/625 [00:49<02:52, 2.93it/s]
Training 1/1 epoch (loss 2.8998): 20%|ββ | 122/625 [00:49<02:50, 2.95it/s]
Training 1/1 epoch (loss 2.8491): 20%|ββ | 122/625 [00:50<02:50, 2.95it/s]
Training 1/1 epoch (loss 2.8491): 20%|ββ | 123/625 [00:50<02:53, 2.89it/s]
Training 1/1 epoch (loss 3.0644): 20%|ββ | 123/625 [00:50<02:53, 2.89it/s]
Training 1/1 epoch (loss 3.0644): 20%|ββ | 124/625 [00:50<03:06, 2.68it/s]
Training 1/1 epoch (loss 2.9004): 20%|ββ | 124/625 [00:50<03:06, 2.68it/s]
Training 1/1 epoch (loss 2.9004): 20%|ββ | 125/625 [00:50<03:01, 2.75it/s]
Training 1/1 epoch (loss 3.0749): 20%|ββ | 125/625 [00:51<03:01, 2.75it/s]
Training 1/1 epoch (loss 3.0749): 20%|ββ | 126/625 [00:51<03:00, 2.76it/s]
Training 1/1 epoch (loss 3.0616): 20%|ββ | 126/625 [00:51<03:00, 2.76it/s]
Training 1/1 epoch (loss 3.0616): 20%|ββ | 127/625 [00:51<02:52, 2.88it/s]
Training 1/1 epoch (loss 2.9648): 20%|ββ | 127/625 [00:51<02:52, 2.88it/s]
Training 1/1 epoch (loss 2.9648): 20%|ββ | 128/625 [00:51<02:50, 2.92it/s]
Training 1/1 epoch (loss 2.8082): 20%|ββ | 128/625 [00:52<02:50, 2.92it/s]
Training 1/1 epoch (loss 2.8082): 21%|ββ | 129/625 [00:52<02:49, 2.92it/s]
Training 1/1 epoch (loss 3.0368): 21%|ββ | 129/625 [00:52<02:49, 2.92it/s]
Training 1/1 epoch (loss 3.0368): 21%|ββ | 130/625 [00:52<02:46, 2.98it/s]
Training 1/1 epoch (loss 2.8160): 21%|ββ | 130/625 [00:52<02:46, 2.98it/s]
Training 1/1 epoch (loss 2.8160): 21%|ββ | 131/625 [00:52<02:48, 2.94it/s]
Training 1/1 epoch (loss 3.1015): 21%|ββ | 131/625 [00:53<02:48, 2.94it/s]
Training 1/1 epoch (loss 3.1015): 21%|ββ | 132/625 [00:53<02:46, 2.96it/s]
Training 1/1 epoch (loss 3.0130): 21%|ββ | 132/625 [00:53<02:46, 2.96it/s]
Training 1/1 epoch (loss 3.0130): 21%|βββ | 133/625 [00:53<02:56, 2.78it/s]
Training 1/1 epoch (loss 2.8835): 21%|βββ | 133/625 [00:54<02:56, 2.78it/s]
Training 1/1 epoch (loss 2.8835): 21%|βββ | 134/625 [00:54<03:27, 2.37it/s]
Training 1/1 epoch (loss 3.0746): 21%|βββ | 134/625 [00:54<03:27, 2.37it/s]
Training 1/1 epoch (loss 3.0746): 22%|βββ | 135/625 [00:54<03:19, 2.46it/s]
Training 1/1 epoch (loss 3.1257): 22%|βββ | 135/625 [00:54<03:19, 2.46it/s]
Training 1/1 epoch (loss 3.1257): 22%|βββ | 136/625 [00:54<03:12, 2.54it/s]
Training 1/1 epoch (loss 2.9486): 22%|βββ | 136/625 [00:55<03:12, 2.54it/s]
Training 1/1 epoch (loss 2.9486): 22%|βββ | 137/625 [00:55<02:59, 2.71it/s]
Training 1/1 epoch (loss 2.9305): 22%|βββ | 137/625 [00:55<02:59, 2.71it/s]
Training 1/1 epoch (loss 2.9305): 22%|βββ | 138/625 [00:55<03:23, 2.40it/s]
Training 1/1 epoch (loss 3.0390): 22%|βββ | 138/625 [00:56<03:23, 2.40it/s]
Training 1/1 epoch (loss 3.0390): 22%|βββ | 139/625 [00:56<03:13, 2.52it/s]
Training 1/1 epoch (loss 2.8485): 22%|βββ | 139/625 [00:56<03:13, 2.52it/s]
Training 1/1 epoch (loss 2.8485): 22%|βββ | 140/625 [00:56<03:10, 2.54it/s]
Training 1/1 epoch (loss 2.8152): 22%|βββ | 140/625 [00:56<03:10, 2.54it/s]
Training 1/1 epoch (loss 2.8152): 23%|βββ | 141/625 [00:56<03:04, 2.62it/s]
Training 1/1 epoch (loss 2.9032): 23%|βββ | 141/625 [00:57<03:04, 2.62it/s]
Training 1/1 epoch (loss 2.9032): 23%|βββ | 142/625 [00:57<02:56, 2.74it/s]
Training 1/1 epoch (loss 2.9801): 23%|βββ | 142/625 [00:57<02:56, 2.74it/s]
Training 1/1 epoch (loss 2.9801): 23%|βββ | 143/625 [00:57<02:56, 2.73it/s]
Training 1/1 epoch (loss 3.1263): 23%|βββ | 143/625 [00:57<02:56, 2.73it/s]
Training 1/1 epoch (loss 3.1263): 23%|βββ | 144/625 [00:57<02:54, 2.75it/s]
Training 1/1 epoch (loss 3.1262): 23%|βββ | 144/625 [00:58<02:54, 2.75it/s]
Training 1/1 epoch (loss 3.1262): 23%|βββ | 145/625 [00:58<02:56, 2.72it/s]
Training 1/1 epoch (loss 3.0276): 23%|βββ | 145/625 [00:58<02:56, 2.72it/s]
Training 1/1 epoch (loss 3.0276): 23%|βββ | 146/625 [00:58<02:55, 2.73it/s]
Training 1/1 epoch (loss 2.9039): 23%|βββ | 146/625 [00:58<02:55, 2.73it/s]
Training 1/1 epoch (loss 2.9039): 24%|βββ | 147/625 [00:58<02:50, 2.81it/s]
Training 1/1 epoch (loss 3.0482): 24%|βββ | 147/625 [00:59<02:50, 2.81it/s]
Training 1/1 epoch (loss 3.0482): 24%|βββ | 148/625 [00:59<02:45, 2.88it/s]
Training 1/1 epoch (loss 3.0285): 24%|βββ | 148/625 [00:59<02:45, 2.88it/s]
Training 1/1 epoch (loss 3.0285): 24%|βββ | 149/625 [00:59<02:40, 2.96it/s]
Training 1/1 epoch (loss 2.9273): 24%|βββ | 149/625 [00:59<02:40, 2.96it/s]
Training 1/1 epoch (loss 2.9273): 24%|βββ | 150/625 [00:59<02:40, 2.96it/s]
Training 1/1 epoch (loss 2.9248): 24%|βββ | 150/625 [01:00<02:40, 2.96it/s]
Training 1/1 epoch (loss 2.9248): 24%|βββ | 151/625 [01:00<02:44, 2.88it/s]
Training 1/1 epoch (loss 2.9653): 24%|βββ | 151/625 [01:00<02:44, 2.88it/s]
Training 1/1 epoch (loss 2.9653): 24%|βββ | 152/625 [01:00<02:56, 2.68it/s]
Training 1/1 epoch (loss 2.9838): 24%|βββ | 152/625 [01:01<02:56, 2.68it/s]
Training 1/1 epoch (loss 2.9838): 24%|βββ | 153/625 [01:01<02:48, 2.80it/s]
Training 1/1 epoch (loss 2.9592): 24%|βββ | 153/625 [01:01<02:48, 2.80it/s]
Training 1/1 epoch (loss 2.9592): 25%|βββ | 154/625 [01:01<02:45, 2.84it/s]
Training 1/1 epoch (loss 3.0122): 25%|βββ | 154/625 [01:01<02:45, 2.84it/s]
Training 1/1 epoch (loss 3.0122): 25%|βββ | 155/625 [01:01<02:39, 2.95it/s]
Training 1/1 epoch (loss 2.8762): 25%|βββ | 155/625 [01:01<02:39, 2.95it/s]
Training 1/1 epoch (loss 2.8762): 25%|βββ | 156/625 [01:01<02:35, 3.01it/s]
Training 1/1 epoch (loss 3.2393): 25%|βββ | 156/625 [01:02<02:35, 3.01it/s]
Training 1/1 epoch (loss 3.2393): 25%|βββ | 157/625 [01:02<02:33, 3.05it/s]
Training 1/1 epoch (loss 2.9452): 25%|βββ | 157/625 [01:02<02:33, 3.05it/s]
Training 1/1 epoch (loss 2.9452): 25%|βββ | 158/625 [01:02<02:42, 2.88it/s]
Training 1/1 epoch (loss 3.1170): 25%|βββ | 158/625 [01:02<02:42, 2.88it/s]
Training 1/1 epoch (loss 3.1170): 25%|βββ | 159/625 [01:02<02:33, 3.03it/s]
Training 1/1 epoch (loss 2.8022): 25%|βββ | 159/625 [01:03<02:33, 3.03it/s]
Training 1/1 epoch (loss 2.8022): 26%|βββ | 160/625 [01:03<02:35, 3.00it/s]
Training 1/1 epoch (loss 3.0377): 26%|βββ | 160/625 [01:03<02:35, 3.00it/s]
Training 1/1 epoch (loss 3.0377): 26%|βββ | 161/625 [01:03<02:35, 2.99it/s]
Training 1/1 epoch (loss 3.0006): 26%|βββ | 161/625 [01:03<02:35, 2.99it/s]
Training 1/1 epoch (loss 3.0006): 26%|βββ | 162/625 [01:03<02:32, 3.03it/s]
Training 1/1 epoch (loss 3.0049): 26%|βββ | 162/625 [01:04<02:32, 3.03it/s]
Training 1/1 epoch (loss 3.0049): 26%|βββ | 163/625 [01:04<02:36, 2.94it/s]
Training 1/1 epoch (loss 2.9746): 26%|βββ | 163/625 [01:04<02:36, 2.94it/s]
Training 1/1 epoch (loss 2.9746): 26%|βββ | 164/625 [01:04<02:39, 2.89it/s]
Training 1/1 epoch (loss 2.9568): 26%|βββ | 164/625 [01:05<02:39, 2.89it/s]
Training 1/1 epoch (loss 2.9568): 26%|βββ | 165/625 [01:05<02:34, 2.97it/s]
Training 1/1 epoch (loss 2.8485): 26%|βββ | 165/625 [01:05<02:34, 2.97it/s]
Training 1/1 epoch (loss 2.8485): 27%|βββ | 166/625 [01:05<02:32, 3.02it/s]
Training 1/1 epoch (loss 2.7434): 27%|βββ | 166/625 [01:05<02:32, 3.02it/s]
Training 1/1 epoch (loss 2.7434): 27%|βββ | 167/625 [01:05<02:32, 3.01it/s]
Training 1/1 epoch (loss 2.8790): 27%|βββ | 167/625 [01:06<02:32, 3.01it/s]
Training 1/1 epoch (loss 2.8790): 27%|βββ | 168/625 [01:06<02:34, 2.97it/s]
Training 1/1 epoch (loss 2.6690): 27%|βββ | 168/625 [01:06<02:34, 2.97it/s]
Training 1/1 epoch (loss 2.6690): 27%|βββ | 169/625 [01:06<02:45, 2.76it/s]
Training 1/1 epoch (loss 2.9139): 27%|βββ | 169/625 [01:06<02:45, 2.76it/s]
Training 1/1 epoch (loss 2.9139): 27%|βββ | 170/625 [01:06<02:48, 2.70it/s]
Training 1/1 epoch (loss 2.8911): 27%|βββ | 170/625 [01:07<02:48, 2.70it/s]
Training 1/1 epoch (loss 2.8911): 27%|βββ | 171/625 [01:07<02:40, 2.82it/s]
Training 1/1 epoch (loss 3.0765): 27%|βββ | 171/625 [01:07<02:40, 2.82it/s]
Training 1/1 epoch (loss 3.0765): 28%|βββ | 172/625 [01:07<02:36, 2.89it/s]
Training 1/1 epoch (loss 2.8429): 28%|βββ | 172/625 [01:07<02:36, 2.89it/s]
Training 1/1 epoch (loss 2.8429): 28%|βββ | 173/625 [01:07<02:42, 2.78it/s]
Training 1/1 epoch (loss 2.6896): 28%|βββ | 173/625 [01:08<02:42, 2.78it/s]
Training 1/1 epoch (loss 2.6896): 28%|βββ | 174/625 [01:08<02:37, 2.86it/s]
Training 1/1 epoch (loss 2.9691): 28%|βββ | 174/625 [01:08<02:37, 2.86it/s]
Training 1/1 epoch (loss 2.9691): 28%|βββ | 175/625 [01:08<02:34, 2.92it/s]
Training 1/1 epoch (loss 2.8055): 28%|βββ | 175/625 [01:08<02:34, 2.92it/s]
Training 1/1 epoch (loss 2.8055): 28%|βββ | 176/625 [01:08<02:43, 2.74it/s]
Training 1/1 epoch (loss 2.9631): 28%|βββ | 176/625 [01:09<02:43, 2.74it/s]
Training 1/1 epoch (loss 2.9631): 28%|βββ | 177/625 [01:09<02:40, 2.80it/s]
Training 1/1 epoch (loss 2.7783): 28%|βββ | 177/625 [01:09<02:40, 2.80it/s]
Training 1/1 epoch (loss 2.7783): 28%|βββ | 178/625 [01:09<02:34, 2.89it/s]
Training 1/1 epoch (loss 2.8081): 28%|βββ | 178/625 [01:09<02:34, 2.89it/s]
Training 1/1 epoch (loss 2.8081): 29%|βββ | 179/625 [01:09<02:33, 2.91it/s]
Training 1/1 epoch (loss 2.8410): 29%|βββ | 179/625 [01:10<02:33, 2.91it/s]
Training 1/1 epoch (loss 2.8410): 29%|βββ | 180/625 [01:10<02:35, 2.86it/s]
Training 1/1 epoch (loss 2.8966): 29%|βββ | 180/625 [01:10<02:35, 2.86it/s]
Training 1/1 epoch (loss 2.8966): 29%|βββ | 181/625 [01:10<02:35, 2.86it/s]
Training 1/1 epoch (loss 2.9512): 29%|βββ | 181/625 [01:10<02:35, 2.86it/s]
Training 1/1 epoch (loss 2.9512): 29%|βββ | 182/625 [01:10<02:33, 2.89it/s]
Training 1/1 epoch (loss 3.0908): 29%|βββ | 182/625 [01:11<02:33, 2.89it/s]
Training 1/1 epoch (loss 3.0908): 29%|βββ | 183/625 [01:11<02:34, 2.85it/s]
Training 1/1 epoch (loss 2.9080): 29%|βββ | 183/625 [01:11<02:34, 2.85it/s]
Training 1/1 epoch (loss 2.9080): 29%|βββ | 184/625 [01:11<02:32, 2.89it/s]
Training 1/1 epoch (loss 2.7342): 29%|βββ | 184/625 [01:12<02:32, 2.89it/s]
Training 1/1 epoch (loss 2.7342): 30%|βββ | 185/625 [01:12<02:30, 2.92it/s]
Training 1/1 epoch (loss 2.8068): 30%|βββ | 185/625 [01:12<02:30, 2.92it/s]
Training 1/1 epoch (loss 2.8068): 30%|βββ | 186/625 [01:12<02:27, 2.97it/s]
Training 1/1 epoch (loss 3.0225): 30%|βββ | 186/625 [01:12<02:27, 2.97it/s]
Training 1/1 epoch (loss 3.0225): 30%|βββ | 187/625 [01:12<02:31, 2.89it/s]
Training 1/1 epoch (loss 2.6802): 30%|βββ | 187/625 [01:13<02:31, 2.89it/s]
Training 1/1 epoch (loss 2.6802): 30%|βββ | 188/625 [01:13<02:30, 2.91it/s]
Training 1/1 epoch (loss 2.9101): 30%|βββ | 188/625 [01:13<02:30, 2.91it/s]
Training 1/1 epoch (loss 2.9101): 30%|βββ | 189/625 [01:13<02:23, 3.03it/s]
Training 1/1 epoch (loss 3.0885): 30%|βββ | 189/625 [01:13<02:23, 3.03it/s]
Training 1/1 epoch (loss 3.0885): 30%|βββ | 190/625 [01:13<02:21, 3.07it/s]
Training 1/1 epoch (loss 2.8741): 30%|βββ | 190/625 [01:13<02:21, 3.07it/s]
Training 1/1 epoch (loss 2.8741): 31%|βββ | 191/625 [01:13<02:18, 3.13it/s]
Training 1/1 epoch (loss 2.8933): 31%|βββ | 191/625 [01:14<02:18, 3.13it/s]
Training 1/1 epoch (loss 2.8933): 31%|βββ | 192/625 [01:14<02:28, 2.91it/s]
Training 1/1 epoch (loss 2.7871): 31%|βββ | 192/625 [01:14<02:28, 2.91it/s]
Training 1/1 epoch (loss 2.7871): 31%|βββ | 193/625 [01:14<02:29, 2.88it/s]
Training 1/1 epoch (loss 2.8790): 31%|βββ | 193/625 [01:15<02:29, 2.88it/s]
Training 1/1 epoch (loss 2.8790): 31%|βββ | 194/625 [01:15<02:34, 2.79it/s]
Training 1/1 epoch (loss 2.8414): 31%|βββ | 194/625 [01:15<02:34, 2.79it/s]
Training 1/1 epoch (loss 2.8414): 31%|βββ | 195/625 [01:15<02:33, 2.81it/s]
Training 1/1 epoch (loss 3.1511): 31%|βββ | 195/625 [01:15<02:33, 2.81it/s]
Training 1/1 epoch (loss 3.1511): 31%|ββββ | 196/625 [01:15<02:27, 2.91it/s]
Training 1/1 epoch (loss 3.0681): 31%|ββββ | 196/625 [01:16<02:27, 2.91it/s]
Training 1/1 epoch (loss 3.0681): 32%|ββββ | 197/625 [01:16<02:24, 2.97it/s]
Training 1/1 epoch (loss 2.8412): 32%|ββββ | 197/625 [01:16<02:24, 2.97it/s]
Training 1/1 epoch (loss 2.8412): 32%|ββββ | 198/625 [01:16<02:37, 2.72it/s]
Training 1/1 epoch (loss 2.8792): 32%|ββββ | 198/625 [01:16<02:37, 2.72it/s]
Training 1/1 epoch (loss 2.8792): 32%|ββββ | 199/625 [01:16<02:34, 2.75it/s]
Training 1/1 epoch (loss 3.0193): 32%|ββββ | 199/625 [01:17<02:34, 2.75it/s]
Training 1/1 epoch (loss 3.0193): 32%|ββββ | 200/625 [01:17<02:27, 2.88it/s]
Training 1/1 epoch (loss 3.0538): 32%|ββββ | 200/625 [01:17<02:27, 2.88it/s]
Training 1/1 epoch (loss 3.0538): 32%|ββββ | 201/625 [01:17<02:33, 2.76it/s]
Training 1/1 epoch (loss 2.7906): 32%|ββββ | 201/625 [01:17<02:33, 2.76it/s]
Training 1/1 epoch (loss 2.7906): 32%|ββββ | 202/625 [01:17<02:35, 2.73it/s]
Training 1/1 epoch (loss 2.8510): 32%|ββββ | 202/625 [01:18<02:35, 2.73it/s]
Training 1/1 epoch (loss 2.8510): 32%|ββββ | 203/625 [01:18<02:31, 2.78it/s]
Training 1/1 epoch (loss 2.9483): 32%|ββββ | 203/625 [01:18<02:31, 2.78it/s]
Training 1/1 epoch (loss 2.9483): 33%|ββββ | 204/625 [01:18<02:37, 2.68it/s]
Training 1/1 epoch (loss 2.9379): 33%|ββββ | 204/625 [01:19<02:37, 2.68it/s]
Training 1/1 epoch (loss 2.9379): 33%|ββββ | 205/625 [01:19<02:34, 2.72it/s]
Training 1/1 epoch (loss 3.0693): 33%|ββββ | 205/625 [01:19<02:34, 2.72it/s]
Training 1/1 epoch (loss 3.0693): 33%|ββββ | 206/625 [01:19<02:29, 2.80it/s]
Training 1/1 epoch (loss 2.6167): 33%|ββββ | 206/625 [01:19<02:29, 2.80it/s]
Training 1/1 epoch (loss 2.6167): 33%|ββββ | 207/625 [01:19<02:27, 2.83it/s]
Training 1/1 epoch (loss 2.8742): 33%|ββββ | 207/625 [01:20<02:27, 2.83it/s]
Training 1/1 epoch (loss 2.8742): 33%|ββββ | 208/625 [01:20<02:27, 2.83it/s]
Training 1/1 epoch (loss 2.7096): 33%|ββββ | 208/625 [01:20<02:27, 2.83it/s]
Training 1/1 epoch (loss 2.7096): 33%|ββββ | 209/625 [01:20<02:32, 2.73it/s]
Training 1/1 epoch (loss 2.9113): 33%|ββββ | 209/625 [01:20<02:32, 2.73it/s]
Training 1/1 epoch (loss 2.9113): 34%|ββββ | 210/625 [01:20<02:33, 2.70it/s]
Training 1/1 epoch (loss 3.0747): 34%|ββββ | 210/625 [01:21<02:33, 2.70it/s]
Training 1/1 epoch (loss 3.0747): 34%|ββββ | 211/625 [01:21<02:24, 2.86it/s]
Training 1/1 epoch (loss 3.0178): 34%|ββββ | 211/625 [01:21<02:24, 2.86it/s]
Training 1/1 epoch (loss 3.0178): 34%|ββββ | 212/625 [01:21<02:22, 2.90it/s]
Training 1/1 epoch (loss 3.1520): 34%|ββββ | 212/625 [01:21<02:22, 2.90it/s]
Training 1/1 epoch (loss 3.1520): 34%|ββββ | 213/625 [01:21<02:25, 2.82it/s]
Training 1/1 epoch (loss 3.0570): 34%|ββββ | 213/625 [01:22<02:25, 2.82it/s]
Training 1/1 epoch (loss 3.0570): 34%|ββββ | 214/625 [01:22<02:19, 2.94it/s]
Training 1/1 epoch (loss 2.6713): 34%|ββββ | 214/625 [01:22<02:19, 2.94it/s]
Training 1/1 epoch (loss 2.6713): 34%|ββββ | 215/625 [01:22<02:14, 3.05it/s]
Training 1/1 epoch (loss 2.6194): 34%|ββββ | 215/625 [01:22<02:14, 3.05it/s]
Training 1/1 epoch (loss 2.6194): 35%|ββββ | 216/625 [01:22<02:22, 2.88it/s]
Training 1/1 epoch (loss 2.8831): 35%|ββββ | 216/625 [01:23<02:22, 2.88it/s]
Training 1/1 epoch (loss 2.8831): 35%|ββββ | 217/625 [01:23<02:18, 2.94it/s]
Training 1/1 epoch (loss 2.7221): 35%|ββββ | 217/625 [01:23<02:18, 2.94it/s]
Training 1/1 epoch (loss 2.7221): 35%|ββββ | 218/625 [01:23<02:32, 2.67it/s]
Training 1/1 epoch (loss 3.0408): 35%|ββββ | 218/625 [01:24<02:32, 2.67it/s]
Training 1/1 epoch (loss 3.0408): 35%|ββββ | 219/625 [01:24<02:54, 2.33it/s]
Training 1/1 epoch (loss 3.0810): 35%|ββββ | 219/625 [01:24<02:54, 2.33it/s]
Training 1/1 epoch (loss 3.0810): 35%|ββββ | 220/625 [01:24<02:42, 2.50it/s]
Training 1/1 epoch (loss 2.9967): 35%|ββββ | 220/625 [01:24<02:42, 2.50it/s]
Training 1/1 epoch (loss 2.9967): 35%|ββββ | 221/625 [01:24<02:36, 2.58it/s]
Training 1/1 epoch (loss 2.8240): 35%|ββββ | 221/625 [01:25<02:36, 2.58it/s]
Training 1/1 epoch (loss 2.8240): 36%|ββββ | 222/625 [01:25<02:25, 2.76it/s]
Training 1/1 epoch (loss 3.2021): 36%|ββββ | 222/625 [01:25<02:25, 2.76it/s]
Training 1/1 epoch (loss 3.2021): 36%|ββββ | 223/625 [01:25<02:40, 2.50it/s]
Training 1/1 epoch (loss 2.9030): 36%|ββββ | 223/625 [01:26<02:40, 2.50it/s]
Training 1/1 epoch (loss 2.9030): 36%|ββββ | 224/625 [01:26<02:32, 2.63it/s]
Training 1/1 epoch (loss 2.9290): 36%|ββββ | 224/625 [01:26<02:32, 2.63it/s]
Training 1/1 epoch (loss 2.9290): 36%|ββββ | 225/625 [01:26<02:33, 2.61it/s]
Training 1/1 epoch (loss 3.1035): 36%|ββββ | 225/625 [01:26<02:33, 2.61it/s]
Training 1/1 epoch (loss 3.1035): 36%|ββββ | 226/625 [01:26<02:29, 2.67it/s]
Training 1/1 epoch (loss 2.7940): 36%|ββββ | 226/625 [01:27<02:29, 2.67it/s]
Training 1/1 epoch (loss 2.7940): 36%|ββββ | 227/625 [01:27<02:31, 2.63it/s]
Training 1/1 epoch (loss 2.7052): 36%|ββββ | 227/625 [01:27<02:31, 2.63it/s]
Training 1/1 epoch (loss 2.7052): 36%|ββββ | 228/625 [01:27<02:20, 2.82it/s]
Training 1/1 epoch (loss 3.1114): 36%|ββββ | 228/625 [01:27<02:20, 2.82it/s]
Training 1/1 epoch (loss 3.1114): 37%|ββββ | 229/625 [01:27<02:16, 2.90it/s]
Training 1/1 epoch (loss 2.9670): 37%|ββββ | 229/625 [01:28<02:16, 2.90it/s]
Training 1/1 epoch (loss 2.9670): 37%|ββββ | 230/625 [01:28<02:22, 2.77it/s]
Training 1/1 epoch (loss 2.9156): 37%|ββββ | 230/625 [01:28<02:22, 2.77it/s]
Training 1/1 epoch (loss 2.9156): 37%|ββββ | 231/625 [01:28<02:20, 2.79it/s]
Training 1/1 epoch (loss 3.0619): 37%|ββββ | 231/625 [01:28<02:20, 2.79it/s]
Training 1/1 epoch (loss 3.0619): 37%|ββββ | 232/625 [01:28<02:24, 2.72it/s]
Training 1/1 epoch (loss 2.7630): 37%|ββββ | 232/625 [01:29<02:24, 2.72it/s]
Training 1/1 epoch (loss 2.7630): 37%|ββββ | 233/625 [01:29<02:20, 2.80it/s]
Training 1/1 epoch (loss 2.9133): 37%|ββββ | 233/625 [01:29<02:20, 2.80it/s]
Training 1/1 epoch (loss 2.9133): 37%|ββββ | 234/625 [01:29<02:14, 2.91it/s]
Training 1/1 epoch (loss 2.8945): 37%|ββββ | 234/625 [01:29<02:14, 2.91it/s]
Training 1/1 epoch (loss 2.8945): 38%|ββββ | 235/625 [01:29<02:11, 2.97it/s]
Training 1/1 epoch (loss 2.9304): 38%|ββββ | 235/625 [01:30<02:11, 2.97it/s]
Training 1/1 epoch (loss 2.9304): 38%|ββββ | 236/625 [01:30<02:12, 2.94it/s]
Training 1/1 epoch (loss 2.9389): 38%|ββββ | 236/625 [01:30<02:12, 2.94it/s]
Training 1/1 epoch (loss 2.9389): 38%|ββββ | 237/625 [01:30<02:15, 2.87it/s]
Training 1/1 epoch (loss 2.7989): 38%|ββββ | 237/625 [01:30<02:15, 2.87it/s]
Training 1/1 epoch (loss 2.7989): 38%|ββββ | 238/625 [01:30<02:11, 2.93it/s]
Training 1/1 epoch (loss 2.8456): 38%|ββββ | 238/625 [01:31<02:11, 2.93it/s]
Training 1/1 epoch (loss 2.8456): 38%|ββββ | 239/625 [01:31<02:07, 3.02it/s]
Training 1/1 epoch (loss 2.7048): 38%|ββββ | 239/625 [01:31<02:07, 3.02it/s]
Training 1/1 epoch (loss 2.7048): 38%|ββββ | 240/625 [01:31<02:09, 2.96it/s]
Training 1/1 epoch (loss 2.7900): 38%|ββββ | 240/625 [01:31<02:09, 2.96it/s]
Training 1/1 epoch (loss 2.7900): 39%|ββββ | 241/625 [01:31<02:08, 2.98it/s]
Training 1/1 epoch (loss 3.1135): 39%|ββββ | 241/625 [01:32<02:08, 2.98it/s]
Training 1/1 epoch (loss 3.1135): 39%|ββββ | 242/625 [01:32<02:12, 2.89it/s]
Training 1/1 epoch (loss 2.8850): 39%|ββββ | 242/625 [01:32<02:12, 2.89it/s]
Training 1/1 epoch (loss 2.8850): 39%|ββββ | 243/625 [01:32<02:12, 2.87it/s]
Training 1/1 epoch (loss 2.8714): 39%|ββββ | 243/625 [01:32<02:12, 2.87it/s]
Training 1/1 epoch (loss 2.8714): 39%|ββββ | 244/625 [01:32<02:09, 2.94it/s]
Training 1/1 epoch (loss 2.9641): 39%|ββββ | 244/625 [01:33<02:09, 2.94it/s]
Training 1/1 epoch (loss 2.9641): 39%|ββββ | 245/625 [01:33<02:07, 2.99it/s]
Training 1/1 epoch (loss 2.8210): 39%|ββββ | 245/625 [01:33<02:07, 2.99it/s]
Training 1/1 epoch (loss 2.8210): 39%|ββββ | 246/625 [01:33<02:05, 3.01it/s]
Training 1/1 epoch (loss 2.8480): 39%|ββββ | 246/625 [01:33<02:05, 3.01it/s]
Training 1/1 epoch (loss 2.8480): 40%|ββββ | 247/625 [01:33<02:03, 3.05it/s]
Training 1/1 epoch (loss 2.7082): 40%|ββββ | 247/625 [01:34<02:03, 3.05it/s]
Training 1/1 epoch (loss 2.7082): 40%|ββββ | 248/625 [01:34<02:07, 2.97it/s]
Training 1/1 epoch (loss 2.8173): 40%|ββββ | 248/625 [01:34<02:07, 2.97it/s]
Training 1/1 epoch (loss 2.8173): 40%|ββββ | 249/625 [01:34<02:10, 2.88it/s]
Training 1/1 epoch (loss 3.1584): 40%|ββββ | 249/625 [01:35<02:10, 2.88it/s]
Training 1/1 epoch (loss 3.1584): 40%|ββββ | 250/625 [01:35<02:13, 2.82it/s]
Training 1/1 epoch (loss 2.8013): 40%|ββββ | 250/625 [01:35<02:13, 2.82it/s]
Training 1/1 epoch (loss 2.8013): 40%|ββββ | 251/625 [01:35<02:05, 2.98it/s]
Training 1/1 epoch (loss 3.0233): 40%|ββββ | 251/625 [01:35<02:05, 2.98it/s]
Training 1/1 epoch (loss 3.0233): 40%|ββββ | 252/625 [01:35<02:04, 2.99it/s]
Training 1/1 epoch (loss 2.9415): 40%|ββββ | 252/625 [01:35<02:04, 2.99it/s]
Training 1/1 epoch (loss 2.9415): 40%|ββββ | 253/625 [01:35<02:00, 3.08it/s]
Training 1/1 epoch (loss 2.7488): 40%|ββββ | 253/625 [01:36<02:00, 3.08it/s]
Training 1/1 epoch (loss 2.7488): 41%|ββββ | 254/625 [01:36<02:03, 3.01it/s]
Training 1/1 epoch (loss 2.9373): 41%|ββββ | 254/625 [01:36<02:03, 3.01it/s]
Training 1/1 epoch (loss 2.9373): 41%|ββββ | 255/625 [01:36<02:05, 2.95it/s]
Training 1/1 epoch (loss 2.8492): 41%|ββββ | 255/625 [01:37<02:05, 2.95it/s]
Training 1/1 epoch (loss 2.8492): 41%|ββββ | 256/625 [01:37<02:04, 2.96it/s]
Training 1/1 epoch (loss 2.7330): 41%|ββββ | 256/625 [01:37<02:04, 2.96it/s]
Training 1/1 epoch (loss 2.7330): 41%|ββββ | 257/625 [01:37<02:07, 2.88it/s]
Training 1/1 epoch (loss 2.7910): 41%|ββββ | 257/625 [01:37<02:07, 2.88it/s]
Training 1/1 epoch (loss 2.7910): 41%|βββββ | 258/625 [01:37<02:07, 2.88it/s]
Training 1/1 epoch (loss 3.0964): 41%|βββββ | 258/625 [01:38<02:07, 2.88it/s]
Training 1/1 epoch (loss 3.0964): 41%|βββββ | 259/625 [01:38<02:06, 2.90it/s]
Training 1/1 epoch (loss 2.9284): 41%|βββββ | 259/625 [01:38<02:06, 2.90it/s]
Training 1/1 epoch (loss 2.9284): 42%|βββββ | 260/625 [01:38<02:11, 2.78it/s]
Training 1/1 epoch (loss 2.8488): 42%|βββββ | 260/625 [01:38<02:11, 2.78it/s]
Training 1/1 epoch (loss 2.8488): 42%|βββββ | 261/625 [01:38<02:19, 2.62it/s]
Training 1/1 epoch (loss 2.9464): 42%|βββββ | 261/625 [01:39<02:19, 2.62it/s]
Training 1/1 epoch (loss 2.9464): 42%|βββββ | 262/625 [01:39<02:11, 2.76it/s]
Training 1/1 epoch (loss 2.8296): 42%|βββββ | 262/625 [01:39<02:11, 2.76it/s]
Training 1/1 epoch (loss 2.8296): 42%|βββββ | 263/625 [01:39<02:03, 2.93it/s]
Training 1/1 epoch (loss 2.5757): 42%|βββββ | 263/625 [01:39<02:03, 2.93it/s]
Training 1/1 epoch (loss 2.5757): 42%|βββββ | 264/625 [01:39<02:02, 2.96it/s]
Training 1/1 epoch (loss 2.7696): 42%|βββββ | 264/625 [01:40<02:02, 2.96it/s]
Training 1/1 epoch (loss 2.7696): 42%|βββββ | 265/625 [01:40<02:02, 2.94it/s]
Training 1/1 epoch (loss 2.6879): 42%|βββββ | 265/625 [01:40<02:02, 2.94it/s]
Training 1/1 epoch (loss 2.6879): 43%|βββββ | 266/625 [01:40<02:05, 2.85it/s]
Training 1/1 epoch (loss 2.9676): 43%|βββββ | 266/625 [01:40<02:05, 2.85it/s]
Training 1/1 epoch (loss 2.9676): 43%|βββββ | 267/625 [01:40<02:07, 2.81it/s]
Training 1/1 epoch (loss 2.8130): 43%|βββββ | 267/625 [01:41<02:07, 2.81it/s]
Training 1/1 epoch (loss 2.8130): 43%|βββββ | 268/625 [01:41<02:03, 2.90it/s]
Training 1/1 epoch (loss 2.9101): 43%|βββββ | 268/625 [01:41<02:03, 2.90it/s]
Training 1/1 epoch (loss 2.9101): 43%|βββββ | 269/625 [01:41<02:02, 2.90it/s]
Training 1/1 epoch (loss 2.7579): 43%|βββββ | 269/625 [01:41<02:02, 2.90it/s]
Training 1/1 epoch (loss 2.7579): 43%|βββββ | 270/625 [01:41<01:54, 3.09it/s]
Training 1/1 epoch (loss 2.9286): 43%|βββββ | 270/625 [01:42<01:54, 3.09it/s]
Training 1/1 epoch (loss 2.9286): 43%|βββββ | 271/625 [01:42<01:56, 3.04it/s]
Training 1/1 epoch (loss 2.8555): 43%|βββββ | 271/625 [01:42<01:56, 3.04it/s]
Training 1/1 epoch (loss 2.8555): 44%|βββββ | 272/625 [01:42<01:57, 3.01it/s]
Training 1/1 epoch (loss 2.8910): 44%|βββββ | 272/625 [01:43<01:57, 3.01it/s]
Training 1/1 epoch (loss 2.8910): 44%|βββββ | 273/625 [01:43<02:09, 2.72it/s]
Training 1/1 epoch (loss 2.7419): 44%|βββββ | 273/625 [01:43<02:09, 2.72it/s]
Training 1/1 epoch (loss 2.7419): 44%|βββββ | 274/625 [01:43<02:03, 2.85it/s]
Training 1/1 epoch (loss 2.6593): 44%|βββββ | 274/625 [01:43<02:03, 2.85it/s]
Training 1/1 epoch (loss 2.6593): 44%|βββββ | 275/625 [01:43<01:57, 2.98it/s]
Training 1/1 epoch (loss 2.7345): 44%|βββββ | 275/625 [01:43<01:57, 2.98it/s]
Training 1/1 epoch (loss 2.7345): 44%|βββββ | 276/625 [01:43<01:54, 3.05it/s]
Training 1/1 epoch (loss 2.8554): 44%|βββββ | 276/625 [01:44<01:54, 3.05it/s]
Training 1/1 epoch (loss 2.8554): 44%|βββββ | 277/625 [01:44<01:55, 3.02it/s]
Training 1/1 epoch (loss 2.8825): 44%|βββββ | 277/625 [01:44<01:55, 3.02it/s]
Training 1/1 epoch (loss 2.8825): 44%|βββββ | 278/625 [01:44<01:54, 3.02it/s]
Training 1/1 epoch (loss 2.8773): 44%|βββββ | 278/625 [01:44<01:54, 3.02it/s]
Training 1/1 epoch (loss 2.8773): 45%|βββββ | 279/625 [01:44<01:54, 3.03it/s]
Training 1/1 epoch (loss 2.8234): 45%|βββββ | 279/625 [01:45<01:54, 3.03it/s]
Training 1/1 epoch (loss 2.8234): 45%|βββββ | 280/625 [01:45<02:02, 2.83it/s]
Training 1/1 epoch (loss 3.0202): 45%|βββββ | 280/625 [01:45<02:02, 2.83it/s]
Training 1/1 epoch (loss 3.0202): 45%|βββββ | 281/625 [01:45<02:00, 2.85it/s]
Training 1/1 epoch (loss 2.9086): 45%|βββββ | 281/625 [01:45<02:00, 2.85it/s]
Training 1/1 epoch (loss 2.9086): 45%|βββββ | 282/625 [01:45<01:57, 2.92it/s]
Training 1/1 epoch (loss 2.8511): 45%|βββββ | 282/625 [01:46<01:57, 2.92it/s]
Training 1/1 epoch (loss 2.8511): 45%|βββββ | 283/625 [01:46<01:58, 2.88it/s]
Training 1/1 epoch (loss 3.0290): 45%|βββββ | 283/625 [01:46<01:58, 2.88it/s]
Training 1/1 epoch (loss 3.0290): 45%|βββββ | 284/625 [01:46<01:58, 2.88it/s]
Training 1/1 epoch (loss 2.7807): 45%|βββββ | 284/625 [01:47<01:58, 2.88it/s]
Training 1/1 epoch (loss 2.7807): 46%|βββββ | 285/625 [01:47<02:02, 2.77it/s]
Training 1/1 epoch (loss 2.8170): 46%|βββββ | 285/625 [01:47<02:02, 2.77it/s]
Training 1/1 epoch (loss 2.8170): 46%|βββββ | 286/625 [01:47<01:58, 2.86it/s]
Training 1/1 epoch (loss 2.8256): 46%|βββββ | 286/625 [01:47<01:58, 2.86it/s]
Training 1/1 epoch (loss 2.8256): 46%|βββββ | 287/625 [01:47<01:55, 2.91it/s]
Training 1/1 epoch (loss 2.6855): 46%|βββββ | 287/625 [01:48<01:55, 2.91it/s]
Training 1/1 epoch (loss 2.6855): 46%|βββββ | 288/625 [01:48<02:07, 2.65it/s]
Training 1/1 epoch (loss 2.8864): 46%|βββββ | 288/625 [01:48<02:07, 2.65it/s]
Training 1/1 epoch (loss 2.8864): 46%|βββββ | 289/625 [01:48<02:03, 2.71it/s]
Training 1/1 epoch (loss 3.0540): 46%|βββββ | 289/625 [01:48<02:03, 2.71it/s]
Training 1/1 epoch (loss 3.0540): 46%|βββββ | 290/625 [01:48<02:06, 2.65it/s]
Training 1/1 epoch (loss 2.8701): 46%|βββββ | 290/625 [01:49<02:06, 2.65it/s]
Training 1/1 epoch (loss 2.8701): 47%|βββββ | 291/625 [01:49<02:00, 2.77it/s]
Training 1/1 epoch (loss 3.2075): 47%|βββββ | 291/625 [01:49<02:00, 2.77it/s]
Training 1/1 epoch (loss 3.2075): 47%|βββββ | 292/625 [01:49<01:57, 2.82it/s]
Training 1/1 epoch (loss 2.9390): 47%|βββββ | 292/625 [01:49<01:57, 2.82it/s]
Training 1/1 epoch (loss 2.9390): 47%|βββββ | 293/625 [01:49<01:56, 2.86it/s]
Training 1/1 epoch (loss 2.6657): 47%|βββββ | 293/625 [01:50<01:56, 2.86it/s]
Training 1/1 epoch (loss 2.6657): 47%|βββββ | 294/625 [01:50<01:53, 2.91it/s]
Training 1/1 epoch (loss 2.8654): 47%|βββββ | 294/625 [01:50<01:53, 2.91it/s]
Training 1/1 epoch (loss 2.8654): 47%|βββββ | 295/625 [01:50<01:52, 2.94it/s]
Training 1/1 epoch (loss 2.7386): 47%|βββββ | 295/625 [01:51<01:52, 2.94it/s]
Training 1/1 epoch (loss 2.7386): 47%|βββββ | 296/625 [01:51<02:00, 2.73it/s]
Training 1/1 epoch (loss 2.9759): 47%|βββββ | 296/625 [01:51<02:00, 2.73it/s]
Training 1/1 epoch (loss 2.9759): 48%|βββββ | 297/625 [01:51<01:58, 2.78it/s]
Training 1/1 epoch (loss 2.6290): 48%|βββββ | 297/625 [01:51<01:58, 2.78it/s]
Training 1/1 epoch (loss 2.6290): 48%|βββββ | 298/625 [01:51<01:54, 2.85it/s]
Training 1/1 epoch (loss 2.4788): 48%|βββββ | 298/625 [01:52<01:54, 2.85it/s]
Training 1/1 epoch (loss 2.4788): 48%|βββββ | 299/625 [01:52<01:52, 2.90it/s]
Training 1/1 epoch (loss 2.7123): 48%|βββββ | 299/625 [01:52<01:52, 2.90it/s]
Training 1/1 epoch (loss 2.7123): 48%|βββββ | 300/625 [01:52<01:48, 2.98it/s]
Training 1/1 epoch (loss 2.7751): 48%|βββββ | 300/625 [01:52<01:48, 2.98it/s]
Training 1/1 epoch (loss 2.7751): 48%|βββββ | 301/625 [01:52<01:51, 2.90it/s]
Training 1/1 epoch (loss 2.8971): 48%|βββββ | 301/625 [01:53<01:51, 2.90it/s]
Training 1/1 epoch (loss 2.8971): 48%|βββββ | 302/625 [01:53<01:55, 2.80it/s]
Training 1/1 epoch (loss 2.7301): 48%|βββββ | 302/625 [01:53<01:55, 2.80it/s]
Training 1/1 epoch (loss 2.7301): 48%|βββββ | 303/625 [01:53<02:07, 2.53it/s]
Training 1/1 epoch (loss 2.9733): 48%|βββββ | 303/625 [01:54<02:07, 2.53it/s]
Training 1/1 epoch (loss 2.9733): 49%|βββββ | 304/625 [01:54<02:22, 2.26it/s]
Training 1/1 epoch (loss 2.7999): 49%|βββββ | 304/625 [01:54<02:22, 2.26it/s]
Training 1/1 epoch (loss 2.7999): 49%|βββββ | 305/625 [01:54<02:16, 2.35it/s]
Training 1/1 epoch (loss 2.6789): 49%|βββββ | 305/625 [01:54<02:16, 2.35it/s]
Training 1/1 epoch (loss 2.6789): 49%|βββββ | 306/625 [01:54<02:08, 2.49it/s]
Training 1/1 epoch (loss 3.0435): 49%|βββββ | 306/625 [01:55<02:08, 2.49it/s]
Training 1/1 epoch (loss 3.0435): 49%|βββββ | 307/625 [01:55<02:04, 2.56it/s]
Training 1/1 epoch (loss 2.8550): 49%|βββββ | 307/625 [01:55<02:04, 2.56it/s]
Training 1/1 epoch (loss 2.8550): 49%|βββββ | 308/625 [01:55<02:03, 2.56it/s]
Training 1/1 epoch (loss 2.8375): 49%|βββββ | 308/625 [01:56<02:03, 2.56it/s]
Training 1/1 epoch (loss 2.8375): 49%|βββββ | 309/625 [01:56<02:12, 2.38it/s]
Training 1/1 epoch (loss 2.7100): 49%|βββββ | 309/625 [01:56<02:12, 2.38it/s]
Training 1/1 epoch (loss 2.7100): 50%|βββββ | 310/625 [01:56<02:18, 2.28it/s]
Training 1/1 epoch (loss 2.6405): 50%|βββββ | 310/625 [01:56<02:18, 2.28it/s]
Training 1/1 epoch (loss 2.6405): 50%|βββββ | 311/625 [01:56<02:07, 2.47it/s]
Training 1/1 epoch (loss 2.9408): 50%|βββββ | 311/625 [01:57<02:07, 2.47it/s]
Training 1/1 epoch (loss 2.9408): 50%|βββββ | 312/625 [01:57<02:01, 2.57it/s]
Training 1/1 epoch (loss 2.7525): 50%|βββββ | 312/625 [01:57<02:01, 2.57it/s]
Training 1/1 epoch (loss 2.7525): 50%|βββββ | 313/625 [01:57<01:58, 2.64it/s]
Training 1/1 epoch (loss 2.7530): 50%|βββββ | 313/625 [01:58<01:58, 2.64it/s]
Training 1/1 epoch (loss 2.7530): 50%|βββββ | 314/625 [01:58<01:56, 2.67it/s]
Training 1/1 epoch (loss 3.0355): 50%|βββββ | 314/625 [01:58<01:56, 2.67it/s]
Training 1/1 epoch (loss 3.0355): 50%|βββββ | 315/625 [01:58<01:59, 2.59it/s]
Training 1/1 epoch (loss 2.9396): 50%|βββββ | 315/625 [01:58<01:59, 2.59it/s]
Training 1/1 epoch (loss 2.9396): 51%|βββββ | 316/625 [01:58<01:53, 2.72it/s]
Training 1/1 epoch (loss 2.8896): 51%|βββββ | 316/625 [01:59<01:53, 2.72it/s]
Training 1/1 epoch (loss 2.8896): 51%|βββββ | 317/625 [01:59<02:02, 2.52it/s]
Training 1/1 epoch (loss 2.7484): 51%|βββββ | 317/625 [01:59<02:02, 2.52it/s]
Training 1/1 epoch (loss 2.7484): 51%|βββββ | 318/625 [01:59<01:53, 2.71it/s]
Training 1/1 epoch (loss 2.9326): 51%|βββββ | 318/625 [01:59<01:53, 2.71it/s]
Training 1/1 epoch (loss 2.9326): 51%|βββββ | 319/625 [01:59<01:47, 2.85it/s]
Training 1/1 epoch (loss 2.7682): 51%|βββββ | 319/625 [02:00<01:47, 2.85it/s]
Training 1/1 epoch (loss 2.7682): 51%|βββββ | 320/625 [02:00<01:53, 2.68it/s]
Training 1/1 epoch (loss 2.8991): 51%|βββββ | 320/625 [02:00<01:53, 2.68it/s]
Training 1/1 epoch (loss 2.8991): 51%|ββββββ | 321/625 [02:00<01:47, 2.83it/s]
Training 1/1 epoch (loss 2.8205): 51%|ββββββ | 321/625 [02:00<01:47, 2.83it/s]
Training 1/1 epoch (loss 2.8205): 52%|ββββββ | 322/625 [02:00<01:47, 2.81it/s]
Training 1/1 epoch (loss 2.8870): 52%|ββββββ | 322/625 [02:01<01:47, 2.81it/s]
Training 1/1 epoch (loss 2.8870): 52%|ββββββ | 323/625 [02:01<01:44, 2.90it/s]
Training 1/1 epoch (loss 2.8745): 52%|ββββββ | 323/625 [02:01<01:44, 2.90it/s]
Training 1/1 epoch (loss 2.8745): 52%|ββββββ | 324/625 [02:01<01:43, 2.90it/s]
Training 1/1 epoch (loss 3.0285): 52%|ββββββ | 324/625 [02:01<01:43, 2.90it/s]
Training 1/1 epoch (loss 3.0285): 52%|ββββββ | 325/625 [02:01<01:41, 2.95it/s]
Training 1/1 epoch (loss 2.7211): 52%|ββββββ | 325/625 [02:02<01:41, 2.95it/s]
Training 1/1 epoch (loss 2.7211): 52%|ββββββ | 326/625 [02:02<01:39, 3.00it/s]
Training 1/1 epoch (loss 2.8275): 52%|ββββββ | 326/625 [02:02<01:39, 3.00it/s]
Training 1/1 epoch (loss 2.8275): 52%|ββββββ | 327/625 [02:02<01:43, 2.88it/s]
Training 1/1 epoch (loss 2.7601): 52%|ββββββ | 327/625 [02:02<01:43, 2.88it/s]
Training 1/1 epoch (loss 2.7601): 52%|ββββββ | 328/625 [02:02<01:40, 2.94it/s]
Training 1/1 epoch (loss 2.7470): 52%|ββββββ | 328/625 [02:03<01:40, 2.94it/s]
Training 1/1 epoch (loss 2.7470): 53%|ββββββ | 329/625 [02:03<01:44, 2.84it/s]
Training 1/1 epoch (loss 2.7938): 53%|ββββββ | 329/625 [02:03<01:44, 2.84it/s]
Training 1/1 epoch (loss 2.7938): 53%|ββββββ | 330/625 [02:03<01:42, 2.87it/s]
Training 1/1 epoch (loss 2.8559): 53%|ββββββ | 330/625 [02:03<01:42, 2.87it/s]
Training 1/1 epoch (loss 2.8559): 53%|ββββββ | 331/625 [02:03<01:37, 3.01it/s]
Training 1/1 epoch (loss 3.0565): 53%|ββββββ | 331/625 [02:04<01:37, 3.01it/s]
Training 1/1 epoch (loss 3.0565): 53%|ββββββ | 332/625 [02:04<01:35, 3.05it/s]
Training 1/1 epoch (loss 2.6913): 53%|ββββββ | 332/625 [02:04<01:35, 3.05it/s]
Training 1/1 epoch (loss 2.6913): 53%|ββββββ | 333/625 [02:04<01:39, 2.93it/s]
Training 1/1 epoch (loss 2.8509): 53%|ββββββ | 333/625 [02:04<01:39, 2.93it/s]
Training 1/1 epoch (loss 2.8509): 53%|ββββββ | 334/625 [02:04<01:40, 2.89it/s]
Training 1/1 epoch (loss 2.7224): 53%|ββββββ | 334/625 [02:05<01:40, 2.89it/s]
Training 1/1 epoch (loss 2.7224): 54%|ββββββ | 335/625 [02:05<01:39, 2.91it/s]
Training 1/1 epoch (loss 2.9649): 54%|ββββββ | 335/625 [02:05<01:39, 2.91it/s]
Training 1/1 epoch (loss 2.9649): 54%|ββββββ | 336/625 [02:05<01:38, 2.94it/s]
Training 1/1 epoch (loss 2.9490): 54%|ββββββ | 336/625 [02:05<01:38, 2.94it/s]
Training 1/1 epoch (loss 2.9490): 54%|ββββββ | 337/625 [02:05<01:36, 2.99it/s]
Training 1/1 epoch (loss 2.9024): 54%|ββββββ | 337/625 [02:06<01:36, 2.99it/s]
Training 1/1 epoch (loss 2.9024): 54%|ββββββ | 338/625 [02:06<01:34, 3.04it/s]
Training 1/1 epoch (loss 2.7625): 54%|ββββββ | 338/625 [02:06<01:34, 3.04it/s]
Training 1/1 epoch (loss 2.7625): 54%|ββββββ | 339/625 [02:06<01:37, 2.94it/s]
Training 1/1 epoch (loss 2.9434): 54%|ββββββ | 339/625 [02:07<01:37, 2.94it/s]
Training 1/1 epoch (loss 2.9434): 54%|ββββββ | 340/625 [02:07<01:39, 2.88it/s]
Training 1/1 epoch (loss 2.6812): 54%|ββββββ | 340/625 [02:07<01:39, 2.88it/s]
Training 1/1 epoch (loss 2.6812): 55%|ββββββ | 341/625 [02:07<01:35, 2.97it/s]
Training 1/1 epoch (loss 2.7866): 55%|ββββββ | 341/625 [02:07<01:35, 2.97it/s]
Training 1/1 epoch (loss 2.7866): 55%|ββββββ | 342/625 [02:07<01:35, 2.95it/s]
Training 1/1 epoch (loss 2.7744): 55%|ββββββ | 342/625 [02:08<01:35, 2.95it/s]
Training 1/1 epoch (loss 2.7744): 55%|ββββββ | 343/625 [02:08<01:34, 2.98it/s]
Training 1/1 epoch (loss 2.6939): 55%|ββββββ | 343/625 [02:08<01:34, 2.98it/s]
Training 1/1 epoch (loss 2.6939): 55%|ββββββ | 344/625 [02:08<01:38, 2.85it/s]
Training 1/1 epoch (loss 2.7828): 55%|ββββββ | 344/625 [02:08<01:38, 2.85it/s]
Training 1/1 epoch (loss 2.7828): 55%|ββββββ | 345/625 [02:08<01:42, 2.73it/s]
Training 1/1 epoch (loss 2.6326): 55%|ββββββ | 345/625 [02:09<01:42, 2.73it/s]
Training 1/1 epoch (loss 2.6326): 55%|ββββββ | 346/625 [02:09<01:46, 2.63it/s]
Training 1/1 epoch (loss 2.8010): 55%|ββββββ | 346/625 [02:09<01:46, 2.63it/s]
Training 1/1 epoch (loss 2.8010): 56%|ββββββ | 347/625 [02:09<01:41, 2.73it/s]
Training 1/1 epoch (loss 3.0749): 56%|ββββββ | 347/625 [02:09<01:41, 2.73it/s]
Training 1/1 epoch (loss 3.0749): 56%|ββββββ | 348/625 [02:09<01:38, 2.81it/s]
Training 1/1 epoch (loss 2.7890): 56%|ββββββ | 348/625 [02:10<01:38, 2.81it/s]
Training 1/1 epoch (loss 2.7890): 56%|ββββββ | 349/625 [02:10<01:35, 2.89it/s]
Training 1/1 epoch (loss 2.8778): 56%|ββββββ | 349/625 [02:10<01:35, 2.89it/s]
Training 1/1 epoch (loss 2.8778): 56%|ββββββ | 350/625 [02:10<01:34, 2.90it/s]
Training 1/1 epoch (loss 2.6726): 56%|ββββββ | 350/625 [02:10<01:34, 2.90it/s]
Training 1/1 epoch (loss 2.6726): 56%|ββββββ | 351/625 [02:10<01:36, 2.83it/s]
Training 1/1 epoch (loss 3.0540): 56%|ββββββ | 351/625 [02:11<01:36, 2.83it/s]
Training 1/1 epoch (loss 3.0540): 56%|ββββββ | 352/625 [02:11<01:36, 2.84it/s]
Training 1/1 epoch (loss 3.2049): 56%|ββββββ | 352/625 [02:11<01:36, 2.84it/s]
Training 1/1 epoch (loss 3.2049): 56%|ββββββ | 353/625 [02:11<01:35, 2.86it/s]
Training 1/1 epoch (loss 2.8681): 56%|ββββββ | 353/625 [02:11<01:35, 2.86it/s]
Training 1/1 epoch (loss 2.8681): 57%|ββββββ | 354/625 [02:11<01:29, 3.01it/s]
Training 1/1 epoch (loss 2.8644): 57%|ββββββ | 354/625 [02:12<01:29, 3.01it/s]
Training 1/1 epoch (loss 2.8644): 57%|ββββββ | 355/625 [02:12<01:31, 2.95it/s]
Training 1/1 epoch (loss 2.7387): 57%|ββββββ | 355/625 [02:12<01:31, 2.95it/s]
Training 1/1 epoch (loss 2.7387): 57%|ββββββ | 356/625 [02:12<01:31, 2.95it/s]
Training 1/1 epoch (loss 2.9670): 57%|ββββββ | 356/625 [02:12<01:31, 2.95it/s]
Training 1/1 epoch (loss 2.9670): 57%|ββββββ | 357/625 [02:12<01:32, 2.90it/s]
Training 1/1 epoch (loss 2.7463): 57%|ββββββ | 357/625 [02:13<01:32, 2.90it/s]
Training 1/1 epoch (loss 2.7463): 57%|ββββββ | 358/625 [02:13<01:28, 3.01it/s]
Training 1/1 epoch (loss 3.0121): 57%|ββββββ | 358/625 [02:13<01:28, 3.01it/s]
Training 1/1 epoch (loss 3.0121): 57%|ββββββ | 359/625 [02:13<01:27, 3.05it/s]
Training 1/1 epoch (loss 2.6433): 57%|ββββββ | 359/625 [02:13<01:27, 3.05it/s]
Training 1/1 epoch (loss 2.6433): 58%|ββββββ | 360/625 [02:13<01:26, 3.06it/s]
Training 1/1 epoch (loss 2.8228): 58%|ββββββ | 360/625 [02:14<01:26, 3.06it/s]
Training 1/1 epoch (loss 2.8228): 58%|ββββββ | 361/625 [02:14<01:33, 2.82it/s]
Training 1/1 epoch (loss 2.7380): 58%|ββββββ | 361/625 [02:14<01:33, 2.82it/s]
Training 1/1 epoch (loss 2.7380): 58%|ββββββ | 362/625 [02:14<01:31, 2.86it/s]
Training 1/1 epoch (loss 2.8398): 58%|ββββββ | 362/625 [02:15<01:31, 2.86it/s]
Training 1/1 epoch (loss 2.8398): 58%|ββββββ | 363/625 [02:15<01:35, 2.76it/s]
Training 1/1 epoch (loss 2.8425): 58%|ββββββ | 363/625 [02:15<01:35, 2.76it/s]
Training 1/1 epoch (loss 2.8425): 58%|ββββββ | 364/625 [02:15<01:34, 2.77it/s]
Training 1/1 epoch (loss 2.6096): 58%|ββββββ | 364/625 [02:15<01:34, 2.77it/s]
Training 1/1 epoch (loss 2.6096): 58%|ββββββ | 365/625 [02:15<01:33, 2.79it/s]
Training 1/1 epoch (loss 2.7548): 58%|ββββββ | 365/625 [02:16<01:33, 2.79it/s]
Training 1/1 epoch (loss 2.7548): 59%|ββββββ | 366/625 [02:16<01:28, 2.92it/s]
Training 1/1 epoch (loss 2.7297): 59%|ββββββ | 366/625 [02:16<01:28, 2.92it/s]
Training 1/1 epoch (loss 2.7297): 59%|ββββββ | 367/625 [02:16<01:28, 2.92it/s]
Training 1/1 epoch (loss 2.7244): 59%|ββββββ | 367/625 [02:16<01:28, 2.92it/s]
Training 1/1 epoch (loss 2.7244): 59%|ββββββ | 368/625 [02:16<01:29, 2.87it/s]
Training 1/1 epoch (loss 2.6339): 59%|ββββββ | 368/625 [02:17<01:29, 2.87it/s]
Training 1/1 epoch (loss 2.6339): 59%|ββββββ | 369/625 [02:17<01:30, 2.82it/s]
Training 1/1 epoch (loss 2.8481): 59%|ββββββ | 369/625 [02:17<01:30, 2.82it/s]
Training 1/1 epoch (loss 2.8481): 59%|ββββββ | 370/625 [02:17<01:29, 2.84it/s]
Training 1/1 epoch (loss 2.8717): 59%|ββββββ | 370/625 [02:17<01:29, 2.84it/s]
Training 1/1 epoch (loss 2.8717): 59%|ββββββ | 371/625 [02:17<01:29, 2.84it/s]
Training 1/1 epoch (loss 2.9800): 59%|ββββββ | 371/625 [02:18<01:29, 2.84it/s]
Training 1/1 epoch (loss 2.9800): 60%|ββββββ | 372/625 [02:18<01:27, 2.89it/s]
Training 1/1 epoch (loss 2.7415): 60%|ββββββ | 372/625 [02:18<01:27, 2.89it/s]
Training 1/1 epoch (loss 2.7415): 60%|ββββββ | 373/625 [02:18<01:27, 2.87it/s]
Training 1/1 epoch (loss 2.9108): 60%|ββββββ | 373/625 [02:18<01:27, 2.87it/s]
Training 1/1 epoch (loss 2.9108): 60%|ββββββ | 374/625 [02:18<01:28, 2.84it/s]
Training 1/1 epoch (loss 2.8502): 60%|ββββββ | 374/625 [02:19<01:28, 2.84it/s]
Training 1/1 epoch (loss 2.8502): 60%|ββββββ | 375/625 [02:19<01:29, 2.79it/s]
Training 1/1 epoch (loss 2.7281): 60%|ββββββ | 375/625 [02:19<01:29, 2.79it/s]
Training 1/1 epoch (loss 2.7281): 60%|ββββββ | 376/625 [02:19<01:40, 2.47it/s]
Training 1/1 epoch (loss 2.6492): 60%|ββββββ | 376/625 [02:20<01:40, 2.47it/s]
Training 1/1 epoch (loss 2.6492): 60%|ββββββ | 377/625 [02:20<01:34, 2.62it/s]
Training 1/1 epoch (loss 2.6770): 60%|ββββββ | 377/625 [02:20<01:34, 2.62it/s]
Training 1/1 epoch (loss 2.6770): 60%|ββββββ | 378/625 [02:20<01:32, 2.67it/s]
Training 1/1 epoch (loss 2.9395): 60%|ββββββ | 378/625 [02:20<01:32, 2.67it/s]
Training 1/1 epoch (loss 2.9395): 61%|ββββββ | 379/625 [02:20<01:28, 2.79it/s]
Training 1/1 epoch (loss 2.6852): 61%|ββββββ | 379/625 [02:21<01:28, 2.79it/s]
Training 1/1 epoch (loss 2.6852): 61%|ββββββ | 380/625 [02:21<01:26, 2.85it/s]
Training 1/1 epoch (loss 2.9518): 61%|ββββββ | 380/625 [02:21<01:26, 2.85it/s]
Training 1/1 epoch (loss 2.9518): 61%|ββββββ | 381/625 [02:21<01:23, 2.92it/s]
Training 1/1 epoch (loss 2.7560): 61%|ββββββ | 381/625 [02:21<01:23, 2.92it/s]
Training 1/1 epoch (loss 2.7560): 61%|ββββββ | 382/625 [02:21<01:23, 2.90it/s]
Training 1/1 epoch (loss 2.9357): 61%|ββββββ | 382/625 [02:22<01:23, 2.90it/s]
Training 1/1 epoch (loss 2.9357): 61%|βββββββ | 383/625 [02:22<01:20, 3.01it/s]
Training 1/1 epoch (loss 2.7042): 61%|βββββββ | 383/625 [02:22<01:20, 3.01it/s]
Training 1/1 epoch (loss 2.7042): 61%|βββββββ | 384/625 [02:22<01:21, 2.97it/s]
Training 1/1 epoch (loss 2.7122): 61%|βββββββ | 384/625 [02:22<01:21, 2.97it/s]
Training 1/1 epoch (loss 2.7122): 62%|βββββββ | 385/625 [02:22<01:22, 2.92it/s]
Training 1/1 epoch (loss 2.8466): 62%|βββββββ | 385/625 [02:23<01:22, 2.92it/s]
Training 1/1 epoch (loss 2.8466): 62%|βββββββ | 386/625 [02:23<01:23, 2.86it/s]
Training 1/1 epoch (loss 2.8726): 62%|βββββββ | 386/625 [02:23<01:23, 2.86it/s]
Training 1/1 epoch (loss 2.8726): 62%|βββββββ | 387/625 [02:23<01:45, 2.25it/s]
Training 1/1 epoch (loss 2.8319): 62%|βββββββ | 387/625 [02:24<01:45, 2.25it/s]
Training 1/1 epoch (loss 2.8319): 62%|βββββββ | 388/625 [02:24<01:43, 2.30it/s]
Training 1/1 epoch (loss 2.6759): 62%|βββββββ | 388/625 [02:24<01:43, 2.30it/s]
Training 1/1 epoch (loss 2.6759): 62%|βββββββ | 389/625 [02:24<01:39, 2.38it/s]
Training 1/1 epoch (loss 2.8952): 62%|βββββββ | 389/625 [02:24<01:39, 2.38it/s]
Training 1/1 epoch (loss 2.8952): 62%|βββββββ | 390/625 [02:24<01:34, 2.48it/s]
Training 1/1 epoch (loss 2.8186): 62%|βββββββ | 390/625 [02:25<01:34, 2.48it/s]
Training 1/1 epoch (loss 2.8186): 63%|βββββββ | 391/625 [02:25<01:29, 2.62it/s]
Training 1/1 epoch (loss 2.9721): 63%|βββββββ | 391/625 [02:25<01:29, 2.62it/s]
Training 1/1 epoch (loss 2.9721): 63%|βββββββ | 392/625 [02:25<01:32, 2.52it/s]
Training 1/1 epoch (loss 2.7893): 63%|βββββββ | 392/625 [02:26<01:32, 2.52it/s]
Training 1/1 epoch (loss 2.7893): 63%|βββββββ | 393/625 [02:26<01:28, 2.61it/s]
Training 1/1 epoch (loss 2.7919): 63%|βββββββ | 393/625 [02:26<01:28, 2.61it/s]
Training 1/1 epoch (loss 2.7919): 63%|βββββββ | 394/625 [02:26<01:26, 2.68it/s]
Training 1/1 epoch (loss 2.8539): 63%|βββββββ | 394/625 [02:26<01:26, 2.68it/s]
Training 1/1 epoch (loss 2.8539): 63%|βββββββ | 395/625 [02:26<01:23, 2.75it/s]
Training 1/1 epoch (loss 2.6532): 63%|βββββββ | 395/625 [02:27<01:23, 2.75it/s]
Training 1/1 epoch (loss 2.6532): 63%|βββββββ | 396/625 [02:27<01:23, 2.75it/s]
Training 1/1 epoch (loss 2.8154): 63%|βββββββ | 396/625 [02:27<01:23, 2.75it/s]
Training 1/1 epoch (loss 2.8154): 64%|βββββββ | 397/625 [02:27<01:19, 2.86it/s]
Training 1/1 epoch (loss 2.7874): 64%|βββββββ | 397/625 [02:27<01:19, 2.86it/s]
Training 1/1 epoch (loss 2.7874): 64%|βββββββ | 398/625 [02:27<01:18, 2.89it/s]
Training 1/1 epoch (loss 2.7940): 64%|βββββββ | 398/625 [02:28<01:18, 2.89it/s]
Training 1/1 epoch (loss 2.7940): 64%|βββββββ | 399/625 [02:28<01:14, 3.03it/s]
Training 1/1 epoch (loss 2.6816): 64%|βββββββ | 399/625 [02:28<01:14, 3.03it/s]
Training 1/1 epoch (loss 2.6816): 64%|βββββββ | 400/625 [02:28<01:17, 2.89it/s]
Training 1/1 epoch (loss 2.7575): 64%|βββββββ | 400/625 [02:28<01:17, 2.89it/s]
Training 1/1 epoch (loss 2.7575): 64%|βββββββ | 401/625 [02:28<01:17, 2.87it/s]
Training 1/1 epoch (loss 2.8556): 64%|βββββββ | 401/625 [02:29<01:17, 2.87it/s]
Training 1/1 epoch (loss 2.8556): 64%|βββββββ | 402/625 [02:29<01:19, 2.80it/s]
Training 1/1 epoch (loss 3.0696): 64%|βββββββ | 402/625 [02:29<01:19, 2.80it/s]
Training 1/1 epoch (loss 3.0696): 64%|βββββββ | 403/625 [02:29<01:16, 2.90it/s]
Training 1/1 epoch (loss 2.8554): 64%|βββββββ | 403/625 [02:29<01:16, 2.90it/s]
Training 1/1 epoch (loss 2.8554): 65%|βββββββ | 404/625 [02:29<01:19, 2.77it/s]
Training 1/1 epoch (loss 2.7960): 65%|βββββββ | 404/625 [02:30<01:19, 2.77it/s]
Training 1/1 epoch (loss 2.7960): 65%|βββββββ | 405/625 [02:30<01:21, 2.71it/s]
Training 1/1 epoch (loss 2.7661): 65%|βββββββ | 405/625 [02:30<01:21, 2.71it/s]
Training 1/1 epoch (loss 2.7661): 65%|βββββββ | 406/625 [02:30<01:21, 2.67it/s]
Training 1/1 epoch (loss 2.8643): 65%|βββββββ | 406/625 [02:31<01:21, 2.67it/s]
Training 1/1 epoch (loss 2.8643): 65%|βββββββ | 407/625 [02:31<01:17, 2.80it/s]
Training 1/1 epoch (loss 2.8489): 65%|βββββββ | 407/625 [02:31<01:17, 2.80it/s]
Training 1/1 epoch (loss 2.8489): 65%|βββββββ | 408/625 [02:31<01:16, 2.83it/s]
Training 1/1 epoch (loss 2.9108): 65%|βββββββ | 408/625 [02:31<01:16, 2.83it/s]
Training 1/1 epoch (loss 2.9108): 65%|βββββββ | 409/625 [02:31<01:14, 2.92it/s]
Training 1/1 epoch (loss 3.0654): 65%|βββββββ | 409/625 [02:31<01:14, 2.92it/s]
Training 1/1 epoch (loss 3.0654): 66%|βββββββ | 410/625 [02:31<01:10, 3.04it/s]
Training 1/1 epoch (loss 2.4783): 66%|βββββββ | 410/625 [02:32<01:10, 3.04it/s]
Training 1/1 epoch (loss 2.4783): 66%|βββββββ | 411/625 [02:32<01:11, 3.01it/s]
Training 1/1 epoch (loss 2.8513): 66%|βββββββ | 411/625 [02:32<01:11, 3.01it/s]
Training 1/1 epoch (loss 2.8513): 66%|βββββββ | 412/625 [02:32<01:12, 2.92it/s]
Training 1/1 epoch (loss 2.8908): 66%|βββββββ | 412/625 [02:33<01:12, 2.92it/s]
Training 1/1 epoch (loss 2.8908): 66%|βββββββ | 413/625 [02:33<01:11, 2.97it/s]
Training 1/1 epoch (loss 2.9881): 66%|βββββββ | 413/625 [02:33<01:11, 2.97it/s]
Training 1/1 epoch (loss 2.9881): 66%|βββββββ | 414/625 [02:33<01:09, 3.04it/s]
Training 1/1 epoch (loss 2.7992): 66%|βββββββ | 414/625 [02:33<01:09, 3.04it/s]
Training 1/1 epoch (loss 2.7992): 66%|βββββββ | 415/625 [02:33<01:10, 2.99it/s]
Training 1/1 epoch (loss 2.5900): 66%|βββββββ | 415/625 [02:33<01:10, 2.99it/s]
Training 1/1 epoch (loss 2.5900): 67%|βββββββ | 416/625 [02:33<01:09, 3.02it/s]
Training 1/1 epoch (loss 3.0126): 67%|βββββββ | 416/625 [02:34<01:09, 3.02it/s]
Training 1/1 epoch (loss 3.0126): 67%|βββββββ | 417/625 [02:34<01:11, 2.92it/s]
Training 1/1 epoch (loss 2.8174): 67%|βββββββ | 417/625 [02:34<01:11, 2.92it/s]
Training 1/1 epoch (loss 2.8174): 67%|βββββββ | 418/625 [02:34<01:11, 2.90it/s]
Training 1/1 epoch (loss 2.9031): 67%|βββββββ | 418/625 [02:35<01:11, 2.90it/s]
Training 1/1 epoch (loss 2.9031): 67%|βββββββ | 419/625 [02:35<01:11, 2.88it/s]
Training 1/1 epoch (loss 2.8303): 67%|βββββββ | 419/625 [02:35<01:11, 2.88it/s]
Training 1/1 epoch (loss 2.8303): 67%|βββββββ | 420/625 [02:35<01:15, 2.73it/s]
Training 1/1 epoch (loss 2.6609): 67%|βββββββ | 420/625 [02:35<01:15, 2.73it/s]
Training 1/1 epoch (loss 2.6609): 67%|βββββββ | 421/625 [02:35<01:12, 2.83it/s]
Training 1/1 epoch (loss 2.7924): 67%|βββββββ | 421/625 [02:36<01:12, 2.83it/s]
Training 1/1 epoch (loss 2.7924): 68%|βββββββ | 422/625 [02:36<01:09, 2.93it/s]
Training 1/1 epoch (loss 2.8266): 68%|βββββββ | 422/625 [02:36<01:09, 2.93it/s]
Training 1/1 epoch (loss 2.8266): 68%|βββββββ | 423/625 [02:36<01:09, 2.90it/s]
Training 1/1 epoch (loss 3.0523): 68%|βββββββ | 423/625 [02:36<01:09, 2.90it/s]
Training 1/1 epoch (loss 3.0523): 68%|βββββββ | 424/625 [02:36<01:07, 2.97it/s]
Training 1/1 epoch (loss 2.8294): 68%|βββββββ | 424/625 [02:37<01:07, 2.97it/s]
Training 1/1 epoch (loss 2.8294): 68%|βββββββ | 425/625 [02:37<01:10, 2.85it/s]
Training 1/1 epoch (loss 2.8115): 68%|βββββββ | 425/625 [02:37<01:10, 2.85it/s]
Training 1/1 epoch (loss 2.8115): 68%|βββββββ | 426/625 [02:37<01:07, 2.95it/s]
Training 1/1 epoch (loss 2.8019): 68%|βββββββ | 426/625 [02:37<01:07, 2.95it/s]
Training 1/1 epoch (loss 2.8019): 68%|βββββββ | 427/625 [02:37<01:07, 2.94it/s]
Training 1/1 epoch (loss 2.7163): 68%|βββββββ | 427/625 [02:38<01:07, 2.94it/s]
Training 1/1 epoch (loss 2.7163): 68%|βββββββ | 428/625 [02:38<01:04, 3.05it/s]
Training 1/1 epoch (loss 2.8808): 68%|βββββββ | 428/625 [02:38<01:04, 3.05it/s]
Training 1/1 epoch (loss 2.8808): 69%|βββββββ | 429/625 [02:38<01:05, 3.01it/s]
Training 1/1 epoch (loss 3.1122): 69%|βββββββ | 429/625 [02:38<01:05, 3.01it/s]
Training 1/1 epoch (loss 3.1122): 69%|βββββββ | 430/625 [02:38<01:06, 2.95it/s]
Training 1/1 epoch (loss 2.7987): 69%|βββββββ | 430/625 [02:39<01:06, 2.95it/s]
Training 1/1 epoch (loss 2.7987): 69%|βββββββ | 431/625 [02:39<01:08, 2.85it/s]
Training 1/1 epoch (loss 2.4683): 69%|βββββββ | 431/625 [02:39<01:08, 2.85it/s]
Training 1/1 epoch (loss 2.4683): 69%|βββββββ | 432/625 [02:39<01:07, 2.87it/s]
Training 1/1 epoch (loss 2.4847): 69%|βββββββ | 432/625 [02:39<01:07, 2.87it/s]
Training 1/1 epoch (loss 2.4847): 69%|βββββββ | 433/625 [02:39<01:05, 2.92it/s]
Training 1/1 epoch (loss 2.7610): 69%|βββββββ | 433/625 [02:40<01:05, 2.92it/s]
Training 1/1 epoch (loss 2.7610): 69%|βββββββ | 434/625 [02:40<01:07, 2.82it/s]
Training 1/1 epoch (loss 2.9591): 69%|βββββββ | 434/625 [02:40<01:07, 2.82it/s]
Training 1/1 epoch (loss 2.9591): 70%|βββββββ | 435/625 [02:40<01:11, 2.67it/s]
Training 1/1 epoch (loss 2.8945): 70%|βββββββ | 435/625 [02:41<01:11, 2.67it/s]
Training 1/1 epoch (loss 2.8945): 70%|βββββββ | 436/625 [02:41<01:09, 2.71it/s]
Training 1/1 epoch (loss 2.7559): 70%|βββββββ | 436/625 [02:41<01:09, 2.71it/s]
Training 1/1 epoch (loss 2.7559): 70%|βββββββ | 437/625 [02:41<01:07, 2.77it/s]
Training 1/1 epoch (loss 3.1034): 70%|βββββββ | 437/625 [02:41<01:07, 2.77it/s]
Training 1/1 epoch (loss 3.1034): 70%|βββββββ | 438/625 [02:41<01:06, 2.82it/s]
Training 1/1 epoch (loss 2.8478): 70%|βββββββ | 438/625 [02:42<01:06, 2.82it/s]
Training 1/1 epoch (loss 2.8478): 70%|βββββββ | 439/625 [02:42<01:03, 2.94it/s]
Training 1/1 epoch (loss 2.9272): 70%|βββββββ | 439/625 [02:42<01:03, 2.94it/s]
Training 1/1 epoch (loss 2.9272): 70%|βββββββ | 440/625 [02:42<01:07, 2.74it/s]
Training 1/1 epoch (loss 2.7873): 70%|βββββββ | 440/625 [02:42<01:07, 2.74it/s]
Training 1/1 epoch (loss 2.7873): 71%|βββββββ | 441/625 [02:42<01:10, 2.61it/s]
Training 1/1 epoch (loss 2.9090): 71%|βββββββ | 441/625 [02:43<01:10, 2.61it/s]
Training 1/1 epoch (loss 2.9090): 71%|βββββββ | 442/625 [02:43<01:08, 2.66it/s]
Training 1/1 epoch (loss 2.7945): 71%|βββββββ | 442/625 [02:43<01:08, 2.66it/s]
Training 1/1 epoch (loss 2.7945): 71%|βββββββ | 443/625 [02:43<01:05, 2.76it/s]
Training 1/1 epoch (loss 2.7390): 71%|βββββββ | 443/625 [02:43<01:05, 2.76it/s]
Training 1/1 epoch (loss 2.7390): 71%|βββββββ | 444/625 [02:43<01:06, 2.72it/s]
Training 1/1 epoch (loss 2.7445): 71%|βββββββ | 444/625 [02:44<01:06, 2.72it/s]
Training 1/1 epoch (loss 2.7445): 71%|βββββββ | 445/625 [02:44<01:06, 2.72it/s]
Training 1/1 epoch (loss 2.9483): 71%|βββββββ | 445/625 [02:44<01:06, 2.72it/s]
Training 1/1 epoch (loss 2.9483): 71%|ββββββββ | 446/625 [02:44<01:07, 2.66it/s]
Training 1/1 epoch (loss 2.9519): 71%|ββββββββ | 446/625 [02:45<01:07, 2.66it/s]
Training 1/1 epoch (loss 2.9519): 72%|ββββββββ | 447/625 [02:45<01:10, 2.53it/s]
Training 1/1 epoch (loss 2.7765): 72%|ββββββββ | 447/625 [02:45<01:10, 2.53it/s]
Training 1/1 epoch (loss 2.7765): 72%|ββββββββ | 448/625 [02:45<01:10, 2.50it/s]
Training 1/1 epoch (loss 2.9640): 72%|ββββββββ | 448/625 [02:45<01:10, 2.50it/s]
Training 1/1 epoch (loss 2.9640): 72%|ββββββββ | 449/625 [02:45<01:10, 2.49it/s]
Training 1/1 epoch (loss 2.7502): 72%|ββββββββ | 449/625 [02:46<01:10, 2.49it/s]
Training 1/1 epoch (loss 2.7502): 72%|ββββββββ | 450/625 [02:46<01:07, 2.60it/s]
Training 1/1 epoch (loss 2.6374): 72%|ββββββββ | 450/625 [02:46<01:07, 2.60it/s]
Training 1/1 epoch (loss 2.6374): 72%|ββββββββ | 451/625 [02:46<01:06, 2.62it/s]
Training 1/1 epoch (loss 2.7334): 72%|ββββββββ | 451/625 [02:46<01:06, 2.62it/s]
Training 1/1 epoch (loss 2.7334): 72%|ββββββββ | 452/625 [02:46<01:02, 2.77it/s]
Training 1/1 epoch (loss 2.6804): 72%|ββββββββ | 452/625 [02:47<01:02, 2.77it/s]
Training 1/1 epoch (loss 2.6804): 72%|ββββββββ | 453/625 [02:47<01:00, 2.84it/s]
Training 1/1 epoch (loss 2.6186): 72%|ββββββββ | 453/625 [02:47<01:00, 2.84it/s]
Training 1/1 epoch (loss 2.6186): 73%|ββββββββ | 454/625 [02:47<01:00, 2.83it/s]
Training 1/1 epoch (loss 2.9206): 73%|ββββββββ | 454/625 [02:48<01:00, 2.83it/s]
Training 1/1 epoch (loss 2.9206): 73%|ββββββββ | 455/625 [02:48<01:00, 2.83it/s]
Training 1/1 epoch (loss 2.8221): 73%|ββββββββ | 455/625 [02:48<01:00, 2.83it/s]
Training 1/1 epoch (loss 2.8221): 73%|ββββββββ | 456/625 [02:48<00:59, 2.86it/s]
Training 1/1 epoch (loss 2.9662): 73%|ββββββββ | 456/625 [02:48<00:59, 2.86it/s]
Training 1/1 epoch (loss 2.9662): 73%|ββββββββ | 457/625 [02:48<01:03, 2.64it/s]
Training 1/1 epoch (loss 2.8639): 73%|ββββββββ | 457/625 [02:49<01:03, 2.64it/s]
Training 1/1 epoch (loss 2.8639): 73%|ββββββββ | 458/625 [02:49<01:03, 2.62it/s]
Training 1/1 epoch (loss 2.7692): 73%|ββββββββ | 458/625 [02:49<01:03, 2.62it/s]
Training 1/1 epoch (loss 2.7692): 73%|ββββββββ | 459/625 [02:49<01:01, 2.70it/s]
Training 1/1 epoch (loss 2.5410): 73%|ββββββββ | 459/625 [02:49<01:01, 2.70it/s]
Training 1/1 epoch (loss 2.5410): 74%|ββββββββ | 460/625 [02:49<00:58, 2.80it/s]
Training 1/1 epoch (loss 3.0195): 74%|ββββββββ | 460/625 [02:50<00:58, 2.80it/s]
Training 1/1 epoch (loss 3.0195): 74%|ββββββββ | 461/625 [02:50<00:58, 2.81it/s]
Training 1/1 epoch (loss 2.9179): 74%|ββββββββ | 461/625 [02:50<00:58, 2.81it/s]
Training 1/1 epoch (loss 2.9179): 74%|ββββββββ | 462/625 [02:50<00:58, 2.81it/s]
Training 1/1 epoch (loss 2.7612): 74%|ββββββββ | 462/625 [02:50<00:58, 2.81it/s]
Training 1/1 epoch (loss 2.7612): 74%|ββββββββ | 463/625 [02:50<00:58, 2.76it/s]
Training 1/1 epoch (loss 2.9953): 74%|ββββββββ | 463/625 [02:51<00:58, 2.76it/s]
Training 1/1 epoch (loss 2.9953): 74%|ββββββββ | 464/625 [02:51<00:58, 2.73it/s]
Training 1/1 epoch (loss 2.8104): 74%|ββββββββ | 464/625 [02:51<00:58, 2.73it/s]
Training 1/1 epoch (loss 2.8104): 74%|ββββββββ | 465/625 [02:51<00:55, 2.87it/s]
Training 1/1 epoch (loss 2.9536): 74%|ββββββββ | 465/625 [02:51<00:55, 2.87it/s]
Training 1/1 epoch (loss 2.9536): 75%|ββββββββ | 466/625 [02:51<00:54, 2.91it/s]
Training 1/1 epoch (loss 2.7333): 75%|ββββββββ | 466/625 [02:52<00:54, 2.91it/s]
Training 1/1 epoch (loss 2.7333): 75%|ββββββββ | 467/625 [02:52<00:53, 2.96it/s]
Training 1/1 epoch (loss 2.7844): 75%|ββββββββ | 467/625 [02:52<00:53, 2.96it/s]
Training 1/1 epoch (loss 2.7844): 75%|ββββββββ | 468/625 [02:52<00:53, 2.94it/s]
Training 1/1 epoch (loss 3.0574): 75%|ββββββββ | 468/625 [02:53<00:53, 2.94it/s]
Training 1/1 epoch (loss 3.0574): 75%|ββββββββ | 469/625 [02:53<00:55, 2.82it/s]
Training 1/1 epoch (loss 2.7936): 75%|ββββββββ | 469/625 [02:53<00:55, 2.82it/s]
Training 1/1 epoch (loss 2.7936): 75%|ββββββββ | 470/625 [02:53<00:55, 2.77it/s]
Training 1/1 epoch (loss 2.7412): 75%|ββββββββ | 470/625 [02:53<00:55, 2.77it/s]
Training 1/1 epoch (loss 2.7412): 75%|ββββββββ | 471/625 [02:53<01:00, 2.53it/s]
Training 1/1 epoch (loss 3.0290): 75%|ββββββββ | 471/625 [02:54<01:00, 2.53it/s]
Training 1/1 epoch (loss 3.0290): 76%|ββββββββ | 472/625 [02:54<01:05, 2.35it/s]
Training 1/1 epoch (loss 2.5504): 76%|ββββββββ | 472/625 [02:54<01:05, 2.35it/s]
Training 1/1 epoch (loss 2.5504): 76%|ββββββββ | 473/625 [02:54<01:02, 2.45it/s]
Training 1/1 epoch (loss 2.9463): 76%|ββββββββ | 473/625 [02:55<01:02, 2.45it/s]
Training 1/1 epoch (loss 2.9463): 76%|ββββββββ | 474/625 [02:55<00:59, 2.56it/s]
Training 1/1 epoch (loss 3.0401): 76%|ββββββββ | 474/625 [02:55<00:59, 2.56it/s]
Training 1/1 epoch (loss 3.0401): 76%|ββββββββ | 475/625 [02:55<00:59, 2.51it/s]
Training 1/1 epoch (loss 3.0732): 76%|ββββββββ | 475/625 [02:55<00:59, 2.51it/s]
Training 1/1 epoch (loss 3.0732): 76%|ββββββββ | 476/625 [02:55<00:56, 2.65it/s]
Training 1/1 epoch (loss 2.8242): 76%|ββββββββ | 476/625 [02:56<00:56, 2.65it/s]
Training 1/1 epoch (loss 2.8242): 76%|ββββββββ | 477/625 [02:56<00:53, 2.75it/s]
Training 1/1 epoch (loss 2.7439): 76%|ββββββββ | 477/625 [02:56<00:53, 2.75it/s]
Training 1/1 epoch (loss 2.7439): 76%|ββββββββ | 478/625 [02:56<00:55, 2.66it/s]
Training 1/1 epoch (loss 2.7988): 76%|ββββββββ | 478/625 [02:56<00:55, 2.66it/s]
Training 1/1 epoch (loss 2.7988): 77%|ββββββββ | 479/625 [02:56<00:52, 2.78it/s]
Training 1/1 epoch (loss 2.8290): 77%|ββββββββ | 479/625 [02:57<00:52, 2.78it/s]
Training 1/1 epoch (loss 2.8290): 77%|ββββββββ | 480/625 [02:57<00:51, 2.80it/s]
Training 1/1 epoch (loss 2.8109): 77%|ββββββββ | 480/625 [02:57<00:51, 2.80it/s]
Training 1/1 epoch (loss 2.8109): 77%|ββββββββ | 481/625 [02:57<00:51, 2.78it/s]
Training 1/1 epoch (loss 2.5858): 77%|ββββββββ | 481/625 [02:57<00:51, 2.78it/s]
Training 1/1 epoch (loss 2.5858): 77%|ββββββββ | 482/625 [02:57<00:50, 2.85it/s]
Training 1/1 epoch (loss 2.7331): 77%|ββββββββ | 482/625 [02:58<00:50, 2.85it/s]
Training 1/1 epoch (loss 2.7331): 77%|ββββββββ | 483/625 [02:58<00:48, 2.94it/s]
Training 1/1 epoch (loss 2.7650): 77%|ββββββββ | 483/625 [02:58<00:48, 2.94it/s]
Training 1/1 epoch (loss 2.7650): 77%|ββββββββ | 484/625 [02:58<00:48, 2.90it/s]
Training 1/1 epoch (loss 2.7569): 77%|ββββββββ | 484/625 [02:58<00:48, 2.90it/s]
Training 1/1 epoch (loss 2.7569): 78%|ββββββββ | 485/625 [02:58<00:48, 2.88it/s]
Training 1/1 epoch (loss 2.7927): 78%|ββββββββ | 485/625 [02:59<00:48, 2.88it/s]
Training 1/1 epoch (loss 2.7927): 78%|ββββββββ | 486/625 [02:59<00:49, 2.78it/s]
Training 1/1 epoch (loss 2.8837): 78%|ββββββββ | 486/625 [02:59<00:49, 2.78it/s]
Training 1/1 epoch (loss 2.8837): 78%|ββββββββ | 487/625 [02:59<00:47, 2.89it/s]
Training 1/1 epoch (loss 2.7278): 78%|ββββββββ | 487/625 [03:00<00:47, 2.89it/s]
Training 1/1 epoch (loss 2.7278): 78%|ββββββββ | 488/625 [03:00<00:46, 2.92it/s]
Training 1/1 epoch (loss 2.7559): 78%|ββββββββ | 488/625 [03:00<00:46, 2.92it/s]
Training 1/1 epoch (loss 2.7559): 78%|ββββββββ | 489/625 [03:00<00:46, 2.95it/s]
Training 1/1 epoch (loss 2.8310): 78%|ββββββββ | 489/625 [03:00<00:46, 2.95it/s]
Training 1/1 epoch (loss 2.8310): 78%|ββββββββ | 490/625 [03:00<00:47, 2.86it/s]
Training 1/1 epoch (loss 2.8122): 78%|ββββββββ | 490/625 [03:01<00:47, 2.86it/s]
Training 1/1 epoch (loss 2.8122): 79%|ββββββββ | 491/625 [03:01<00:47, 2.85it/s]
Training 1/1 epoch (loss 2.6936): 79%|ββββββββ | 491/625 [03:01<00:47, 2.85it/s]
Training 1/1 epoch (loss 2.6936): 79%|ββββββββ | 492/625 [03:01<00:46, 2.84it/s]
Training 1/1 epoch (loss 2.7491): 79%|ββββββββ | 492/625 [03:01<00:46, 2.84it/s]
Training 1/1 epoch (loss 2.7491): 79%|ββββββββ | 493/625 [03:01<00:46, 2.84it/s]
Training 1/1 epoch (loss 2.5865): 79%|ββββββββ | 493/625 [03:02<00:46, 2.84it/s]
Training 1/1 epoch (loss 2.5865): 79%|ββββββββ | 494/625 [03:02<00:44, 2.95it/s]
Training 1/1 epoch (loss 2.5514): 79%|ββββββββ | 494/625 [03:02<00:44, 2.95it/s]
Training 1/1 epoch (loss 2.5514): 79%|ββββββββ | 495/625 [03:02<00:43, 3.01it/s]
Training 1/1 epoch (loss 2.7625): 79%|ββββββββ | 495/625 [03:02<00:43, 3.01it/s]
Training 1/1 epoch (loss 2.7625): 79%|ββββββββ | 496/625 [03:02<00:44, 2.91it/s]
Training 1/1 epoch (loss 2.8739): 79%|ββββββββ | 496/625 [03:03<00:44, 2.91it/s]
Training 1/1 epoch (loss 2.8739): 80%|ββββββββ | 497/625 [03:03<00:46, 2.78it/s]
Training 1/1 epoch (loss 2.7449): 80%|ββββββββ | 497/625 [03:03<00:46, 2.78it/s]
Training 1/1 epoch (loss 2.7449): 80%|ββββββββ | 498/625 [03:03<00:44, 2.83it/s]
Training 1/1 epoch (loss 2.9907): 80%|ββββββββ | 498/625 [03:03<00:44, 2.83it/s]
Training 1/1 epoch (loss 2.9907): 80%|ββββββββ | 499/625 [03:03<00:42, 2.95it/s]
Training 1/1 epoch (loss 2.7084): 80%|ββββββββ | 499/625 [03:04<00:42, 2.95it/s]
Training 1/1 epoch (loss 2.7084): 80%|ββββββββ | 500/625 [03:04<00:41, 3.04it/s]
Training 1/1 epoch (loss 2.7993): 80%|ββββββββ | 500/625 [03:04<00:41, 3.04it/s]
Training 1/1 epoch (loss 2.7993): 80%|ββββββββ | 501/625 [03:04<00:43, 2.88it/s]
Training 1/1 epoch (loss 2.7875): 80%|ββββββββ | 501/625 [03:04<00:43, 2.88it/s]
Training 1/1 epoch (loss 2.7875): 80%|ββββββββ | 502/625 [03:04<00:42, 2.89it/s]
Training 1/1 epoch (loss 2.7844): 80%|ββββββββ | 502/625 [03:05<00:42, 2.89it/s]
Training 1/1 epoch (loss 2.7844): 80%|ββββββββ | 503/625 [03:05<00:45, 2.67it/s]
Training 1/1 epoch (loss 2.9316): 80%|ββββββββ | 503/625 [03:05<00:45, 2.67it/s]
Training 1/1 epoch (loss 2.9316): 81%|ββββββββ | 504/625 [03:05<00:46, 2.61it/s]
Training 1/1 epoch (loss 2.8487): 81%|ββββββββ | 504/625 [03:06<00:46, 2.61it/s]
Training 1/1 epoch (loss 2.8487): 81%|ββββββββ | 505/625 [03:06<00:45, 2.66it/s]
Training 1/1 epoch (loss 2.9062): 81%|ββββββββ | 505/625 [03:06<00:45, 2.66it/s]
Training 1/1 epoch (loss 2.9062): 81%|ββββββββ | 506/625 [03:06<00:43, 2.77it/s]
Training 1/1 epoch (loss 2.6058): 81%|ββββββββ | 506/625 [03:06<00:43, 2.77it/s]
Training 1/1 epoch (loss 2.6058): 81%|ββββββββ | 507/625 [03:06<00:40, 2.90it/s]
Training 1/1 epoch (loss 2.6438): 81%|ββββββββ | 507/625 [03:07<00:40, 2.90it/s]
Training 1/1 epoch (loss 2.6438): 81%|βββββββββ | 508/625 [03:07<00:41, 2.85it/s]
Training 1/1 epoch (loss 2.7247): 81%|βββββββββ | 508/625 [03:07<00:41, 2.85it/s]
Training 1/1 epoch (loss 2.7247): 81%|βββββββββ | 509/625 [03:07<00:40, 2.88it/s]
Training 1/1 epoch (loss 2.7595): 81%|βββββββββ | 509/625 [03:07<00:40, 2.88it/s]
Training 1/1 epoch (loss 2.7595): 82%|βββββββββ | 510/625 [03:07<00:39, 2.93it/s]
Training 1/1 epoch (loss 2.9722): 82%|βββββββββ | 510/625 [03:08<00:39, 2.93it/s]
Training 1/1 epoch (loss 2.9722): 82%|βββββββββ | 511/625 [03:08<00:37, 3.01it/s]
Training 1/1 epoch (loss 2.5620): 82%|βββββββββ | 511/625 [03:08<00:37, 3.01it/s]
Training 1/1 epoch (loss 2.5620): 82%|βββββββββ | 512/625 [03:08<00:37, 3.02it/s]
Training 1/1 epoch (loss 2.5723): 82%|βββββββββ | 512/625 [03:08<00:37, 3.02it/s]
Training 1/1 epoch (loss 2.5723): 82%|βββββββββ | 513/625 [03:08<00:39, 2.86it/s]
Training 1/1 epoch (loss 2.7609): 82%|βββββββββ | 513/625 [03:09<00:39, 2.86it/s]
Training 1/1 epoch (loss 2.7609): 82%|βββββββββ | 514/625 [03:09<00:38, 2.88it/s]
Training 1/1 epoch (loss 2.7482): 82%|βββββββββ | 514/625 [03:09<00:38, 2.88it/s]
Training 1/1 epoch (loss 2.7482): 82%|βββββββββ | 515/625 [03:09<00:39, 2.82it/s]
Training 1/1 epoch (loss 2.9090): 82%|βββββββββ | 515/625 [03:09<00:39, 2.82it/s]
Training 1/1 epoch (loss 2.9090): 83%|βββββββββ | 516/625 [03:09<00:37, 2.91it/s]
Training 1/1 epoch (loss 2.7319): 83%|βββββββββ | 516/625 [03:10<00:37, 2.91it/s]
Training 1/1 epoch (loss 2.7319): 83%|βββββββββ | 517/625 [03:10<00:35, 3.06it/s]
Training 1/1 epoch (loss 2.9256): 83%|βββββββββ | 517/625 [03:10<00:35, 3.06it/s]
Training 1/1 epoch (loss 2.9256): 83%|βββββββββ | 518/625 [03:10<00:36, 2.96it/s]
Training 1/1 epoch (loss 2.9337): 83%|βββββββββ | 518/625 [03:10<00:36, 2.96it/s]
Training 1/1 epoch (loss 2.9337): 83%|βββββββββ | 519/625 [03:10<00:36, 2.91it/s]
Training 1/1 epoch (loss 2.9572): 83%|βββββββββ | 519/625 [03:11<00:36, 2.91it/s]
Training 1/1 epoch (loss 2.9572): 83%|βββββββββ | 520/625 [03:11<00:39, 2.66it/s]
Training 1/1 epoch (loss 2.9138): 83%|βββββββββ | 520/625 [03:11<00:39, 2.66it/s]
Training 1/1 epoch (loss 2.9138): 83%|βββββββββ | 521/625 [03:11<00:38, 2.67it/s]
Training 1/1 epoch (loss 2.7466): 83%|βββββββββ | 521/625 [03:11<00:38, 2.67it/s]
Training 1/1 epoch (loss 2.7466): 84%|βββββββββ | 522/625 [03:11<00:36, 2.79it/s]
Training 1/1 epoch (loss 2.8385): 84%|βββββββββ | 522/625 [03:12<00:36, 2.79it/s]
Training 1/1 epoch (loss 2.8385): 84%|βββββββββ | 523/625 [03:12<00:35, 2.86it/s]
Training 1/1 epoch (loss 2.7083): 84%|βββββββββ | 523/625 [03:12<00:35, 2.86it/s]
Training 1/1 epoch (loss 2.7083): 84%|βββββββββ | 524/625 [03:12<00:34, 2.94it/s]
Training 1/1 epoch (loss 2.8809): 84%|βββββββββ | 524/625 [03:12<00:34, 2.94it/s]
Training 1/1 epoch (loss 2.8809): 84%|βββββββββ | 525/625 [03:12<00:32, 3.07it/s]
Training 1/1 epoch (loss 3.0197): 84%|βββββββββ | 525/625 [03:13<00:32, 3.07it/s]
Training 1/1 epoch (loss 3.0197): 84%|βββββββββ | 526/625 [03:13<00:33, 2.95it/s]
Training 1/1 epoch (loss 2.8566): 84%|βββββββββ | 526/625 [03:13<00:33, 2.95it/s]
Training 1/1 epoch (loss 2.8566): 84%|βββββββββ | 527/625 [03:13<00:33, 2.91it/s]
Training 1/1 epoch (loss 2.7612): 84%|βββββββββ | 527/625 [03:13<00:33, 2.91it/s]
Training 1/1 epoch (loss 2.7612): 84%|βββββββββ | 528/625 [03:13<00:32, 2.96it/s]
Training 1/1 epoch (loss 2.7683): 84%|βββββββββ | 528/625 [03:14<00:32, 2.96it/s]
Training 1/1 epoch (loss 2.7683): 85%|βββββββββ | 529/625 [03:14<00:32, 2.92it/s]
Training 1/1 epoch (loss 2.7322): 85%|βββββββββ | 529/625 [03:14<00:32, 2.92it/s]
Training 1/1 epoch (loss 2.7322): 85%|βββββββββ | 530/625 [03:14<00:33, 2.86it/s]
Training 1/1 epoch (loss 2.9428): 85%|βββββββββ | 530/625 [03:14<00:33, 2.86it/s]
Training 1/1 epoch (loss 2.9428): 85%|βββββββββ | 531/625 [03:14<00:31, 2.95it/s]
Training 1/1 epoch (loss 2.7373): 85%|βββββββββ | 531/625 [03:15<00:31, 2.95it/s]
Training 1/1 epoch (loss 2.7373): 85%|βββββββββ | 532/625 [03:15<00:32, 2.82it/s]
Training 1/1 epoch (loss 3.0109): 85%|βββββββββ | 532/625 [03:15<00:32, 2.82it/s]
Training 1/1 epoch (loss 3.0109): 85%|βββββββββ | 533/625 [03:15<00:31, 2.89it/s]
Training 1/1 epoch (loss 2.9986): 85%|βββββββββ | 533/625 [03:15<00:31, 2.89it/s]
Training 1/1 epoch (loss 2.9986): 85%|βββββββββ | 534/625 [03:15<00:31, 2.93it/s]
Training 1/1 epoch (loss 2.6860): 85%|βββββββββ | 534/625 [03:16<00:31, 2.93it/s]
Training 1/1 epoch (loss 2.6860): 86%|βββββββββ | 535/625 [03:16<00:30, 2.98it/s]
Training 1/1 epoch (loss 2.8362): 86%|βββββββββ | 535/625 [03:16<00:30, 2.98it/s]
Training 1/1 epoch (loss 2.8362): 86%|βββββββββ | 536/625 [03:16<00:30, 2.90it/s]
Training 1/1 epoch (loss 2.9195): 86%|βββββββββ | 536/625 [03:17<00:30, 2.90it/s]
Training 1/1 epoch (loss 2.9195): 86%|βββββββββ | 537/625 [03:17<00:31, 2.81it/s]
Training 1/1 epoch (loss 2.7081): 86%|βββββββββ | 537/625 [03:17<00:31, 2.81it/s]
Training 1/1 epoch (loss 2.7081): 86%|βββββββββ | 538/625 [03:17<00:32, 2.66it/s]
Training 1/1 epoch (loss 2.7439): 86%|βββββββββ | 538/625 [03:17<00:32, 2.66it/s]
Training 1/1 epoch (loss 2.7439): 86%|βββββββββ | 539/625 [03:17<00:31, 2.77it/s]
Training 1/1 epoch (loss 2.6518): 86%|βββββββββ | 539/625 [03:18<00:31, 2.77it/s]
Training 1/1 epoch (loss 2.6518): 86%|βββββββββ | 540/625 [03:18<00:29, 2.88it/s]
Training 1/1 epoch (loss 2.6319): 86%|βββββββββ | 540/625 [03:18<00:29, 2.88it/s]
Training 1/1 epoch (loss 2.6319): 87%|βββββββββ | 541/625 [03:18<00:29, 2.86it/s]
Training 1/1 epoch (loss 2.7472): 87%|βββββββββ | 541/625 [03:18<00:29, 2.86it/s]
Training 1/1 epoch (loss 2.7472): 87%|βββββββββ | 542/625 [03:18<00:28, 2.88it/s]
Training 1/1 epoch (loss 2.6845): 87%|βββββββββ | 542/625 [03:19<00:28, 2.88it/s]
Training 1/1 epoch (loss 2.6845): 87%|βββββββββ | 543/625 [03:19<00:28, 2.85it/s]
Training 1/1 epoch (loss 2.8964): 87%|βββββββββ | 543/625 [03:19<00:28, 2.85it/s]
Training 1/1 epoch (loss 2.8964): 87%|βββββββββ | 544/625 [03:19<00:29, 2.77it/s]
Training 1/1 epoch (loss 3.0570): 87%|βββββββββ | 544/625 [03:19<00:29, 2.77it/s]
Training 1/1 epoch (loss 3.0570): 87%|βββββββββ | 545/625 [03:19<00:27, 2.86it/s]
Training 1/1 epoch (loss 2.8007): 87%|βββββββββ | 545/625 [03:20<00:27, 2.86it/s]
Training 1/1 epoch (loss 2.8007): 87%|βββββββββ | 546/625 [03:20<00:26, 2.94it/s]
Training 1/1 epoch (loss 2.6792): 87%|βββββββββ | 546/625 [03:20<00:26, 2.94it/s]
Training 1/1 epoch (loss 2.6792): 88%|βββββββββ | 547/625 [03:20<00:26, 2.93it/s]
Training 1/1 epoch (loss 2.5320): 88%|βββββββββ | 547/625 [03:20<00:26, 2.93it/s]
Training 1/1 epoch (loss 2.5320): 88%|βββββββββ | 548/625 [03:20<00:25, 2.98it/s]
Training 1/1 epoch (loss 2.8052): 88%|βββββββββ | 548/625 [03:21<00:25, 2.98it/s]
Training 1/1 epoch (loss 2.8052): 88%|βββββββββ | 549/625 [03:21<00:26, 2.86it/s]
Training 1/1 epoch (loss 2.5935): 88%|βββββββββ | 549/625 [03:21<00:26, 2.86it/s]
Training 1/1 epoch (loss 2.5935): 88%|βββββββββ | 550/625 [03:21<00:25, 2.98it/s]
Training 1/1 epoch (loss 2.9607): 88%|βββββββββ | 550/625 [03:21<00:25, 2.98it/s]
Training 1/1 epoch (loss 2.9607): 88%|βββββββββ | 551/625 [03:21<00:24, 3.03it/s]
Training 1/1 epoch (loss 2.7292): 88%|βββββββββ | 551/625 [03:22<00:24, 3.03it/s]
Training 1/1 epoch (loss 2.7292): 88%|βββββββββ | 552/625 [03:22<00:24, 3.04it/s]
Training 1/1 epoch (loss 2.7166): 88%|βββββββββ | 552/625 [03:22<00:24, 3.04it/s]
Training 1/1 epoch (loss 2.7166): 88%|βββββββββ | 553/625 [03:22<00:25, 2.87it/s]
Training 1/1 epoch (loss 2.7814): 88%|βββββββββ | 553/625 [03:22<00:25, 2.87it/s]
Training 1/1 epoch (loss 2.7814): 89%|βββββββββ | 554/625 [03:22<00:24, 2.90it/s]
Training 1/1 epoch (loss 2.8241): 89%|βββββββββ | 554/625 [03:23<00:24, 2.90it/s]
Training 1/1 epoch (loss 2.8241): 89%|βββββββββ | 555/625 [03:23<00:24, 2.89it/s]
Training 1/1 epoch (loss 2.6266): 89%|βββββββββ | 555/625 [03:23<00:24, 2.89it/s]
Training 1/1 epoch (loss 2.6266): 89%|βββββββββ | 556/625 [03:23<00:28, 2.44it/s]
Training 1/1 epoch (loss 2.7476): 89%|βββββββββ | 556/625 [03:24<00:28, 2.44it/s]
Training 1/1 epoch (loss 2.7476): 89%|βββββββββ | 557/625 [03:24<00:28, 2.38it/s]
Training 1/1 epoch (loss 2.8370): 89%|βββββββββ | 557/625 [03:24<00:28, 2.38it/s]
Training 1/1 epoch (loss 2.8370): 89%|βββββββββ | 558/625 [03:24<00:26, 2.52it/s]
Training 1/1 epoch (loss 2.6017): 89%|βββββββββ | 558/625 [03:24<00:26, 2.52it/s]
Training 1/1 epoch (loss 2.6017): 89%|βββββββββ | 559/625 [03:24<00:25, 2.60it/s]
Training 1/1 epoch (loss 2.7902): 89%|βββββββββ | 559/625 [03:25<00:25, 2.60it/s]
Training 1/1 epoch (loss 2.7902): 90%|βββββββββ | 560/625 [03:25<00:25, 2.50it/s]
Training 1/1 epoch (loss 2.8408): 90%|βββββββββ | 560/625 [03:25<00:25, 2.50it/s]
Training 1/1 epoch (loss 2.8408): 90%|βββββββββ | 561/625 [03:25<00:27, 2.31it/s]
Training 1/1 epoch (loss 2.9371): 90%|βββββββββ | 561/625 [03:26<00:27, 2.31it/s]
Training 1/1 epoch (loss 2.9371): 90%|βββββββββ | 562/625 [03:26<00:24, 2.53it/s]
Training 1/1 epoch (loss 2.8057): 90%|βββββββββ | 562/625 [03:26<00:24, 2.53it/s]
Training 1/1 epoch (loss 2.8057): 90%|βββββββββ | 563/625 [03:26<00:23, 2.67it/s]
Training 1/1 epoch (loss 2.5171): 90%|βββββββββ | 563/625 [03:26<00:23, 2.67it/s]
Training 1/1 epoch (loss 2.5171): 90%|βββββββββ | 564/625 [03:26<00:22, 2.72it/s]
Training 1/1 epoch (loss 2.7437): 90%|βββββββββ | 564/625 [03:27<00:22, 2.72it/s]
Training 1/1 epoch (loss 2.7437): 90%|βββββββββ | 565/625 [03:27<00:21, 2.75it/s]
Training 1/1 epoch (loss 2.8151): 90%|βββββββββ | 565/625 [03:27<00:21, 2.75it/s]
Training 1/1 epoch (loss 2.8151): 91%|βββββββββ | 566/625 [03:27<00:20, 2.82it/s]
Training 1/1 epoch (loss 2.6687): 91%|βββββββββ | 566/625 [03:28<00:20, 2.82it/s]
Training 1/1 epoch (loss 2.6687): 91%|βββββββββ | 567/625 [03:28<00:21, 2.70it/s]
Training 1/1 epoch (loss 2.7038): 91%|βββββββββ | 567/625 [03:28<00:21, 2.70it/s]
Training 1/1 epoch (loss 2.7038): 91%|βββββββββ | 568/625 [03:28<00:20, 2.73it/s]
Training 1/1 epoch (loss 2.9593): 91%|βββββββββ | 568/625 [03:28<00:20, 2.73it/s]
Training 1/1 epoch (loss 2.9593): 91%|βββββββββ | 569/625 [03:28<00:20, 2.76it/s]
Training 1/1 epoch (loss 2.9866): 91%|βββββββββ | 569/625 [03:29<00:20, 2.76it/s]
Training 1/1 epoch (loss 2.9866): 91%|βββββββββ | 570/625 [03:29<00:19, 2.80it/s]
Training 1/1 epoch (loss 2.9397): 91%|βββββββββ | 570/625 [03:29<00:19, 2.80it/s]
Training 1/1 epoch (loss 2.9397): 91%|ββββββββββ| 571/625 [03:29<00:19, 2.73it/s]
Training 1/1 epoch (loss 2.9702): 91%|ββββββββββ| 571/625 [03:29<00:19, 2.73it/s]
Training 1/1 epoch (loss 2.9702): 92%|ββββββββββ| 572/625 [03:29<00:19, 2.76it/s]
Training 1/1 epoch (loss 2.7411): 92%|ββββββββββ| 572/625 [03:30<00:19, 2.76it/s]
Training 1/1 epoch (loss 2.7411): 92%|ββββββββββ| 573/625 [03:30<00:18, 2.84it/s]
Training 1/1 epoch (loss 2.5706): 92%|ββββββββββ| 573/625 [03:30<00:18, 2.84it/s]
Training 1/1 epoch (loss 2.5706): 92%|ββββββββββ| 574/625 [03:30<00:18, 2.79it/s]
Training 1/1 epoch (loss 2.6168): 92%|ββββββββββ| 574/625 [03:30<00:18, 2.79it/s]
Training 1/1 epoch (loss 2.6168): 92%|ββββββββββ| 575/625 [03:30<00:17, 2.87it/s]
Training 1/1 epoch (loss 2.6007): 92%|ββββββββββ| 575/625 [03:31<00:17, 2.87it/s]
Training 1/1 epoch (loss 2.6007): 92%|ββββββββββ| 576/625 [03:31<00:17, 2.74it/s]
Training 1/1 epoch (loss 2.7978): 92%|ββββββββββ| 576/625 [03:31<00:17, 2.74it/s]
Training 1/1 epoch (loss 2.7978): 92%|ββββββββββ| 577/625 [03:31<00:18, 2.55it/s]
Training 1/1 epoch (loss 2.7620): 92%|ββββββββββ| 577/625 [03:32<00:18, 2.55it/s]
Training 1/1 epoch (loss 2.7620): 92%|ββββββββββ| 578/625 [03:32<00:17, 2.66it/s]
Training 1/1 epoch (loss 2.9856): 92%|ββββββββββ| 578/625 [03:32<00:17, 2.66it/s]
Training 1/1 epoch (loss 2.9856): 93%|ββββββββββ| 579/625 [03:32<00:16, 2.80it/s]
Training 1/1 epoch (loss 2.7237): 93%|ββββββββββ| 579/625 [03:32<00:16, 2.80it/s]
Training 1/1 epoch (loss 2.7237): 93%|ββββββββββ| 580/625 [03:32<00:15, 2.83it/s]
Training 1/1 epoch (loss 2.6852): 93%|ββββββββββ| 580/625 [03:33<00:15, 2.83it/s]
Training 1/1 epoch (loss 2.6852): 93%|ββββββββββ| 581/625 [03:33<00:15, 2.85it/s]
Training 1/1 epoch (loss 2.8118): 93%|ββββββββββ| 581/625 [03:33<00:15, 2.85it/s]
Training 1/1 epoch (loss 2.8118): 93%|ββββββββββ| 582/625 [03:33<00:15, 2.79it/s]
Training 1/1 epoch (loss 2.9108): 93%|ββββββββββ| 582/625 [03:33<00:15, 2.79it/s]
Training 1/1 epoch (loss 2.9108): 93%|ββββββββββ| 583/625 [03:33<00:14, 2.85it/s]
Training 1/1 epoch (loss 2.5352): 93%|ββββββββββ| 583/625 [03:34<00:14, 2.85it/s]
Training 1/1 epoch (loss 2.5352): 93%|ββββββββββ| 584/625 [03:34<00:14, 2.87it/s]
Training 1/1 epoch (loss 2.7558): 93%|ββββββββββ| 584/625 [03:34<00:14, 2.87it/s]
Training 1/1 epoch (loss 2.7558): 94%|ββββββββββ| 585/625 [03:34<00:13, 2.88it/s]
Training 1/1 epoch (loss 2.6458): 94%|ββββββββββ| 585/625 [03:34<00:13, 2.88it/s]
Training 1/1 epoch (loss 2.6458): 94%|ββββββββββ| 586/625 [03:34<00:13, 2.89it/s]
Training 1/1 epoch (loss 2.9458): 94%|ββββββββββ| 586/625 [03:35<00:13, 2.89it/s]
Training 1/1 epoch (loss 2.9458): 94%|ββββββββββ| 587/625 [03:35<00:12, 2.97it/s]
Training 1/1 epoch (loss 2.8020): 94%|ββββββββββ| 587/625 [03:35<00:12, 2.97it/s]
Training 1/1 epoch (loss 2.8020): 94%|ββββββββββ| 588/625 [03:35<00:12, 2.97it/s]
Training 1/1 epoch (loss 2.7818): 94%|ββββββββββ| 588/625 [03:35<00:12, 2.97it/s]
Training 1/1 epoch (loss 2.7818): 94%|ββββββββββ| 589/625 [03:35<00:12, 2.84it/s]
Training 1/1 epoch (loss 2.6753): 94%|ββββββββββ| 589/625 [03:36<00:12, 2.84it/s]
Training 1/1 epoch (loss 2.6753): 94%|ββββββββββ| 590/625 [03:36<00:12, 2.87it/s]
Training 1/1 epoch (loss 3.0763): 94%|ββββββββββ| 590/625 [03:36<00:12, 2.87it/s]
Training 1/1 epoch (loss 3.0763): 95%|ββββββββββ| 591/625 [03:36<00:11, 2.91it/s]
Training 1/1 epoch (loss 2.6754): 95%|ββββββββββ| 591/625 [03:36<00:11, 2.91it/s]
Training 1/1 epoch (loss 2.6754): 95%|ββββββββββ| 592/625 [03:36<00:11, 2.91it/s]
Training 1/1 epoch (loss 2.7792): 95%|ββββββββββ| 592/625 [03:37<00:11, 2.91it/s]
Training 1/1 epoch (loss 2.7792): 95%|ββββββββββ| 593/625 [03:37<00:11, 2.74it/s]
Training 1/1 epoch (loss 2.7998): 95%|ββββββββββ| 593/625 [03:37<00:11, 2.74it/s]
Training 1/1 epoch (loss 2.7998): 95%|ββββββββββ| 594/625 [03:37<00:10, 2.83it/s]
Training 1/1 epoch (loss 2.6130): 95%|ββββββββββ| 594/625 [03:37<00:10, 2.83it/s]
Training 1/1 epoch (loss 2.6130): 95%|ββββββββββ| 595/625 [03:37<00:10, 2.84it/s]
Training 1/1 epoch (loss 2.7066): 95%|ββββββββββ| 595/625 [03:38<00:10, 2.84it/s]
Training 1/1 epoch (loss 2.7066): 95%|ββββββββββ| 596/625 [03:38<00:10, 2.84it/s]
Training 1/1 epoch (loss 2.6338): 95%|ββββββββββ| 596/625 [03:38<00:10, 2.84it/s]
Training 1/1 epoch (loss 2.6338): 96%|ββββββββββ| 597/625 [03:38<00:09, 2.85it/s]
Training 1/1 epoch (loss 2.8189): 96%|ββββββββββ| 597/625 [03:38<00:09, 2.85it/s]
Training 1/1 epoch (loss 2.8189): 96%|ββββββββββ| 598/625 [03:38<00:09, 2.85it/s]
Training 1/1 epoch (loss 2.7980): 96%|ββββββββββ| 598/625 [03:39<00:09, 2.85it/s]
Training 1/1 epoch (loss 2.7980): 96%|ββββββββββ| 599/625 [03:39<00:08, 2.90it/s]
Training 1/1 epoch (loss 2.8033): 96%|ββββββββββ| 599/625 [03:39<00:08, 2.90it/s]
Training 1/1 epoch (loss 2.8033): 96%|ββββββββββ| 600/625 [03:39<00:08, 2.89it/s]
Training 1/1 epoch (loss 2.6637): 96%|ββββββββββ| 600/625 [03:40<00:08, 2.89it/s]
Training 1/1 epoch (loss 2.6637): 96%|ββββββββββ| 601/625 [03:40<00:08, 2.85it/s]
Training 1/1 epoch (loss 2.7906): 96%|ββββββββββ| 601/625 [03:40<00:08, 2.85it/s]
Training 1/1 epoch (loss 2.7906): 96%|ββββββββββ| 602/625 [03:40<00:07, 2.92it/s]
Training 1/1 epoch (loss 2.6611): 96%|ββββββββββ| 602/625 [03:40<00:07, 2.92it/s]
Training 1/1 epoch (loss 2.6611): 96%|ββββββββββ| 603/625 [03:40<00:07, 2.94it/s]
Training 1/1 epoch (loss 2.7570): 96%|ββββββββββ| 603/625 [03:40<00:07, 2.94it/s]
Training 1/1 epoch (loss 2.7570): 97%|ββββββββββ| 604/625 [03:40<00:06, 3.02it/s]
Training 1/1 epoch (loss 2.8464): 97%|ββββββββββ| 604/625 [03:41<00:06, 3.02it/s]
Training 1/1 epoch (loss 2.8464): 97%|ββββββββββ| 605/625 [03:41<00:06, 2.99it/s]
Training 1/1 epoch (loss 2.7590): 97%|ββββββββββ| 605/625 [03:41<00:06, 2.99it/s]
Training 1/1 epoch (loss 2.7590): 97%|ββββββββββ| 606/625 [03:41<00:06, 2.91it/s]
Training 1/1 epoch (loss 2.7083): 97%|ββββββββββ| 606/625 [03:41<00:06, 2.91it/s]
Training 1/1 epoch (loss 2.7083): 97%|ββββββββββ| 607/625 [03:41<00:05, 3.03it/s]
Training 1/1 epoch (loss 2.8852): 97%|ββββββββββ| 607/625 [03:42<00:05, 3.03it/s]
Training 1/1 epoch (loss 2.8852): 97%|ββββββββββ| 608/625 [03:42<00:05, 2.95it/s]
Training 1/1 epoch (loss 2.8266): 97%|ββββββββββ| 608/625 [03:42<00:05, 2.95it/s]
Training 1/1 epoch (loss 2.8266): 97%|ββββββββββ| 609/625 [03:42<00:05, 2.96it/s]
Training 1/1 epoch (loss 2.8352): 97%|ββββββββββ| 609/625 [03:43<00:05, 2.96it/s]
Training 1/1 epoch (loss 2.8352): 98%|ββββββββββ| 610/625 [03:43<00:05, 2.99it/s]
Training 1/1 epoch (loss 2.7206): 98%|ββββββββββ| 610/625 [03:43<00:05, 2.99it/s]
Training 1/1 epoch (loss 2.7206): 98%|ββββββββββ| 611/625 [03:43<00:04, 3.03it/s]
Training 1/1 epoch (loss 3.0205): 98%|ββββββββββ| 611/625 [03:43<00:04, 3.03it/s]
Training 1/1 epoch (loss 3.0205): 98%|ββββββββββ| 612/625 [03:43<00:04, 2.88it/s]
Training 1/1 epoch (loss 2.6336): 98%|ββββββββββ| 612/625 [03:44<00:04, 2.88it/s]
Training 1/1 epoch (loss 2.6336): 98%|ββββββββββ| 613/625 [03:44<00:04, 2.91it/s]
Training 1/1 epoch (loss 2.8791): 98%|ββββββββββ| 613/625 [03:44<00:04, 2.91it/s]
Training 1/1 epoch (loss 2.8791): 98%|ββββββββββ| 614/625 [03:44<00:03, 2.88it/s]
Training 1/1 epoch (loss 2.6587): 98%|ββββββββββ| 614/625 [03:44<00:03, 2.88it/s]
Training 1/1 epoch (loss 2.6587): 98%|ββββββββββ| 615/625 [03:44<00:03, 2.92it/s]
Training 1/1 epoch (loss 2.7368): 98%|ββββββββββ| 615/625 [03:45<00:03, 2.92it/s]
Training 1/1 epoch (loss 2.7368): 99%|ββββββββββ| 616/625 [03:45<00:03, 2.84it/s]
Training 1/1 epoch (loss 2.8773): 99%|ββββββββββ| 616/625 [03:45<00:03, 2.84it/s]
Training 1/1 epoch (loss 2.8773): 99%|ββββββββββ| 617/625 [03:45<00:02, 2.77it/s]
Training 1/1 epoch (loss 2.9389): 99%|ββββββββββ| 617/625 [03:45<00:02, 2.77it/s]
Training 1/1 epoch (loss 2.9389): 99%|ββββββββββ| 618/625 [03:45<00:02, 2.82it/s]
Training 1/1 epoch (loss 2.7756): 99%|ββββββββββ| 618/625 [03:46<00:02, 2.82it/s]
Training 1/1 epoch (loss 2.7756): 99%|ββββββββββ| 619/625 [03:46<00:02, 2.87it/s]
Training 1/1 epoch (loss 2.6288): 99%|ββββββββββ| 619/625 [03:46<00:02, 2.87it/s]
Training 1/1 epoch (loss 2.6288): 99%|ββββββββββ| 620/625 [03:46<00:01, 2.77it/s]
Training 1/1 epoch (loss 2.8780): 99%|ββββββββββ| 620/625 [03:46<00:01, 2.77it/s]
Training 1/1 epoch (loss 2.8780): 99%|ββββββββββ| 621/625 [03:46<00:01, 2.86it/s]
Training 1/1 epoch (loss 2.7361): 99%|ββββββββββ| 621/625 [03:47<00:01, 2.86it/s]
Training 1/1 epoch (loss 2.7361): 100%|ββββββββββ| 622/625 [03:47<00:01, 2.93it/s]
Training 1/1 epoch (loss 2.8506): 100%|ββββββββββ| 622/625 [03:47<00:01, 2.93it/s]
Training 1/1 epoch (loss 2.8506): 100%|ββββββββββ| 623/625 [03:47<00:00, 2.75it/s]
Training 1/1 epoch (loss 2.5905): 100%|ββββββββββ| 623/625 [03:48<00:00, 2.75it/s]
Training 1/1 epoch (loss 2.5905): 100%|ββββββββββ| 624/625 [03:48<00:00, 2.71it/s]
Training 1/1 epoch (loss 2.7995): 100%|ββββββββββ| 624/625 [03:48<00:00, 2.71it/s]
Training 1/1 epoch (loss 2.7995): 100%|ββββββββββ| 625/625 [03:48<00:00, 2.78it/s]
Training 1/1 epoch (loss 2.7995): 100%|ββββββββββ| 625/625 [03:48<00:00, 2.74it/s] |