|
Training 1/1 epoch (loss 2.6216): 0%| | 0/625 [00:08<?, ?it/s]
Training 1/1 epoch (loss 2.6216): 0%| | 1/625 [00:08<1:29:56, 8.65s/it]
Training 1/1 epoch (loss 2.8992): 0%| | 1/625 [00:10<1:29:56, 8.65s/it]
Training 1/1 epoch (loss 2.8992): 0%| | 2/625 [00:10<50:40, 4.88s/it]
Training 1/1 epoch (loss 2.6595): 0%| | 2/625 [00:12<50:40, 4.88s/it]
Training 1/1 epoch (loss 2.6595): 0%| | 3/625 [00:12<34:31, 3.33s/it]
Training 1/1 epoch (loss 2.5304): 0%| | 3/625 [00:13<34:31, 3.33s/it]
Training 1/1 epoch (loss 2.5304): 1%| | 4/625 [00:13<25:30, 2.46s/it]
Training 1/1 epoch (loss 2.9317): 1%| | 4/625 [00:14<25:30, 2.46s/it]
Training 1/1 epoch (loss 2.9317): 1%| | 5/625 [00:14<21:12, 2.05s/it]
Training 1/1 epoch (loss 2.7400): 1%| | 5/625 [00:17<21:12, 2.05s/it]
Training 1/1 epoch (loss 2.7400): 1%| | 6/625 [00:17<21:54, 2.12s/it]
Training 1/1 epoch (loss 2.7308): 1%| | 6/625 [00:18<21:54, 2.12s/it]
Training 1/1 epoch (loss 2.7308): 1%| | 7/625 [00:18<18:15, 1.77s/it]
Training 1/1 epoch (loss 3.1010): 1%| | 7/625 [00:19<18:15, 1.77s/it]
Training 1/1 epoch (loss 3.1010): 1%|β | 8/625 [00:19<15:22, 1.49s/it]
Training 1/1 epoch (loss 2.6791): 1%|β | 8/625 [00:21<15:22, 1.49s/it]
Training 1/1 epoch (loss 2.6791): 1%|β | 9/625 [00:21<17:03, 1.66s/it]
Training 1/1 epoch (loss 3.0133): 1%|β | 9/625 [00:21<17:03, 1.66s/it]
Training 1/1 epoch (loss 3.0133): 2%|β | 10/625 [00:21<13:06, 1.28s/it]
Training 1/1 epoch (loss 2.5884): 2%|β | 10/625 [00:22<13:06, 1.28s/it]
Training 1/1 epoch (loss 2.5884): 2%|β | 11/625 [00:22<13:23, 1.31s/it]
Training 1/1 epoch (loss 2.6717): 2%|β | 11/625 [00:24<13:23, 1.31s/it]
Training 1/1 epoch (loss 2.6717): 2%|β | 12/625 [00:24<14:12, 1.39s/it]
Training 1/1 epoch (loss 2.7501): 2%|β | 12/625 [00:24<14:12, 1.39s/it]
Training 1/1 epoch (loss 2.7501): 2%|β | 13/625 [00:24<11:18, 1.11s/it]
Training 1/1 epoch (loss 3.0674): 2%|β | 13/625 [00:26<11:18, 1.11s/it]
Training 1/1 epoch (loss 3.0674): 2%|β | 14/625 [00:26<11:28, 1.13s/it]
Training 1/1 epoch (loss 2.6760): 2%|β | 14/625 [00:27<11:28, 1.13s/it]
Training 1/1 epoch (loss 2.6760): 2%|β | 15/625 [00:27<13:49, 1.36s/it]
Training 1/1 epoch (loss 2.9117): 2%|β | 15/625 [00:28<13:49, 1.36s/it]
Training 1/1 epoch (loss 2.9117): 3%|β | 16/625 [00:28<11:49, 1.16s/it]
Training 1/1 epoch (loss 2.8622): 3%|β | 16/625 [00:30<11:49, 1.16s/it]
Training 1/1 epoch (loss 2.8622): 3%|β | 17/625 [00:30<14:10, 1.40s/it]
Training 1/1 epoch (loss 2.7824): 3%|β | 17/625 [00:31<14:10, 1.40s/it]
Training 1/1 epoch (loss 2.7824): 3%|β | 18/625 [00:31<13:05, 1.29s/it]
Training 1/1 epoch (loss 2.9268): 3%|β | 18/625 [00:32<13:05, 1.29s/it]
Training 1/1 epoch (loss 2.9268): 3%|β | 19/625 [00:32<13:01, 1.29s/it]
Training 1/1 epoch (loss 2.9557): 3%|β | 19/625 [00:34<13:01, 1.29s/it]
Training 1/1 epoch (loss 2.9557): 3%|β | 20/625 [00:34<14:48, 1.47s/it]
Training 1/1 epoch (loss 2.6752): 3%|β | 20/625 [00:36<14:48, 1.47s/it]
Training 1/1 epoch (loss 2.6752): 3%|β | 21/625 [00:36<15:03, 1.50s/it]
Training 1/1 epoch (loss 2.8513): 3%|β | 21/625 [00:37<15:03, 1.50s/it]
Training 1/1 epoch (loss 2.8513): 4%|β | 22/625 [00:37<14:27, 1.44s/it]
Training 1/1 epoch (loss 2.7812): 4%|β | 22/625 [00:39<14:27, 1.44s/it]
Training 1/1 epoch (loss 2.7812): 4%|β | 23/625 [00:39<15:01, 1.50s/it]
Training 1/1 epoch (loss 2.8110): 4%|β | 23/625 [00:40<15:01, 1.50s/it]
Training 1/1 epoch (loss 2.8110): 4%|β | 24/625 [00:40<13:56, 1.39s/it]
Training 1/1 epoch (loss 2.5622): 4%|β | 24/625 [00:42<13:56, 1.39s/it]
Training 1/1 epoch (loss 2.5622): 4%|β | 25/625 [00:42<15:59, 1.60s/it]
Training 1/1 epoch (loss 2.7560): 4%|β | 25/625 [00:44<15:59, 1.60s/it]
Training 1/1 epoch (loss 2.7560): 4%|β | 26/625 [00:44<16:33, 1.66s/it]
Training 1/1 epoch (loss 2.4519): 4%|β | 26/625 [00:44<16:33, 1.66s/it]
Training 1/1 epoch (loss 2.4519): 4%|β | 27/625 [00:44<12:51, 1.29s/it]
Training 1/1 epoch (loss 2.8400): 4%|β | 27/625 [00:46<12:51, 1.29s/it]
Training 1/1 epoch (loss 2.8400): 4%|β | 28/625 [00:46<14:52, 1.50s/it]
Training 1/1 epoch (loss 2.6152): 4%|β | 28/625 [00:49<14:52, 1.50s/it]
Training 1/1 epoch (loss 2.6152): 5%|β | 29/625 [00:49<17:14, 1.74s/it]
Training 1/1 epoch (loss 2.8047): 5%|β | 29/625 [00:50<17:14, 1.74s/it]
Training 1/1 epoch (loss 2.8047): 5%|β | 30/625 [00:50<14:55, 1.51s/it]
Training 1/1 epoch (loss 2.5663): 5%|β | 30/625 [00:51<14:55, 1.51s/it]
Training 1/1 epoch (loss 2.5663): 5%|β | 31/625 [00:51<15:27, 1.56s/it]
Training 1/1 epoch (loss 2.8810): 5%|β | 31/625 [00:53<15:27, 1.56s/it]
Training 1/1 epoch (loss 2.8810): 5%|β | 32/625 [00:53<17:19, 1.75s/it]
Training 1/1 epoch (loss 2.9019): 5%|β | 32/625 [00:54<17:19, 1.75s/it]
Training 1/1 epoch (loss 2.9019): 5%|β | 33/625 [00:54<13:51, 1.40s/it]
Training 1/1 epoch (loss 2.7067): 5%|β | 33/625 [00:56<13:51, 1.40s/it]
Training 1/1 epoch (loss 2.7067): 5%|β | 34/625 [00:56<16:24, 1.67s/it]
Training 1/1 epoch (loss 2.6744): 5%|β | 34/625 [00:58<16:24, 1.67s/it]
Training 1/1 epoch (loss 2.6744): 6%|β | 35/625 [00:58<15:18, 1.56s/it]
Training 1/1 epoch (loss 2.6831): 6%|β | 35/625 [00:58<15:18, 1.56s/it]
Training 1/1 epoch (loss 2.6831): 6%|β | 36/625 [00:58<12:54, 1.31s/it]
Training 1/1 epoch (loss 2.9608): 6%|β | 36/625 [01:00<12:54, 1.31s/it]
Training 1/1 epoch (loss 2.9608): 6%|β | 37/625 [01:00<14:55, 1.52s/it]
Training 1/1 epoch (loss 2.6910): 6%|β | 37/625 [01:01<14:55, 1.52s/it]
Training 1/1 epoch (loss 2.6910): 6%|β | 38/625 [01:01<13:14, 1.35s/it]
Training 1/1 epoch (loss 3.0130): 6%|β | 38/625 [01:02<13:14, 1.35s/it]
Training 1/1 epoch (loss 3.0130): 6%|β | 39/625 [01:02<12:10, 1.25s/it]
Training 1/1 epoch (loss 2.8321): 6%|β | 39/625 [01:05<12:10, 1.25s/it]
Training 1/1 epoch (loss 2.8321): 6%|β | 40/625 [01:05<15:02, 1.54s/it]
Training 1/1 epoch (loss 2.9385): 6%|β | 40/625 [01:06<15:02, 1.54s/it]
Training 1/1 epoch (loss 2.9385): 7%|β | 41/625 [01:06<13:17, 1.37s/it]
Training 1/1 epoch (loss 2.9911): 7%|β | 41/625 [01:07<13:17, 1.37s/it]
Training 1/1 epoch (loss 2.9911): 7%|β | 42/625 [01:07<14:38, 1.51s/it]
Training 1/1 epoch (loss 2.8132): 7%|β | 42/625 [01:09<14:38, 1.51s/it]
Training 1/1 epoch (loss 2.8132): 7%|β | 43/625 [01:09<15:11, 1.57s/it]
Training 1/1 epoch (loss 2.7654): 7%|β | 43/625 [01:10<15:11, 1.57s/it]
Training 1/1 epoch (loss 2.7654): 7%|β | 44/625 [01:10<14:31, 1.50s/it]
Training 1/1 epoch (loss 2.8984): 7%|β | 44/625 [01:13<14:31, 1.50s/it]
Training 1/1 epoch (loss 2.8984): 7%|β | 45/625 [01:13<17:21, 1.79s/it]
Training 1/1 epoch (loss 2.7951): 7%|β | 45/625 [01:14<17:21, 1.79s/it]
Training 1/1 epoch (loss 2.7951): 7%|β | 46/625 [01:14<16:02, 1.66s/it]
Training 1/1 epoch (loss 2.6438): 7%|β | 46/625 [01:16<16:02, 1.66s/it]
Training 1/1 epoch (loss 2.6438): 8%|β | 47/625 [01:16<17:26, 1.81s/it]
Training 1/1 epoch (loss 2.5889): 8%|β | 47/625 [01:18<17:26, 1.81s/it]
Training 1/1 epoch (loss 2.5889): 8%|β | 48/625 [01:18<17:35, 1.83s/it]
Training 1/1 epoch (loss 2.9499): 8%|β | 48/625 [01:19<17:35, 1.83s/it]
Training 1/1 epoch (loss 2.9499): 8%|β | 49/625 [01:19<13:52, 1.44s/it]
Training 1/1 epoch (loss 3.1632): 8%|β | 49/625 [01:21<13:52, 1.44s/it]
Training 1/1 epoch (loss 3.1632): 8%|β | 50/625 [01:21<16:28, 1.72s/it]
Training 1/1 epoch (loss 2.8747): 8%|β | 50/625 [01:23<16:28, 1.72s/it]
Training 1/1 epoch (loss 2.8747): 8%|β | 51/625 [01:23<16:39, 1.74s/it]
Training 1/1 epoch (loss 2.8363): 8%|β | 51/625 [01:24<16:39, 1.74s/it]
Training 1/1 epoch (loss 2.8363): 8%|β | 52/625 [01:24<14:59, 1.57s/it]
Training 1/1 epoch (loss 2.6711): 8%|β | 52/625 [01:26<14:59, 1.57s/it]
Training 1/1 epoch (loss 2.6711): 8%|β | 53/625 [01:26<14:50, 1.56s/it]
Training 1/1 epoch (loss 2.5208): 8%|β | 53/625 [01:27<14:50, 1.56s/it]
Training 1/1 epoch (loss 2.5208): 9%|β | 54/625 [01:27<13:59, 1.47s/it]
Training 1/1 epoch (loss 2.8917): 9%|β | 54/625 [01:28<13:59, 1.47s/it]
Training 1/1 epoch (loss 2.8917): 9%|β | 55/625 [01:28<13:56, 1.47s/it]
Training 1/1 epoch (loss 2.7251): 9%|β | 55/625 [01:30<13:56, 1.47s/it]
Training 1/1 epoch (loss 2.7251): 9%|β | 56/625 [01:30<14:10, 1.50s/it]
Training 1/1 epoch (loss 2.7122): 9%|β | 56/625 [01:31<14:10, 1.50s/it]
Training 1/1 epoch (loss 2.7122): 9%|β | 57/625 [01:31<12:47, 1.35s/it]
Training 1/1 epoch (loss 2.7971): 9%|β | 57/625 [01:33<12:47, 1.35s/it]
Training 1/1 epoch (loss 2.7971): 9%|β | 58/625 [01:33<14:47, 1.57s/it]
Training 1/1 epoch (loss 2.7512): 9%|β | 58/625 [01:35<14:47, 1.57s/it]
Training 1/1 epoch (loss 2.7512): 9%|β | 59/625 [01:35<15:22, 1.63s/it]
Training 1/1 epoch (loss 2.7287): 9%|β | 59/625 [01:36<15:22, 1.63s/it]
Training 1/1 epoch (loss 2.7287): 10%|β | 60/625 [01:36<13:51, 1.47s/it]
Training 1/1 epoch (loss 2.6824): 10%|β | 60/625 [01:38<13:51, 1.47s/it]
Training 1/1 epoch (loss 2.6824): 10%|β | 61/625 [01:38<14:19, 1.52s/it]
Training 1/1 epoch (loss 2.8612): 10%|β | 61/625 [01:38<14:19, 1.52s/it]
Training 1/1 epoch (loss 2.8612): 10%|β | 62/625 [01:38<12:24, 1.32s/it]
Training 1/1 epoch (loss 2.7659): 10%|β | 62/625 [01:40<12:24, 1.32s/it]
Training 1/1 epoch (loss 2.7659): 10%|β | 63/625 [01:40<12:35, 1.34s/it]
Training 1/1 epoch (loss 2.6421): 10%|β | 63/625 [01:42<12:35, 1.34s/it]
Training 1/1 epoch (loss 2.6421): 10%|β | 64/625 [01:42<13:34, 1.45s/it]
Training 1/1 epoch (loss 2.6360): 10%|β | 64/625 [01:42<13:34, 1.45s/it]
Training 1/1 epoch (loss 2.6360): 10%|β | 65/625 [01:42<11:41, 1.25s/it]
Training 1/1 epoch (loss 2.8398): 10%|β | 65/625 [01:45<11:41, 1.25s/it]
Training 1/1 epoch (loss 2.8398): 11%|β | 66/625 [01:45<14:32, 1.56s/it]
Training 1/1 epoch (loss 2.8108): 11%|β | 66/625 [01:46<14:32, 1.56s/it]
Training 1/1 epoch (loss 2.8108): 11%|β | 67/625 [01:46<14:35, 1.57s/it]
Training 1/1 epoch (loss 2.6959): 11%|β | 67/625 [01:47<14:35, 1.57s/it]
Training 1/1 epoch (loss 2.6959): 11%|β | 68/625 [01:47<13:13, 1.42s/it]
Training 1/1 epoch (loss 2.8993): 11%|β | 68/625 [01:50<13:13, 1.42s/it]
Training 1/1 epoch (loss 2.8993): 11%|β | 69/625 [01:50<15:58, 1.72s/it]
Training 1/1 epoch (loss 2.6406): 11%|β | 69/625 [01:51<15:58, 1.72s/it]
Training 1/1 epoch (loss 2.6406): 11%|β | 70/625 [01:51<13:28, 1.46s/it]
Training 1/1 epoch (loss 2.7322): 11%|β | 70/625 [01:52<13:28, 1.46s/it]
Training 1/1 epoch (loss 2.7322): 11%|ββ | 71/625 [01:52<12:23, 1.34s/it]
Training 1/1 epoch (loss 2.6255): 11%|ββ | 71/625 [01:53<12:23, 1.34s/it]
Training 1/1 epoch (loss 2.6255): 12%|ββ | 72/625 [01:53<13:08, 1.43s/it]
Training 1/1 epoch (loss 2.6395): 12%|ββ | 72/625 [01:54<13:08, 1.43s/it]
Training 1/1 epoch (loss 2.6395): 12%|ββ | 73/625 [01:54<11:00, 1.20s/it]
Training 1/1 epoch (loss 2.9052): 12%|ββ | 73/625 [01:55<11:00, 1.20s/it]
Training 1/1 epoch (loss 2.9052): 12%|ββ | 74/625 [01:55<11:06, 1.21s/it]
Training 1/1 epoch (loss 2.5084): 12%|ββ | 74/625 [01:57<11:06, 1.21s/it]
Training 1/1 epoch (loss 2.5084): 12%|ββ | 75/625 [01:57<12:29, 1.36s/it]
Training 1/1 epoch (loss 2.6406): 12%|ββ | 75/625 [01:57<12:29, 1.36s/it]
Training 1/1 epoch (loss 2.6406): 12%|ββ | 76/625 [01:57<10:33, 1.15s/it]
Training 1/1 epoch (loss 2.6824): 12%|ββ | 76/625 [01:59<10:33, 1.15s/it]
Training 1/1 epoch (loss 2.6824): 12%|ββ | 77/625 [01:59<12:34, 1.38s/it]
Training 1/1 epoch (loss 2.7714): 12%|ββ | 77/625 [02:01<12:34, 1.38s/it]
Training 1/1 epoch (loss 2.7714): 12%|ββ | 78/625 [02:01<12:53, 1.41s/it]
Training 1/1 epoch (loss 2.8321): 12%|ββ | 78/625 [02:02<12:53, 1.41s/it]
Training 1/1 epoch (loss 2.8321): 13%|ββ | 79/625 [02:02<12:24, 1.36s/it]
Training 1/1 epoch (loss 2.7882): 13%|ββ | 79/625 [02:04<12:24, 1.36s/it]
Training 1/1 epoch (loss 2.7882): 13%|ββ | 80/625 [02:04<14:32, 1.60s/it]
Training 1/1 epoch (loss 2.8281): 13%|ββ | 80/625 [02:06<14:32, 1.60s/it]
Training 1/1 epoch (loss 2.8281): 13%|ββ | 81/625 [02:06<13:39, 1.51s/it]
Training 1/1 epoch (loss 2.6974): 13%|ββ | 81/625 [02:07<13:39, 1.51s/it]
Training 1/1 epoch (loss 2.6974): 13%|ββ | 82/625 [02:07<13:25, 1.48s/it]
Training 1/1 epoch (loss 2.7126): 13%|ββ | 82/625 [02:08<13:25, 1.48s/it]
Training 1/1 epoch (loss 2.7126): 13%|ββ | 83/625 [02:08<12:10, 1.35s/it]
Training 1/1 epoch (loss 2.4729): 13%|ββ | 83/625 [02:09<12:10, 1.35s/it]
Training 1/1 epoch (loss 2.4729): 13%|ββ | 84/625 [02:09<10:31, 1.17s/it]
Training 1/1 epoch (loss 2.8106): 13%|ββ | 84/625 [02:11<10:31, 1.17s/it]
Training 1/1 epoch (loss 2.8106): 14%|ββ | 85/625 [02:11<13:49, 1.54s/it]
Training 1/1 epoch (loss 2.7770): 14%|ββ | 85/625 [02:13<13:49, 1.54s/it]
Training 1/1 epoch (loss 2.7770): 14%|ββ | 86/625 [02:13<13:29, 1.50s/it]
Training 1/1 epoch (loss 2.8303): 14%|ββ | 86/625 [02:13<13:29, 1.50s/it]
Training 1/1 epoch (loss 2.8303): 14%|ββ | 87/625 [02:13<10:54, 1.22s/it]
Training 1/1 epoch (loss 2.8681): 14%|ββ | 87/625 [02:16<10:54, 1.22s/it]
Training 1/1 epoch (loss 2.8681): 14%|ββ | 88/625 [02:16<14:20, 1.60s/it]
Training 1/1 epoch (loss 2.8245): 14%|ββ | 88/625 [02:18<14:20, 1.60s/it]
Training 1/1 epoch (loss 2.8245): 14%|ββ | 89/625 [02:18<15:53, 1.78s/it]
Training 1/1 epoch (loss 2.9174): 14%|ββ | 89/625 [02:19<15:53, 1.78s/it]
Training 1/1 epoch (loss 2.9174): 14%|ββ | 90/625 [02:19<13:45, 1.54s/it]
Training 1/1 epoch (loss 2.6116): 14%|ββ | 90/625 [02:20<13:45, 1.54s/it]
Training 1/1 epoch (loss 2.6116): 15%|ββ | 91/625 [02:20<13:58, 1.57s/it]
Training 1/1 epoch (loss 2.7563): 15%|ββ | 91/625 [02:22<13:58, 1.57s/it]
Training 1/1 epoch (loss 2.7563): 15%|ββ | 92/625 [02:22<12:42, 1.43s/it]
Training 1/1 epoch (loss 2.6839): 15%|ββ | 92/625 [02:22<12:42, 1.43s/it]
Training 1/1 epoch (loss 2.6839): 15%|ββ | 93/625 [02:22<10:32, 1.19s/it]
Training 1/1 epoch (loss 2.8243): 15%|ββ | 93/625 [02:24<10:32, 1.19s/it]
Training 1/1 epoch (loss 2.8243): 15%|ββ | 94/625 [02:24<11:56, 1.35s/it]
Training 1/1 epoch (loss 2.8820): 15%|ββ | 94/625 [02:25<11:56, 1.35s/it]
Training 1/1 epoch (loss 2.8820): 15%|ββ | 95/625 [02:25<11:56, 1.35s/it]
Training 1/1 epoch (loss 2.5768): 15%|ββ | 95/625 [02:26<11:56, 1.35s/it]
Training 1/1 epoch (loss 2.5768): 15%|ββ | 96/625 [02:26<10:35, 1.20s/it]
Training 1/1 epoch (loss 2.7374): 15%|ββ | 96/625 [02:28<10:35, 1.20s/it]
Training 1/1 epoch (loss 2.7374): 16%|ββ | 97/625 [02:28<11:26, 1.30s/it]
Training 1/1 epoch (loss 2.6485): 16%|ββ | 97/625 [02:28<11:26, 1.30s/it]
Training 1/1 epoch (loss 2.6485): 16%|ββ | 98/625 [02:28<09:52, 1.12s/it]
Training 1/1 epoch (loss 2.8613): 16%|ββ | 98/625 [02:30<09:52, 1.12s/it]
Training 1/1 epoch (loss 2.8613): 16%|ββ | 99/625 [02:30<11:30, 1.31s/it]
Training 1/1 epoch (loss 2.7998): 16%|ββ | 99/625 [02:31<11:30, 1.31s/it]
Training 1/1 epoch (loss 2.7998): 16%|ββ | 100/625 [02:31<10:43, 1.23s/it]
Training 1/1 epoch (loss 2.6826): 16%|ββ | 100/625 [02:32<10:43, 1.23s/it]
Training 1/1 epoch (loss 2.6826): 16%|ββ | 101/625 [02:32<09:51, 1.13s/it]
Training 1/1 epoch (loss 2.7399): 16%|ββ | 101/625 [02:34<09:51, 1.13s/it]
Training 1/1 epoch (loss 2.7399): 16%|ββ | 102/625 [02:34<13:08, 1.51s/it]
Training 1/1 epoch (loss 2.7333): 16%|ββ | 102/625 [02:36<13:08, 1.51s/it]
Training 1/1 epoch (loss 2.7333): 16%|ββ | 103/625 [02:36<12:46, 1.47s/it]
Training 1/1 epoch (loss 2.7282): 16%|ββ | 103/625 [02:37<12:46, 1.47s/it]
Training 1/1 epoch (loss 2.7282): 17%|ββ | 104/625 [02:37<10:49, 1.25s/it]
Training 1/1 epoch (loss 2.7224): 17%|ββ | 104/625 [02:39<10:49, 1.25s/it]
Training 1/1 epoch (loss 2.7224): 17%|ββ | 105/625 [02:39<12:42, 1.47s/it]
Training 1/1 epoch (loss 2.5715): 17%|ββ | 105/625 [02:40<12:42, 1.47s/it]
Training 1/1 epoch (loss 2.5715): 17%|ββ | 106/625 [02:40<12:49, 1.48s/it]
Training 1/1 epoch (loss 3.0608): 17%|ββ | 106/625 [02:41<12:49, 1.48s/it]
Training 1/1 epoch (loss 3.0608): 17%|ββ | 107/625 [02:41<10:12, 1.18s/it]
Training 1/1 epoch (loss 2.7917): 17%|ββ | 107/625 [02:43<10:12, 1.18s/it]
Training 1/1 epoch (loss 2.7917): 17%|ββ | 108/625 [02:43<12:37, 1.47s/it]
Training 1/1 epoch (loss 2.8504): 17%|ββ | 108/625 [02:44<12:37, 1.47s/it]
Training 1/1 epoch (loss 2.8504): 17%|ββ | 109/625 [02:44<11:45, 1.37s/it]
Training 1/1 epoch (loss 2.9682): 17%|ββ | 109/625 [02:44<11:45, 1.37s/it]
Training 1/1 epoch (loss 2.9682): 18%|ββ | 110/625 [02:44<09:41, 1.13s/it]
Training 1/1 epoch (loss 2.8916): 18%|ββ | 110/625 [02:46<09:41, 1.13s/it]
Training 1/1 epoch (loss 2.8916): 18%|ββ | 111/625 [02:46<11:30, 1.34s/it]
Training 1/1 epoch (loss 2.8188): 18%|ββ | 111/625 [02:48<11:30, 1.34s/it]
Training 1/1 epoch (loss 2.8188): 18%|ββ | 112/625 [02:48<12:00, 1.40s/it]
Training 1/1 epoch (loss 2.6491): 18%|ββ | 112/625 [02:49<12:00, 1.40s/it]
Training 1/1 epoch (loss 2.6491): 18%|ββ | 113/625 [02:49<10:48, 1.27s/it]
Training 1/1 epoch (loss 2.8014): 18%|ββ | 113/625 [02:51<10:48, 1.27s/it]
Training 1/1 epoch (loss 2.8014): 18%|ββ | 114/625 [02:51<12:59, 1.53s/it]
Training 1/1 epoch (loss 2.8539): 18%|ββ | 114/625 [02:52<12:59, 1.53s/it]
Training 1/1 epoch (loss 2.8539): 18%|ββ | 115/625 [02:52<11:47, 1.39s/it]
Training 1/1 epoch (loss 2.9068): 18%|ββ | 115/625 [02:54<11:47, 1.39s/it]
Training 1/1 epoch (loss 2.9068): 19%|ββ | 116/625 [02:54<13:49, 1.63s/it]
Training 1/1 epoch (loss 2.8915): 19%|ββ | 116/625 [02:56<13:49, 1.63s/it]
Training 1/1 epoch (loss 2.8915): 19%|ββ | 117/625 [02:56<14:28, 1.71s/it]
Training 1/1 epoch (loss 2.4722): 19%|ββ | 117/625 [02:57<14:28, 1.71s/it]
Training 1/1 epoch (loss 2.4722): 19%|ββ | 118/625 [02:57<11:34, 1.37s/it]
Training 1/1 epoch (loss 2.5689): 19%|ββ | 118/625 [02:59<11:34, 1.37s/it]
Training 1/1 epoch (loss 2.5689): 19%|ββ | 119/625 [02:59<14:20, 1.70s/it]
Training 1/1 epoch (loss 2.6336): 19%|ββ | 119/625 [03:01<14:20, 1.70s/it]
Training 1/1 epoch (loss 2.6336): 19%|ββ | 120/625 [03:01<14:49, 1.76s/it]
Training 1/1 epoch (loss 2.5929): 19%|ββ | 120/625 [03:02<14:49, 1.76s/it]
Training 1/1 epoch (loss 2.5929): 19%|ββ | 121/625 [03:02<12:59, 1.55s/it]
Training 1/1 epoch (loss 2.6509): 19%|ββ | 121/625 [03:04<12:59, 1.55s/it]
Training 1/1 epoch (loss 2.6509): 20%|ββ | 122/625 [03:04<13:31, 1.61s/it]
Training 1/1 epoch (loss 2.5967): 20%|ββ | 122/625 [03:05<13:31, 1.61s/it]
Training 1/1 epoch (loss 2.5967): 20%|ββ | 123/625 [03:05<12:14, 1.46s/it]
Training 1/1 epoch (loss 2.8025): 20%|ββ | 123/625 [03:06<12:14, 1.46s/it]
Training 1/1 epoch (loss 2.8025): 20%|ββ | 124/625 [03:06<11:05, 1.33s/it]
Training 1/1 epoch (loss 2.6608): 20%|ββ | 124/625 [03:08<11:05, 1.33s/it]
Training 1/1 epoch (loss 2.6608): 20%|ββ | 125/625 [03:08<12:28, 1.50s/it]
Training 1/1 epoch (loss 2.8467): 20%|ββ | 125/625 [03:09<12:28, 1.50s/it]
Training 1/1 epoch (loss 2.8467): 20%|ββ | 126/625 [03:09<10:47, 1.30s/it]
Training 1/1 epoch (loss 2.7613): 20%|ββ | 126/625 [03:10<10:47, 1.30s/it]
Training 1/1 epoch (loss 2.7613): 20%|ββ | 127/625 [03:10<11:18, 1.36s/it]
Training 1/1 epoch (loss 2.6609): 20%|ββ | 127/625 [03:12<11:18, 1.36s/it]
Training 1/1 epoch (loss 2.6609): 20%|ββ | 128/625 [03:12<12:11, 1.47s/it]
Training 1/1 epoch (loss 2.5671): 20%|ββ | 128/625 [03:12<12:11, 1.47s/it]
Training 1/1 epoch (loss 2.5671): 21%|ββ | 129/625 [03:12<09:55, 1.20s/it]
Training 1/1 epoch (loss 2.7294): 21%|ββ | 129/625 [03:14<09:55, 1.20s/it]
Training 1/1 epoch (loss 2.7294): 21%|ββ | 130/625 [03:14<12:05, 1.46s/it]
Training 1/1 epoch (loss 2.5377): 21%|ββ | 130/625 [03:16<12:05, 1.46s/it]
Training 1/1 epoch (loss 2.5377): 21%|ββ | 131/625 [03:16<12:46, 1.55s/it]
Training 1/1 epoch (loss 2.8872): 21%|ββ | 131/625 [03:17<12:46, 1.55s/it]
Training 1/1 epoch (loss 2.8872): 21%|ββ | 132/625 [03:17<11:57, 1.46s/it]
Training 1/1 epoch (loss 2.7330): 21%|ββ | 132/625 [03:19<11:57, 1.46s/it]
Training 1/1 epoch (loss 2.7330): 21%|βββ | 133/625 [03:19<11:29, 1.40s/it]
Training 1/1 epoch (loss 2.6686): 21%|βββ | 133/625 [03:20<11:29, 1.40s/it]
Training 1/1 epoch (loss 2.6686): 21%|βββ | 134/625 [03:20<12:08, 1.48s/it]
Training 1/1 epoch (loss 2.7897): 21%|βββ | 134/625 [03:22<12:08, 1.48s/it]
Training 1/1 epoch (loss 2.7897): 22%|βββ | 135/625 [03:22<11:45, 1.44s/it]
Training 1/1 epoch (loss 2.8338): 22%|βββ | 135/625 [03:24<11:45, 1.44s/it]
Training 1/1 epoch (loss 2.8338): 22%|βββ | 136/625 [03:24<13:48, 1.69s/it]
Training 1/1 epoch (loss 2.6272): 22%|βββ | 136/625 [03:25<13:48, 1.69s/it]
Training 1/1 epoch (loss 2.6272): 22%|βββ | 137/625 [03:25<11:39, 1.43s/it]
Training 1/1 epoch (loss 2.7788): 22%|βββ | 137/625 [03:27<11:39, 1.43s/it]
Training 1/1 epoch (loss 2.7788): 22%|βββ | 138/625 [03:27<13:51, 1.71s/it]
Training 1/1 epoch (loss 2.8316): 22%|βββ | 138/625 [03:29<13:51, 1.71s/it]
Training 1/1 epoch (loss 2.8316): 22%|βββ | 139/625 [03:29<13:33, 1.67s/it]
Training 1/1 epoch (loss 2.5735): 22%|βββ | 139/625 [03:30<13:33, 1.67s/it]
Training 1/1 epoch (loss 2.5735): 22%|βββ | 140/625 [03:30<11:05, 1.37s/it]
Training 1/1 epoch (loss 2.5801): 22%|βββ | 140/625 [03:32<11:05, 1.37s/it]
Training 1/1 epoch (loss 2.5801): 23%|βββ | 141/625 [03:32<12:47, 1.59s/it]
Training 1/1 epoch (loss 2.6922): 23%|βββ | 141/625 [03:34<12:47, 1.59s/it]
Training 1/1 epoch (loss 2.6922): 23%|βββ | 142/625 [03:34<14:37, 1.82s/it]
Training 1/1 epoch (loss 2.7115): 23%|βββ | 142/625 [03:34<14:37, 1.82s/it]
Training 1/1 epoch (loss 2.7115): 23%|βββ | 143/625 [03:34<11:23, 1.42s/it]
Training 1/1 epoch (loss 2.8272): 23%|βββ | 143/625 [03:36<11:23, 1.42s/it]
Training 1/1 epoch (loss 2.8272): 23%|βββ | 144/625 [03:36<12:46, 1.59s/it]
Training 1/1 epoch (loss 2.9543): 23%|βββ | 144/625 [03:37<12:46, 1.59s/it]
Training 1/1 epoch (loss 2.9543): 23%|βββ | 145/625 [03:37<11:24, 1.43s/it]
Training 1/1 epoch (loss 2.7806): 23%|βββ | 145/625 [03:38<11:24, 1.43s/it]
Training 1/1 epoch (loss 2.7806): 23%|βββ | 146/625 [03:38<09:31, 1.19s/it]
Training 1/1 epoch (loss 2.6470): 23%|βββ | 146/625 [03:40<09:31, 1.19s/it]
Training 1/1 epoch (loss 2.6470): 24%|βββ | 147/625 [03:40<10:58, 1.38s/it]
Training 1/1 epoch (loss 2.6106): 24%|βββ | 147/625 [03:41<10:58, 1.38s/it]
Training 1/1 epoch (loss 2.6106): 24%|βββ | 148/625 [03:41<10:00, 1.26s/it]
Training 1/1 epoch (loss 2.8026): 24%|βββ | 148/625 [03:42<10:00, 1.26s/it]
Training 1/1 epoch (loss 2.8026): 24%|βββ | 149/625 [03:42<09:59, 1.26s/it]
Training 1/1 epoch (loss 2.7438): 24%|βββ | 149/625 [03:44<09:59, 1.26s/it]
Training 1/1 epoch (loss 2.7438): 24%|βββ | 150/625 [03:44<11:19, 1.43s/it]
Training 1/1 epoch (loss 2.7568): 24%|βββ | 150/625 [03:45<11:19, 1.43s/it]
Training 1/1 epoch (loss 2.7568): 24%|βββ | 151/625 [03:45<10:08, 1.28s/it]
Training 1/1 epoch (loss 2.7235): 24%|βββ | 151/625 [03:47<10:08, 1.28s/it]
Training 1/1 epoch (loss 2.7235): 24%|βββ | 152/625 [03:47<11:01, 1.40s/it]
Training 1/1 epoch (loss 2.7581): 24%|βββ | 152/625 [03:48<11:01, 1.40s/it]
Training 1/1 epoch (loss 2.7581): 24%|βββ | 153/625 [03:48<11:20, 1.44s/it]
Training 1/1 epoch (loss 2.7864): 24%|βββ | 153/625 [03:49<11:20, 1.44s/it]
Training 1/1 epoch (loss 2.7864): 25%|βββ | 154/625 [03:49<09:03, 1.15s/it]
Training 1/1 epoch (loss 2.7826): 25%|βββ | 154/625 [03:51<09:03, 1.15s/it]
Training 1/1 epoch (loss 2.7826): 25%|βββ | 155/625 [03:51<11:59, 1.53s/it]
Training 1/1 epoch (loss 2.6805): 25%|βββ | 155/625 [03:53<11:59, 1.53s/it]
Training 1/1 epoch (loss 2.6805): 25%|βββ | 156/625 [03:53<13:24, 1.72s/it]
Training 1/1 epoch (loss 2.9544): 25%|βββ | 156/625 [03:54<13:24, 1.72s/it]
Training 1/1 epoch (loss 2.9544): 25%|βββ | 157/625 [03:54<10:23, 1.33s/it]
Training 1/1 epoch (loss 2.7069): 25%|βββ | 157/625 [03:55<10:23, 1.33s/it]
Training 1/1 epoch (loss 2.7069): 25%|βββ | 158/625 [03:55<11:22, 1.46s/it]
Training 1/1 epoch (loss 2.7147): 25%|βββ | 158/625 [03:57<11:22, 1.46s/it]
Training 1/1 epoch (loss 2.7147): 25%|βββ | 159/625 [03:57<12:12, 1.57s/it]
Training 1/1 epoch (loss 2.5593): 25%|βββ | 159/625 [03:58<12:12, 1.57s/it]
Training 1/1 epoch (loss 2.5593): 26%|βββ | 160/625 [03:58<10:34, 1.36s/it]
Training 1/1 epoch (loss 2.7867): 26%|βββ | 160/625 [04:01<10:34, 1.36s/it]
Training 1/1 epoch (loss 2.7867): 26%|βββ | 161/625 [04:01<13:17, 1.72s/it]
Training 1/1 epoch (loss 2.7577): 26%|βββ | 161/625 [04:02<13:17, 1.72s/it]
Training 1/1 epoch (loss 2.7577): 26%|βββ | 162/625 [04:02<13:23, 1.74s/it]
Training 1/1 epoch (loss 2.8017): 26%|βββ | 162/625 [04:03<13:23, 1.74s/it]
Training 1/1 epoch (loss 2.8017): 26%|βββ | 163/625 [04:03<11:40, 1.52s/it]
Training 1/1 epoch (loss 2.7626): 26%|βββ | 163/625 [04:05<11:40, 1.52s/it]
Training 1/1 epoch (loss 2.7626): 26%|βββ | 164/625 [04:05<12:41, 1.65s/it]
Training 1/1 epoch (loss 2.7071): 26%|βββ | 164/625 [04:07<12:41, 1.65s/it]
Training 1/1 epoch (loss 2.7071): 26%|βββ | 165/625 [04:07<11:58, 1.56s/it]
Training 1/1 epoch (loss 2.6584): 26%|βββ | 165/625 [04:08<11:58, 1.56s/it]
Training 1/1 epoch (loss 2.6584): 27%|βββ | 166/625 [04:08<11:45, 1.54s/it]
Training 1/1 epoch (loss 2.5430): 27%|βββ | 166/625 [04:10<11:45, 1.54s/it]
Training 1/1 epoch (loss 2.5430): 27%|βββ | 167/625 [04:10<11:25, 1.50s/it]
Training 1/1 epoch (loss 2.6625): 27%|βββ | 167/625 [04:10<11:25, 1.50s/it]
Training 1/1 epoch (loss 2.6625): 27%|βββ | 168/625 [04:10<09:44, 1.28s/it]
Training 1/1 epoch (loss 2.4720): 27%|βββ | 168/625 [04:12<09:44, 1.28s/it]
Training 1/1 epoch (loss 2.4720): 27%|βββ | 169/625 [04:12<09:50, 1.30s/it]
Training 1/1 epoch (loss 2.7247): 27%|βββ | 169/625 [04:13<09:50, 1.30s/it]
Training 1/1 epoch (loss 2.7247): 27%|βββ | 170/625 [04:13<09:05, 1.20s/it]
Training 1/1 epoch (loss 2.6597): 27%|βββ | 170/625 [04:13<09:05, 1.20s/it]
Training 1/1 epoch (loss 2.6597): 27%|βββ | 171/625 [04:13<07:53, 1.04s/it]
Training 1/1 epoch (loss 2.8597): 27%|βββ | 171/625 [04:16<07:53, 1.04s/it]
Training 1/1 epoch (loss 2.8597): 28%|βββ | 172/625 [04:16<11:15, 1.49s/it]
Training 1/1 epoch (loss 2.6245): 28%|βββ | 172/625 [04:17<11:15, 1.49s/it]
Training 1/1 epoch (loss 2.6245): 28%|βββ | 173/625 [04:17<11:16, 1.50s/it]
Training 1/1 epoch (loss 2.4941): 28%|βββ | 173/625 [04:19<11:16, 1.50s/it]
Training 1/1 epoch (loss 2.4941): 28%|βββ | 174/625 [04:19<11:09, 1.48s/it]
Training 1/1 epoch (loss 2.7916): 28%|βββ | 174/625 [04:20<11:09, 1.48s/it]
Training 1/1 epoch (loss 2.7916): 28%|βββ | 175/625 [04:20<10:27, 1.39s/it]
Training 1/1 epoch (loss 2.5891): 28%|βββ | 175/625 [04:21<10:27, 1.39s/it]
Training 1/1 epoch (loss 2.5891): 28%|βββ | 176/625 [04:21<09:24, 1.26s/it]
Training 1/1 epoch (loss 2.7181): 28%|βββ | 176/625 [04:24<09:24, 1.26s/it]
Training 1/1 epoch (loss 2.7181): 28%|βββ | 177/625 [04:24<12:15, 1.64s/it]
Training 1/1 epoch (loss 2.6107): 28%|βββ | 177/625 [04:25<12:15, 1.64s/it]
Training 1/1 epoch (loss 2.6107): 28%|βββ | 178/625 [04:25<12:21, 1.66s/it]
Training 1/1 epoch (loss 2.6347): 28%|βββ | 178/625 [04:26<12:21, 1.66s/it]
Training 1/1 epoch (loss 2.6347): 29%|βββ | 179/625 [04:26<09:47, 1.32s/it]
Training 1/1 epoch (loss 2.6102): 29%|βββ | 179/625 [04:28<09:47, 1.32s/it]
Training 1/1 epoch (loss 2.6102): 29%|βββ | 180/625 [04:28<10:43, 1.45s/it]
Training 1/1 epoch (loss 2.7446): 29%|βββ | 180/625 [04:30<10:43, 1.45s/it]
Training 1/1 epoch (loss 2.7446): 29%|βββ | 181/625 [04:30<12:02, 1.63s/it]
Training 1/1 epoch (loss 2.7110): 29%|βββ | 181/625 [04:30<12:02, 1.63s/it]
Training 1/1 epoch (loss 2.7110): 29%|βββ | 182/625 [04:30<09:47, 1.33s/it]
Training 1/1 epoch (loss 2.8523): 29%|βββ | 182/625 [04:31<09:47, 1.33s/it]
Training 1/1 epoch (loss 2.8523): 29%|βββ | 183/625 [04:31<09:01, 1.23s/it]
Training 1/1 epoch (loss 2.6704): 29%|βββ | 183/625 [04:33<09:01, 1.23s/it]
Training 1/1 epoch (loss 2.6704): 29%|βββ | 184/625 [04:33<09:18, 1.27s/it]
Training 1/1 epoch (loss 2.6045): 29%|βββ | 184/625 [04:33<09:18, 1.27s/it]
Training 1/1 epoch (loss 2.6045): 30%|βββ | 185/625 [04:33<08:18, 1.13s/it]
Training 1/1 epoch (loss 2.5882): 30%|βββ | 185/625 [04:35<08:18, 1.13s/it]
Training 1/1 epoch (loss 2.5882): 30%|βββ | 186/625 [04:35<09:17, 1.27s/it]
Training 1/1 epoch (loss 2.8107): 30%|βββ | 186/625 [04:36<09:17, 1.27s/it]
Training 1/1 epoch (loss 2.8107): 30%|βββ | 187/625 [04:36<09:04, 1.24s/it]
Training 1/1 epoch (loss 2.5160): 30%|βββ | 187/625 [04:38<09:04, 1.24s/it]
Training 1/1 epoch (loss 2.5160): 30%|βββ | 188/625 [04:38<09:53, 1.36s/it]
Training 1/1 epoch (loss 2.6575): 30%|βββ | 188/625 [04:40<09:53, 1.36s/it]
Training 1/1 epoch (loss 2.6575): 30%|βββ | 189/625 [04:40<12:03, 1.66s/it]
Training 1/1 epoch (loss 2.8783): 30%|βββ | 189/625 [04:41<12:03, 1.66s/it]
Training 1/1 epoch (loss 2.8783): 30%|βββ | 190/625 [04:41<09:28, 1.31s/it]
Training 1/1 epoch (loss 2.6745): 30%|βββ | 190/625 [04:43<09:28, 1.31s/it]
Training 1/1 epoch (loss 2.6745): 31%|βββ | 191/625 [04:43<11:52, 1.64s/it]
Training 1/1 epoch (loss 2.7075): 31%|βββ | 191/625 [04:45<11:52, 1.64s/it]
Training 1/1 epoch (loss 2.7075): 31%|βββ | 192/625 [04:45<12:04, 1.67s/it]
Training 1/1 epoch (loss 2.5331): 31%|βββ | 192/625 [04:46<12:04, 1.67s/it]
Training 1/1 epoch (loss 2.5331): 31%|βββ | 193/625 [04:46<10:34, 1.47s/it]
Training 1/1 epoch (loss 2.6948): 31%|βββ | 193/625 [04:48<10:34, 1.47s/it]
Training 1/1 epoch (loss 2.6948): 31%|βββ | 194/625 [04:48<12:12, 1.70s/it]
Training 1/1 epoch (loss 2.5772): 31%|βββ | 194/625 [04:50<12:12, 1.70s/it]
Training 1/1 epoch (loss 2.5772): 31%|βββ | 195/625 [04:50<11:53, 1.66s/it]
Training 1/1 epoch (loss 2.9320): 31%|βββ | 195/625 [04:51<11:53, 1.66s/it]
Training 1/1 epoch (loss 2.9320): 31%|ββββ | 196/625 [04:51<11:16, 1.58s/it]
Training 1/1 epoch (loss 2.8862): 31%|ββββ | 196/625 [04:53<11:16, 1.58s/it]
Training 1/1 epoch (loss 2.8862): 32%|ββββ | 197/625 [04:53<12:51, 1.80s/it]
Training 1/1 epoch (loss 2.6539): 32%|ββββ | 197/625 [04:54<12:51, 1.80s/it]
Training 1/1 epoch (loss 2.6539): 32%|ββββ | 198/625 [04:54<10:40, 1.50s/it]
Training 1/1 epoch (loss 2.6874): 32%|ββββ | 198/625 [04:55<10:40, 1.50s/it]
Training 1/1 epoch (loss 2.6874): 32%|ββββ | 199/625 [04:55<09:21, 1.32s/it]
Training 1/1 epoch (loss 2.7558): 32%|ββββ | 199/625 [04:57<09:21, 1.32s/it]
Training 1/1 epoch (loss 2.7558): 32%|ββββ | 200/625 [04:57<11:49, 1.67s/it]
Training 1/1 epoch (loss 2.8161): 32%|ββββ | 200/625 [04:58<11:49, 1.67s/it]
Training 1/1 epoch (loss 2.8161): 32%|ββββ | 201/625 [04:58<09:52, 1.40s/it]
Training 1/1 epoch (loss 2.6128): 32%|ββββ | 201/625 [05:00<09:52, 1.40s/it]
Training 1/1 epoch (loss 2.6128): 32%|ββββ | 202/625 [05:00<10:54, 1.55s/it]
Training 1/1 epoch (loss 2.6654): 32%|ββββ | 202/625 [05:02<10:54, 1.55s/it]
Training 1/1 epoch (loss 2.6654): 32%|ββββ | 203/625 [05:02<10:58, 1.56s/it]
Training 1/1 epoch (loss 2.7485): 32%|ββββ | 203/625 [05:02<10:58, 1.56s/it]
Training 1/1 epoch (loss 2.7485): 33%|ββββ | 204/625 [05:02<09:06, 1.30s/it]
Training 1/1 epoch (loss 2.7811): 33%|ββββ | 204/625 [05:04<09:06, 1.30s/it]
Training 1/1 epoch (loss 2.7811): 33%|ββββ | 205/625 [05:04<10:35, 1.51s/it]
Training 1/1 epoch (loss 2.9098): 33%|ββββ | 205/625 [05:06<10:35, 1.51s/it]
Training 1/1 epoch (loss 2.9098): 33%|ββββ | 206/625 [05:06<11:31, 1.65s/it]
Training 1/1 epoch (loss 2.4645): 33%|ββββ | 206/625 [05:07<11:31, 1.65s/it]
Training 1/1 epoch (loss 2.4645): 33%|ββββ | 207/625 [05:07<10:01, 1.44s/it]
Training 1/1 epoch (loss 2.6918): 33%|ββββ | 207/625 [05:09<10:01, 1.44s/it]
Training 1/1 epoch (loss 2.6918): 33%|ββββ | 208/625 [05:09<11:13, 1.61s/it]
Training 1/1 epoch (loss 2.4261): 33%|ββββ | 208/625 [05:10<11:13, 1.61s/it]
Training 1/1 epoch (loss 2.4261): 33%|ββββ | 209/625 [05:10<09:35, 1.38s/it]
Training 1/1 epoch (loss 2.6887): 33%|ββββ | 209/625 [05:12<09:35, 1.38s/it]
Training 1/1 epoch (loss 2.6887): 34%|ββββ | 210/625 [05:12<11:10, 1.62s/it]
Training 1/1 epoch (loss 2.8725): 34%|ββββ | 210/625 [05:15<11:10, 1.62s/it]
Training 1/1 epoch (loss 2.8725): 34%|ββββ | 211/625 [05:15<12:28, 1.81s/it]
Training 1/1 epoch (loss 2.7904): 34%|ββββ | 211/625 [05:15<12:28, 1.81s/it]
Training 1/1 epoch (loss 2.7904): 34%|ββββ | 212/625 [05:15<10:15, 1.49s/it]
Training 1/1 epoch (loss 2.9303): 34%|ββββ | 212/625 [05:18<10:15, 1.49s/it]
Training 1/1 epoch (loss 2.9303): 34%|ββββ | 213/625 [05:18<12:08, 1.77s/it]
Training 1/1 epoch (loss 2.8143): 34%|ββββ | 213/625 [05:19<12:08, 1.77s/it]
Training 1/1 epoch (loss 2.8143): 34%|ββββ | 214/625 [05:19<10:51, 1.59s/it]
Training 1/1 epoch (loss 2.4853): 34%|ββββ | 214/625 [05:20<10:51, 1.59s/it]
Training 1/1 epoch (loss 2.4853): 34%|ββββ | 215/625 [05:20<09:03, 1.33s/it]
Training 1/1 epoch (loss 2.4569): 34%|ββββ | 215/625 [05:22<09:03, 1.33s/it]
Training 1/1 epoch (loss 2.4569): 35%|ββββ | 216/625 [05:22<10:41, 1.57s/it]
Training 1/1 epoch (loss 2.7469): 35%|ββββ | 216/625 [05:23<10:41, 1.57s/it]
Training 1/1 epoch (loss 2.7469): 35%|ββββ | 217/625 [05:23<09:38, 1.42s/it]
Training 1/1 epoch (loss 2.4640): 35%|ββββ | 217/625 [05:25<09:38, 1.42s/it]
Training 1/1 epoch (loss 2.4640): 35%|ββββ | 218/625 [05:25<11:15, 1.66s/it]
Training 1/1 epoch (loss 2.8972): 35%|ββββ | 218/625 [05:27<11:15, 1.66s/it]
Training 1/1 epoch (loss 2.8972): 35%|ββββ | 219/625 [05:27<11:28, 1.70s/it]
Training 1/1 epoch (loss 2.9165): 35%|ββββ | 219/625 [05:28<11:28, 1.70s/it]
Training 1/1 epoch (loss 2.9165): 35%|ββββ | 220/625 [05:28<10:10, 1.51s/it]
Training 1/1 epoch (loss 2.8074): 35%|ββββ | 220/625 [05:30<10:10, 1.51s/it]
Training 1/1 epoch (loss 2.8074): 35%|ββββ | 221/625 [05:30<11:06, 1.65s/it]
Training 1/1 epoch (loss 2.6938): 35%|ββββ | 221/625 [05:31<11:06, 1.65s/it]
Training 1/1 epoch (loss 2.6938): 36%|ββββ | 222/625 [05:31<09:38, 1.44s/it]
Training 1/1 epoch (loss 2.9537): 36%|ββββ | 222/625 [05:32<09:38, 1.44s/it]
Training 1/1 epoch (loss 2.9537): 36%|ββββ | 223/625 [05:32<09:31, 1.42s/it]
Training 1/1 epoch (loss 2.7288): 36%|ββββ | 223/625 [05:34<09:31, 1.42s/it]
Training 1/1 epoch (loss 2.7288): 36%|ββββ | 224/625 [05:34<10:29, 1.57s/it]
Training 1/1 epoch (loss 2.7337): 36%|ββββ | 224/625 [05:35<10:29, 1.57s/it]
Training 1/1 epoch (loss 2.7337): 36%|ββββ | 225/625 [05:35<08:46, 1.32s/it]
Training 1/1 epoch (loss 2.9146): 36%|ββββ | 225/625 [05:36<08:46, 1.32s/it]
Training 1/1 epoch (loss 2.9146): 36%|ββββ | 226/625 [05:36<09:20, 1.40s/it]
Training 1/1 epoch (loss 2.6339): 36%|ββββ | 226/625 [05:38<09:20, 1.40s/it]
Training 1/1 epoch (loss 2.6339): 36%|ββββ | 227/625 [05:38<10:32, 1.59s/it]
Training 1/1 epoch (loss 2.5008): 36%|ββββ | 227/625 [05:39<10:32, 1.59s/it]
Training 1/1 epoch (loss 2.5008): 36%|ββββ | 228/625 [05:39<09:05, 1.38s/it]
Training 1/1 epoch (loss 2.7786): 36%|ββββ | 228/625 [05:42<09:05, 1.38s/it]
Training 1/1 epoch (loss 2.7786): 37%|ββββ | 229/625 [05:42<10:53, 1.65s/it]
Training 1/1 epoch (loss 2.7262): 37%|ββββ | 229/625 [05:43<10:53, 1.65s/it]
Training 1/1 epoch (loss 2.7262): 37%|ββββ | 230/625 [05:43<10:07, 1.54s/it]
Training 1/1 epoch (loss 2.7217): 37%|ββββ | 230/625 [05:43<10:07, 1.54s/it]
Training 1/1 epoch (loss 2.7217): 37%|ββββ | 231/625 [05:43<08:01, 1.22s/it]
Training 1/1 epoch (loss 2.8493): 37%|ββββ | 231/625 [05:46<08:01, 1.22s/it]
Training 1/1 epoch (loss 2.8493): 37%|ββββ | 232/625 [05:46<10:00, 1.53s/it]
Training 1/1 epoch (loss 2.6410): 37%|ββββ | 232/625 [05:47<10:00, 1.53s/it]
Training 1/1 epoch (loss 2.6410): 37%|ββββ | 233/625 [05:47<10:08, 1.55s/it]
Training 1/1 epoch (loss 2.7297): 37%|ββββ | 233/625 [05:48<10:08, 1.55s/it]
Training 1/1 epoch (loss 2.7297): 37%|ββββ | 234/625 [05:48<08:17, 1.27s/it]
Training 1/1 epoch (loss 2.7449): 37%|ββββ | 234/625 [05:50<08:17, 1.27s/it]
Training 1/1 epoch (loss 2.7449): 38%|ββββ | 235/625 [05:50<09:15, 1.43s/it]
Training 1/1 epoch (loss 2.7767): 38%|ββββ | 235/625 [05:51<09:15, 1.43s/it]
Training 1/1 epoch (loss 2.7767): 38%|ββββ | 236/625 [05:51<08:58, 1.38s/it]
Training 1/1 epoch (loss 2.7343): 38%|ββββ | 236/625 [05:52<08:58, 1.38s/it]
Training 1/1 epoch (loss 2.7343): 38%|ββββ | 237/625 [05:52<08:43, 1.35s/it]
Training 1/1 epoch (loss 2.5849): 38%|ββββ | 237/625 [05:54<08:43, 1.35s/it]
Training 1/1 epoch (loss 2.5849): 38%|ββββ | 238/625 [05:54<09:50, 1.53s/it]
Training 1/1 epoch (loss 2.6790): 38%|ββββ | 238/625 [05:55<09:50, 1.53s/it]
Training 1/1 epoch (loss 2.6790): 38%|ββββ | 239/625 [05:55<09:15, 1.44s/it]
Training 1/1 epoch (loss 2.5352): 38%|ββββ | 239/625 [05:57<09:15, 1.44s/it]
Training 1/1 epoch (loss 2.5352): 38%|ββββ | 240/625 [05:57<10:22, 1.62s/it]
Training 1/1 epoch (loss 2.6289): 38%|ββββ | 240/625 [05:59<10:22, 1.62s/it]
Training 1/1 epoch (loss 2.6289): 39%|ββββ | 241/625 [05:59<09:30, 1.49s/it]
Training 1/1 epoch (loss 2.9170): 39%|ββββ | 241/625 [05:59<09:30, 1.49s/it]
Training 1/1 epoch (loss 2.9170): 39%|ββββ | 242/625 [05:59<07:28, 1.17s/it]
Training 1/1 epoch (loss 2.7297): 39%|ββββ | 242/625 [06:00<07:28, 1.17s/it]
Training 1/1 epoch (loss 2.7297): 39%|ββββ | 243/625 [06:00<07:22, 1.16s/it]
Training 1/1 epoch (loss 2.6903): 39%|ββββ | 243/625 [06:02<07:22, 1.16s/it]
Training 1/1 epoch (loss 2.6903): 39%|ββββ | 244/625 [06:02<08:40, 1.37s/it]
Training 1/1 epoch (loss 2.7994): 39%|ββββ | 244/625 [06:02<08:40, 1.37s/it]
Training 1/1 epoch (loss 2.7994): 39%|ββββ | 245/625 [06:02<06:56, 1.10s/it]
Training 1/1 epoch (loss 2.6855): 39%|ββββ | 245/625 [06:05<06:56, 1.10s/it]
Training 1/1 epoch (loss 2.6855): 39%|ββββ | 246/625 [06:05<09:18, 1.47s/it]
Training 1/1 epoch (loss 2.7200): 39%|ββββ | 246/625 [06:07<09:18, 1.47s/it]
Training 1/1 epoch (loss 2.7200): 40%|ββββ | 247/625 [06:07<10:35, 1.68s/it]
Training 1/1 epoch (loss 2.4868): 40%|ββββ | 247/625 [06:08<10:35, 1.68s/it]
Training 1/1 epoch (loss 2.4868): 40%|ββββ | 248/625 [06:08<08:52, 1.41s/it]
Training 1/1 epoch (loss 2.6062): 40%|ββββ | 248/625 [06:10<08:52, 1.41s/it]
Training 1/1 epoch (loss 2.6062): 40%|ββββ | 249/625 [06:10<11:00, 1.76s/it]
Training 1/1 epoch (loss 2.9702): 40%|ββββ | 249/625 [06:12<11:00, 1.76s/it]
Training 1/1 epoch (loss 2.9702): 40%|ββββ | 250/625 [06:12<10:25, 1.67s/it]
Training 1/1 epoch (loss 2.5977): 40%|ββββ | 250/625 [06:13<10:25, 1.67s/it]
Training 1/1 epoch (loss 2.5977): 40%|ββββ | 251/625 [06:13<09:31, 1.53s/it]
Training 1/1 epoch (loss 2.8784): 40%|ββββ | 251/625 [06:14<09:31, 1.53s/it]
Training 1/1 epoch (loss 2.8784): 40%|ββββ | 252/625 [06:14<09:22, 1.51s/it]
Training 1/1 epoch (loss 2.7659): 40%|ββββ | 252/625 [06:16<09:22, 1.51s/it]
Training 1/1 epoch (loss 2.7659): 40%|ββββ | 253/625 [06:16<08:27, 1.37s/it]
Training 1/1 epoch (loss 2.5698): 40%|ββββ | 253/625 [06:18<08:27, 1.37s/it]
Training 1/1 epoch (loss 2.5698): 41%|ββββ | 254/625 [06:18<10:16, 1.66s/it]
Training 1/1 epoch (loss 2.8076): 41%|ββββ | 254/625 [06:20<10:16, 1.66s/it]
Training 1/1 epoch (loss 2.8076): 41%|ββββ | 255/625 [06:20<10:56, 1.77s/it]
Training 1/1 epoch (loss 2.6730): 41%|ββββ | 255/625 [06:21<10:56, 1.77s/it]
Training 1/1 epoch (loss 2.6730): 41%|ββββ | 256/625 [06:21<08:53, 1.45s/it]
Training 1/1 epoch (loss 2.5759): 41%|ββββ | 256/625 [06:22<08:53, 1.45s/it]
Training 1/1 epoch (loss 2.5759): 41%|ββββ | 257/625 [06:22<08:36, 1.40s/it]
Training 1/1 epoch (loss 2.6367): 41%|ββββ | 257/625 [06:24<08:36, 1.40s/it]
Training 1/1 epoch (loss 2.6367): 41%|βββββ | 258/625 [06:24<09:20, 1.53s/it]
Training 1/1 epoch (loss 2.8692): 41%|βββββ | 258/625 [06:25<09:20, 1.53s/it]
Training 1/1 epoch (loss 2.8692): 41%|βββββ | 259/625 [06:25<08:56, 1.47s/it]
Training 1/1 epoch (loss 2.7190): 41%|βββββ | 259/625 [06:26<08:56, 1.47s/it]
Training 1/1 epoch (loss 2.7190): 42%|βββββ | 260/625 [06:26<08:48, 1.45s/it]
Training 1/1 epoch (loss 2.6526): 42%|βββββ | 260/625 [06:27<08:48, 1.45s/it]
Training 1/1 epoch (loss 2.6526): 42%|βββββ | 261/625 [06:27<07:44, 1.28s/it]
Training 1/1 epoch (loss 2.6804): 42%|βββββ | 261/625 [06:29<07:44, 1.28s/it]
Training 1/1 epoch (loss 2.6804): 42%|βββββ | 262/625 [06:29<08:41, 1.44s/it]
Training 1/1 epoch (loss 2.6112): 42%|βββββ | 262/625 [06:31<08:41, 1.44s/it]
Training 1/1 epoch (loss 2.6112): 42%|βββββ | 263/625 [06:31<09:57, 1.65s/it]
Training 1/1 epoch (loss 2.3878): 42%|βββββ | 263/625 [06:32<09:57, 1.65s/it]
Training 1/1 epoch (loss 2.3878): 42%|βββββ | 264/625 [06:32<08:51, 1.47s/it]
Training 1/1 epoch (loss 2.5889): 42%|βββββ | 264/625 [06:34<08:51, 1.47s/it]
Training 1/1 epoch (loss 2.5889): 42%|βββββ | 265/625 [06:34<09:51, 1.64s/it]
Training 1/1 epoch (loss 2.5578): 42%|βββββ | 265/625 [06:36<09:51, 1.64s/it]
Training 1/1 epoch (loss 2.5578): 43%|βββββ | 266/625 [06:36<10:10, 1.70s/it]
Training 1/1 epoch (loss 2.8340): 43%|βββββ | 266/625 [06:38<10:10, 1.70s/it]
Training 1/1 epoch (loss 2.8340): 43%|βββββ | 267/625 [06:38<09:43, 1.63s/it]
Training 1/1 epoch (loss 2.6524): 43%|βββββ | 267/625 [06:40<09:43, 1.63s/it]
Training 1/1 epoch (loss 2.6524): 43%|βββββ | 268/625 [06:40<10:19, 1.74s/it]
Training 1/1 epoch (loss 2.7538): 43%|βββββ | 268/625 [06:40<10:19, 1.74s/it]
Training 1/1 epoch (loss 2.7538): 43%|βββββ | 269/625 [06:40<08:33, 1.44s/it]
Training 1/1 epoch (loss 2.6006): 43%|βββββ | 269/625 [06:42<08:33, 1.44s/it]
Training 1/1 epoch (loss 2.6006): 43%|βββββ | 270/625 [06:42<08:40, 1.47s/it]
Training 1/1 epoch (loss 2.8052): 43%|βββββ | 270/625 [06:44<08:40, 1.47s/it]
Training 1/1 epoch (loss 2.8052): 43%|βββββ | 271/625 [06:44<10:13, 1.73s/it]
Training 1/1 epoch (loss 2.6955): 43%|βββββ | 271/625 [06:46<10:13, 1.73s/it]
Training 1/1 epoch (loss 2.6955): 44%|βββββ | 272/625 [06:46<09:21, 1.59s/it]
Training 1/1 epoch (loss 2.6966): 44%|βββββ | 272/625 [06:47<09:21, 1.59s/it]
Training 1/1 epoch (loss 2.6966): 44%|βββββ | 273/625 [06:47<09:47, 1.67s/it]
Training 1/1 epoch (loss 2.5798): 44%|βββββ | 273/625 [06:50<09:47, 1.67s/it]
Training 1/1 epoch (loss 2.5798): 44%|βββββ | 274/625 [06:50<10:39, 1.82s/it]
Training 1/1 epoch (loss 2.4737): 44%|βββββ | 274/625 [06:50<10:39, 1.82s/it]
Training 1/1 epoch (loss 2.4737): 44%|βββββ | 275/625 [06:50<08:21, 1.43s/it]
Training 1/1 epoch (loss 2.5168): 44%|βββββ | 275/625 [06:52<08:21, 1.43s/it]
Training 1/1 epoch (loss 2.5168): 44%|βββββ | 276/625 [06:52<09:48, 1.69s/it]
Training 1/1 epoch (loss 2.6905): 44%|βββββ | 276/625 [06:54<09:48, 1.69s/it]
Training 1/1 epoch (loss 2.6905): 44%|βββββ | 277/625 [06:54<09:56, 1.71s/it]
Training 1/1 epoch (loss 2.7114): 44%|βββββ | 277/625 [06:55<09:56, 1.71s/it]
Training 1/1 epoch (loss 2.7114): 44%|βββββ | 278/625 [06:55<07:44, 1.34s/it]
Training 1/1 epoch (loss 2.7283): 44%|βββββ | 278/625 [06:56<07:44, 1.34s/it]
Training 1/1 epoch (loss 2.7283): 45%|βββββ | 279/625 [06:56<07:42, 1.34s/it]
Training 1/1 epoch (loss 2.6153): 45%|βββββ | 279/625 [06:58<07:42, 1.34s/it]
Training 1/1 epoch (loss 2.6153): 45%|βββββ | 280/625 [06:58<09:00, 1.57s/it]
Training 1/1 epoch (loss 2.8296): 45%|βββββ | 280/625 [06:59<09:00, 1.57s/it]
Training 1/1 epoch (loss 2.8296): 45%|βββββ | 281/625 [06:59<07:37, 1.33s/it]
Training 1/1 epoch (loss 2.7119): 45%|βββββ | 281/625 [07:01<07:37, 1.33s/it]
Training 1/1 epoch (loss 2.7119): 45%|βββββ | 282/625 [07:01<08:12, 1.44s/it]
Training 1/1 epoch (loss 2.6736): 45%|βββββ | 282/625 [07:02<08:12, 1.44s/it]
Training 1/1 epoch (loss 2.6736): 45%|βββββ | 283/625 [07:02<07:49, 1.37s/it]
Training 1/1 epoch (loss 2.8747): 45%|βββββ | 283/625 [07:04<07:49, 1.37s/it]
Training 1/1 epoch (loss 2.8747): 45%|βββββ | 284/625 [07:04<08:43, 1.53s/it]
Training 1/1 epoch (loss 2.6124): 45%|βββββ | 284/625 [07:06<08:43, 1.53s/it]
Training 1/1 epoch (loss 2.6124): 46%|βββββ | 285/625 [07:06<09:45, 1.72s/it]
Training 1/1 epoch (loss 2.6019): 46%|βββββ | 285/625 [07:06<09:45, 1.72s/it]
Training 1/1 epoch (loss 2.6019): 46%|βββββ | 286/625 [07:06<07:43, 1.37s/it]
Training 1/1 epoch (loss 2.6453): 46%|βββββ | 286/625 [07:09<07:43, 1.37s/it]
Training 1/1 epoch (loss 2.6453): 46%|βββββ | 287/625 [07:09<09:29, 1.69s/it]
Training 1/1 epoch (loss 2.5293): 46%|βββββ | 287/625 [07:11<09:29, 1.69s/it]
Training 1/1 epoch (loss 2.5293): 46%|βββββ | 288/625 [07:11<10:07, 1.80s/it]
Training 1/1 epoch (loss 2.6486): 46%|βββββ | 288/625 [07:11<10:07, 1.80s/it]
Training 1/1 epoch (loss 2.6486): 46%|βββββ | 289/625 [07:11<07:58, 1.42s/it]
Training 1/1 epoch (loss 2.8243): 46%|βββββ | 289/625 [07:13<07:58, 1.42s/it]
Training 1/1 epoch (loss 2.8243): 46%|βββββ | 290/625 [07:13<07:58, 1.43s/it]
Training 1/1 epoch (loss 2.7588): 46%|βββββ | 290/625 [07:14<07:58, 1.43s/it]
Training 1/1 epoch (loss 2.7588): 47%|βββββ | 291/625 [07:14<07:55, 1.43s/it]
Training 1/1 epoch (loss 3.0016): 47%|βββββ | 291/625 [07:16<07:55, 1.43s/it]
Training 1/1 epoch (loss 3.0016): 47%|βββββ | 292/625 [07:16<08:48, 1.59s/it]
Training 1/1 epoch (loss 2.7822): 47%|βββββ | 292/625 [07:17<08:48, 1.59s/it]
Training 1/1 epoch (loss 2.7822): 47%|βββββ | 293/625 [07:17<08:05, 1.46s/it]
Training 1/1 epoch (loss 2.5584): 47%|βββββ | 293/625 [07:18<08:05, 1.46s/it]
Training 1/1 epoch (loss 2.5584): 47%|βββββ | 294/625 [07:18<06:53, 1.25s/it]
Training 1/1 epoch (loss 2.7073): 47%|βββββ | 294/625 [07:20<06:53, 1.25s/it]
Training 1/1 epoch (loss 2.7073): 47%|βββββ | 295/625 [07:20<07:47, 1.42s/it]
Training 1/1 epoch (loss 2.5609): 47%|βββββ | 295/625 [07:22<07:47, 1.42s/it]
Training 1/1 epoch (loss 2.5609): 47%|βββββ | 296/625 [07:22<09:03, 1.65s/it]
Training 1/1 epoch (loss 2.8594): 47%|βββββ | 296/625 [07:23<09:03, 1.65s/it]
Training 1/1 epoch (loss 2.8594): 48%|βββββ | 297/625 [07:23<07:51, 1.44s/it]
Training 1/1 epoch (loss 2.4890): 48%|βββββ | 297/625 [07:24<07:51, 1.44s/it]
Training 1/1 epoch (loss 2.4890): 48%|βββββ | 298/625 [07:24<07:37, 1.40s/it]
Training 1/1 epoch (loss 2.3370): 48%|βββββ | 298/625 [07:26<07:37, 1.40s/it]
Training 1/1 epoch (loss 2.3370): 48%|βββββ | 299/625 [07:26<07:32, 1.39s/it]
Training 1/1 epoch (loss 2.5486): 48%|βββββ | 299/625 [07:27<07:32, 1.39s/it]
Training 1/1 epoch (loss 2.5486): 48%|βββββ | 300/625 [07:27<07:01, 1.30s/it]
Training 1/1 epoch (loss 2.5924): 48%|βββββ | 300/625 [07:28<07:01, 1.30s/it]
Training 1/1 epoch (loss 2.5924): 48%|βββββ | 301/625 [07:28<07:23, 1.37s/it]
Training 1/1 epoch (loss 2.7215): 48%|βββββ | 301/625 [07:29<07:23, 1.37s/it]
Training 1/1 epoch (loss 2.7215): 48%|βββββ | 302/625 [07:29<06:34, 1.22s/it]
Training 1/1 epoch (loss 2.5973): 48%|βββββ | 302/625 [07:30<06:34, 1.22s/it]
Training 1/1 epoch (loss 2.5973): 48%|βββββ | 303/625 [07:30<06:25, 1.20s/it]
Training 1/1 epoch (loss 2.8029): 48%|βββββ | 303/625 [07:32<06:25, 1.20s/it]
Training 1/1 epoch (loss 2.8029): 49%|βββββ | 304/625 [07:32<06:57, 1.30s/it]
Training 1/1 epoch (loss 2.6778): 49%|βββββ | 304/625 [07:33<06:57, 1.30s/it]
Training 1/1 epoch (loss 2.6778): 49%|βββββ | 305/625 [07:33<06:24, 1.20s/it]
Training 1/1 epoch (loss 2.4968): 49%|βββββ | 305/625 [07:35<06:24, 1.20s/it]
Training 1/1 epoch (loss 2.4968): 49%|βββββ | 306/625 [07:35<07:36, 1.43s/it]
Training 1/1 epoch (loss 2.8947): 49%|βββββ | 306/625 [07:37<07:36, 1.43s/it]
Training 1/1 epoch (loss 2.8947): 49%|βββββ | 307/625 [07:37<08:25, 1.59s/it]
Training 1/1 epoch (loss 2.7061): 49%|βββββ | 307/625 [07:37<08:25, 1.59s/it]
Training 1/1 epoch (loss 2.7061): 49%|βββββ | 308/625 [07:37<06:47, 1.29s/it]
Training 1/1 epoch (loss 2.6390): 49%|βββββ | 308/625 [07:39<06:47, 1.29s/it]
Training 1/1 epoch (loss 2.6390): 49%|βββββ | 309/625 [07:39<07:15, 1.38s/it]
Training 1/1 epoch (loss 2.5625): 49%|βββββ | 309/625 [07:41<07:15, 1.38s/it]
Training 1/1 epoch (loss 2.5625): 50%|βββββ | 310/625 [07:41<07:32, 1.44s/it]
Training 1/1 epoch (loss 2.4717): 50%|βββββ | 310/625 [07:41<07:32, 1.44s/it]
Training 1/1 epoch (loss 2.4717): 50%|βββββ | 311/625 [07:41<06:16, 1.20s/it]
Training 1/1 epoch (loss 2.7386): 50%|βββββ | 311/625 [07:43<06:16, 1.20s/it]
Training 1/1 epoch (loss 2.7386): 50%|βββββ | 312/625 [07:43<07:56, 1.52s/it]
Training 1/1 epoch (loss 2.5739): 50%|βββββ | 312/625 [07:45<07:56, 1.52s/it]
Training 1/1 epoch (loss 2.5739): 50%|βββββ | 313/625 [07:45<07:34, 1.46s/it]
Training 1/1 epoch (loss 2.6126): 50%|βββββ | 313/625 [07:47<07:34, 1.46s/it]
Training 1/1 epoch (loss 2.6126): 50%|βββββ | 314/625 [07:47<08:43, 1.68s/it]
Training 1/1 epoch (loss 2.8712): 50%|βββββ | 314/625 [07:49<08:43, 1.68s/it]
Training 1/1 epoch (loss 2.8712): 50%|βββββ | 315/625 [07:49<08:53, 1.72s/it]
Training 1/1 epoch (loss 2.7967): 50%|βββββ | 315/625 [07:50<08:53, 1.72s/it]
Training 1/1 epoch (loss 2.7967): 51%|βββββ | 316/625 [07:50<07:33, 1.47s/it]
Training 1/1 epoch (loss 2.7933): 51%|βββββ | 316/625 [07:52<07:33, 1.47s/it]
Training 1/1 epoch (loss 2.7933): 51%|βββββ | 317/625 [07:52<08:56, 1.74s/it]
Training 1/1 epoch (loss 2.6033): 51%|βββββ | 317/625 [07:54<08:56, 1.74s/it]
Training 1/1 epoch (loss 2.6033): 51%|βββββ | 318/625 [07:54<09:05, 1.78s/it]
Training 1/1 epoch (loss 2.7563): 51%|βββββ | 318/625 [07:54<09:05, 1.78s/it]
Training 1/1 epoch (loss 2.7563): 51%|βββββ | 319/625 [07:54<07:09, 1.40s/it]
Training 1/1 epoch (loss 2.6090): 51%|βββββ | 319/625 [07:57<07:09, 1.40s/it]
Training 1/1 epoch (loss 2.6090): 51%|βββββ | 320/625 [07:57<09:02, 1.78s/it]
Training 1/1 epoch (loss 2.7274): 51%|βββββ | 320/625 [07:59<09:02, 1.78s/it]
Training 1/1 epoch (loss 2.7274): 51%|ββββββ | 321/625 [07:59<08:35, 1.69s/it]
Training 1/1 epoch (loss 2.6626): 51%|ββββββ | 321/625 [07:59<08:35, 1.69s/it]
Training 1/1 epoch (loss 2.6626): 52%|ββββββ | 322/625 [07:59<07:05, 1.41s/it]
Training 1/1 epoch (loss 2.7500): 52%|ββββββ | 322/625 [08:01<07:05, 1.41s/it]
Training 1/1 epoch (loss 2.7500): 52%|ββββββ | 323/625 [08:01<07:28, 1.48s/it]
Training 1/1 epoch (loss 2.7563): 52%|ββββββ | 323/625 [08:02<07:28, 1.48s/it]
Training 1/1 epoch (loss 2.7563): 52%|ββββββ | 324/625 [08:02<07:11, 1.43s/it]
Training 1/1 epoch (loss 2.8814): 52%|ββββββ | 324/625 [08:04<07:11, 1.43s/it]
Training 1/1 epoch (loss 2.8814): 52%|ββββββ | 325/625 [08:04<07:27, 1.49s/it]
Training 1/1 epoch (loss 2.6032): 52%|ββββββ | 325/625 [08:06<07:27, 1.49s/it]
Training 1/1 epoch (loss 2.6032): 52%|ββββββ | 326/625 [08:06<07:39, 1.54s/it]
Training 1/1 epoch (loss 2.6840): 52%|ββββββ | 326/625 [08:07<07:39, 1.54s/it]
Training 1/1 epoch (loss 2.6840): 52%|ββββββ | 327/625 [08:07<07:03, 1.42s/it]
Training 1/1 epoch (loss 2.5940): 52%|ββββββ | 327/625 [08:09<07:03, 1.42s/it]
Training 1/1 epoch (loss 2.5940): 52%|ββββββ | 328/625 [08:09<07:49, 1.58s/it]
Training 1/1 epoch (loss 2.6007): 52%|ββββββ | 328/625 [08:10<07:49, 1.58s/it]
Training 1/1 epoch (loss 2.6007): 53%|ββββββ | 329/625 [08:10<07:23, 1.50s/it]
Training 1/1 epoch (loss 2.6303): 53%|ββββββ | 329/625 [08:10<07:23, 1.50s/it]
Training 1/1 epoch (loss 2.6303): 53%|ββββββ | 330/625 [08:10<05:47, 1.18s/it]
Training 1/1 epoch (loss 2.7225): 53%|ββββββ | 330/625 [08:12<05:47, 1.18s/it]
Training 1/1 epoch (loss 2.7225): 53%|ββββββ | 331/625 [08:12<06:26, 1.32s/it]
Training 1/1 epoch (loss 2.8988): 53%|ββββββ | 331/625 [08:14<06:26, 1.32s/it]
Training 1/1 epoch (loss 2.8988): 53%|ββββββ | 332/625 [08:14<06:36, 1.35s/it]
Training 1/1 epoch (loss 2.5648): 53%|ββββββ | 332/625 [08:14<06:36, 1.35s/it]
Training 1/1 epoch (loss 2.5648): 53%|ββββββ | 333/625 [08:14<05:56, 1.22s/it]
Training 1/1 epoch (loss 2.6894): 53%|ββββββ | 333/625 [08:16<05:56, 1.22s/it]
Training 1/1 epoch (loss 2.6894): 53%|ββββββ | 334/625 [08:16<06:07, 1.26s/it]
Training 1/1 epoch (loss 2.5832): 53%|ββββββ | 334/625 [08:17<06:07, 1.26s/it]
Training 1/1 epoch (loss 2.5832): 54%|ββββββ | 335/625 [08:17<05:56, 1.23s/it]
Training 1/1 epoch (loss 2.7667): 54%|ββββββ | 335/625 [08:19<05:56, 1.23s/it]
Training 1/1 epoch (loss 2.7667): 54%|ββββββ | 336/625 [08:19<07:06, 1.48s/it]
Training 1/1 epoch (loss 2.7912): 54%|ββββββ | 336/625 [08:21<07:06, 1.48s/it]
Training 1/1 epoch (loss 2.7912): 54%|ββββββ | 337/625 [08:21<07:58, 1.66s/it]
Training 1/1 epoch (loss 2.7620): 54%|ββββββ | 337/625 [08:22<07:58, 1.66s/it]
Training 1/1 epoch (loss 2.7620): 54%|ββββββ | 338/625 [08:22<06:32, 1.37s/it]
Training 1/1 epoch (loss 2.6084): 54%|ββββββ | 338/625 [08:23<06:32, 1.37s/it]
Training 1/1 epoch (loss 2.6084): 54%|ββββββ | 339/625 [08:23<06:54, 1.45s/it]
Training 1/1 epoch (loss 2.7774): 54%|ββββββ | 339/625 [08:25<06:54, 1.45s/it]
Training 1/1 epoch (loss 2.7774): 54%|ββββββ | 340/625 [08:25<06:50, 1.44s/it]
Training 1/1 epoch (loss 2.5333): 54%|ββββββ | 340/625 [08:25<06:50, 1.44s/it]
Training 1/1 epoch (loss 2.5333): 55%|ββββββ | 341/625 [08:25<05:25, 1.15s/it]
Training 1/1 epoch (loss 2.6582): 55%|ββββββ | 341/625 [08:27<05:25, 1.15s/it]
Training 1/1 epoch (loss 2.6582): 55%|ββββββ | 342/625 [08:27<06:42, 1.42s/it]
Training 1/1 epoch (loss 2.6243): 55%|ββββββ | 342/625 [08:29<06:42, 1.42s/it]
Training 1/1 epoch (loss 2.6243): 55%|ββββββ | 343/625 [08:29<07:30, 1.60s/it]
Training 1/1 epoch (loss 2.5161): 55%|ββββββ | 343/625 [08:30<07:30, 1.60s/it]
Training 1/1 epoch (loss 2.5161): 55%|ββββββ | 344/625 [08:30<06:28, 1.38s/it]
Training 1/1 epoch (loss 2.6769): 55%|ββββββ | 344/625 [08:32<06:28, 1.38s/it]
Training 1/1 epoch (loss 2.6769): 55%|ββββββ | 345/625 [08:32<07:25, 1.59s/it]
Training 1/1 epoch (loss 2.5079): 55%|ββββββ | 345/625 [08:34<07:25, 1.59s/it]
Training 1/1 epoch (loss 2.5079): 55%|ββββββ | 346/625 [08:34<06:58, 1.50s/it]
Training 1/1 epoch (loss 2.6976): 55%|ββββββ | 346/625 [08:35<06:58, 1.50s/it]
Training 1/1 epoch (loss 2.6976): 56%|ββββββ | 347/625 [08:35<06:18, 1.36s/it]
Training 1/1 epoch (loss 2.9246): 56%|ββββββ | 347/625 [08:37<06:18, 1.36s/it]
Training 1/1 epoch (loss 2.9246): 56%|ββββββ | 348/625 [08:37<07:32, 1.63s/it]
Training 1/1 epoch (loss 2.6422): 56%|ββββββ | 348/625 [08:38<07:32, 1.63s/it]
Training 1/1 epoch (loss 2.6422): 56%|ββββββ | 349/625 [08:38<06:38, 1.45s/it]
Training 1/1 epoch (loss 2.7289): 56%|ββββββ | 349/625 [08:40<06:38, 1.45s/it]
Training 1/1 epoch (loss 2.7289): 56%|ββββββ | 350/625 [08:40<07:55, 1.73s/it]
Training 1/1 epoch (loss 2.5300): 56%|ββββββ | 350/625 [08:42<07:55, 1.73s/it]
Training 1/1 epoch (loss 2.5300): 56%|ββββββ | 351/625 [08:42<07:43, 1.69s/it]
Training 1/1 epoch (loss 2.9257): 56%|ββββββ | 351/625 [08:42<07:43, 1.69s/it]
Training 1/1 epoch (loss 2.9257): 56%|ββββββ | 352/625 [08:42<06:08, 1.35s/it]
Training 1/1 epoch (loss 3.0715): 56%|ββββββ | 352/625 [08:44<06:08, 1.35s/it]
Training 1/1 epoch (loss 3.0715): 56%|ββββββ | 353/625 [08:44<06:05, 1.35s/it]
Training 1/1 epoch (loss 2.7211): 56%|ββββββ | 353/625 [08:45<06:05, 1.35s/it]
Training 1/1 epoch (loss 2.7211): 57%|ββββββ | 354/625 [08:45<05:52, 1.30s/it]
Training 1/1 epoch (loss 2.7461): 57%|ββββββ | 354/625 [08:45<05:52, 1.30s/it]
Training 1/1 epoch (loss 2.7461): 57%|ββββββ | 355/625 [08:45<04:44, 1.05s/it]
Training 1/1 epoch (loss 2.5793): 57%|ββββββ | 355/625 [08:48<04:44, 1.05s/it]
Training 1/1 epoch (loss 2.5793): 57%|ββββββ | 356/625 [08:48<06:01, 1.34s/it]
Training 1/1 epoch (loss 2.8152): 57%|ββββββ | 356/625 [08:49<06:01, 1.34s/it]
Training 1/1 epoch (loss 2.8152): 57%|ββββββ | 357/625 [08:49<06:48, 1.53s/it]
Training 1/1 epoch (loss 2.6230): 57%|ββββββ | 357/625 [08:50<06:48, 1.53s/it]
Training 1/1 epoch (loss 2.6230): 57%|ββββββ | 358/625 [08:50<05:35, 1.26s/it]
Training 1/1 epoch (loss 2.8848): 57%|ββββββ | 358/625 [08:52<05:35, 1.26s/it]
Training 1/1 epoch (loss 2.8848): 57%|ββββββ | 359/625 [08:52<06:17, 1.42s/it]
Training 1/1 epoch (loss 2.4867): 57%|ββββββ | 359/625 [08:54<06:17, 1.42s/it]
Training 1/1 epoch (loss 2.4867): 58%|ββββββ | 360/625 [08:54<07:18, 1.66s/it]
Training 1/1 epoch (loss 2.7221): 58%|ββββββ | 360/625 [08:55<07:18, 1.66s/it]
Training 1/1 epoch (loss 2.7221): 58%|ββββββ | 361/625 [08:55<06:45, 1.54s/it]
Training 1/1 epoch (loss 2.5670): 58%|ββββββ | 361/625 [08:57<06:45, 1.54s/it]
Training 1/1 epoch (loss 2.5670): 58%|ββββββ | 362/625 [08:57<06:45, 1.54s/it]
Training 1/1 epoch (loss 2.7014): 58%|ββββββ | 362/625 [08:58<06:45, 1.54s/it]
Training 1/1 epoch (loss 2.7014): 58%|ββββββ | 363/625 [08:58<05:51, 1.34s/it]
Training 1/1 epoch (loss 2.6966): 58%|ββββββ | 363/625 [08:59<05:51, 1.34s/it]
Training 1/1 epoch (loss 2.6966): 58%|ββββββ | 364/625 [08:59<06:01, 1.39s/it]
Training 1/1 epoch (loss 2.4678): 58%|ββββββ | 364/625 [09:01<06:01, 1.39s/it]
Training 1/1 epoch (loss 2.4678): 58%|ββββββ | 365/625 [09:01<06:33, 1.51s/it]
Training 1/1 epoch (loss 2.6309): 58%|ββββββ | 365/625 [09:02<06:33, 1.51s/it]
Training 1/1 epoch (loss 2.6309): 59%|ββββββ | 366/625 [09:02<06:02, 1.40s/it]
Training 1/1 epoch (loss 2.5862): 59%|ββββββ | 366/625 [09:04<06:02, 1.40s/it]
Training 1/1 epoch (loss 2.5862): 59%|ββββββ | 367/625 [09:04<06:54, 1.61s/it]
Training 1/1 epoch (loss 2.5952): 59%|ββββββ | 367/625 [09:06<06:54, 1.61s/it]
Training 1/1 epoch (loss 2.5952): 59%|ββββββ | 368/625 [09:06<07:21, 1.72s/it]
Training 1/1 epoch (loss 2.5394): 59%|ββββββ | 368/625 [09:07<07:21, 1.72s/it]
Training 1/1 epoch (loss 2.5394): 59%|ββββββ | 369/625 [09:07<06:07, 1.44s/it]
Training 1/1 epoch (loss 2.6859): 59%|ββββββ | 369/625 [09:09<06:07, 1.44s/it]
Training 1/1 epoch (loss 2.6859): 59%|ββββββ | 370/625 [09:09<06:21, 1.50s/it]
Training 1/1 epoch (loss 2.7699): 59%|ββββββ | 370/625 [09:10<06:21, 1.50s/it]
Training 1/1 epoch (loss 2.7699): 59%|ββββββ | 371/625 [09:10<06:09, 1.45s/it]
Training 1/1 epoch (loss 2.8491): 59%|ββββββ | 371/625 [09:11<06:09, 1.45s/it]
Training 1/1 epoch (loss 2.8491): 60%|ββββββ | 372/625 [09:11<05:21, 1.27s/it]
Training 1/1 epoch (loss 2.5608): 60%|ββββββ | 372/625 [09:13<05:21, 1.27s/it]
Training 1/1 epoch (loss 2.5608): 60%|ββββββ | 373/625 [09:13<06:29, 1.55s/it]
Training 1/1 epoch (loss 2.7884): 60%|ββββββ | 373/625 [09:14<06:29, 1.55s/it]
Training 1/1 epoch (loss 2.7884): 60%|ββββββ | 374/625 [09:14<06:05, 1.45s/it]
Training 1/1 epoch (loss 2.6884): 60%|ββββββ | 374/625 [09:16<06:05, 1.45s/it]
Training 1/1 epoch (loss 2.6884): 60%|ββββββ | 375/625 [09:16<05:48, 1.39s/it]
Training 1/1 epoch (loss 2.6364): 60%|ββββββ | 375/625 [09:18<05:48, 1.39s/it]
Training 1/1 epoch (loss 2.6364): 60%|ββββββ | 376/625 [09:18<06:31, 1.57s/it]
Training 1/1 epoch (loss 2.5406): 60%|ββββββ | 376/625 [09:19<06:31, 1.57s/it]
Training 1/1 epoch (loss 2.5406): 60%|ββββββ | 377/625 [09:19<05:50, 1.42s/it]
Training 1/1 epoch (loss 2.5701): 60%|ββββββ | 377/625 [09:21<05:50, 1.42s/it]
Training 1/1 epoch (loss 2.5701): 60%|ββββββ | 378/625 [09:21<06:42, 1.63s/it]
Training 1/1 epoch (loss 2.8079): 60%|ββββββ | 378/625 [09:22<06:42, 1.63s/it]
Training 1/1 epoch (loss 2.8079): 61%|ββββββ | 379/625 [09:22<06:16, 1.53s/it]
Training 1/1 epoch (loss 2.5098): 61%|ββββββ | 379/625 [09:23<06:16, 1.53s/it]
Training 1/1 epoch (loss 2.5098): 61%|ββββββ | 380/625 [09:23<05:35, 1.37s/it]
Training 1/1 epoch (loss 2.8347): 61%|ββββββ | 380/625 [09:25<05:35, 1.37s/it]
Training 1/1 epoch (loss 2.8347): 61%|ββββββ | 381/625 [09:25<05:51, 1.44s/it]
Training 1/1 epoch (loss 2.6036): 61%|ββββββ | 381/625 [09:26<05:51, 1.44s/it]
Training 1/1 epoch (loss 2.6036): 61%|ββββββ | 382/625 [09:26<05:52, 1.45s/it]
Training 1/1 epoch (loss 2.7820): 61%|ββββββ | 382/625 [09:27<05:52, 1.45s/it]
Training 1/1 epoch (loss 2.7820): 61%|βββββββ | 383/625 [09:27<05:09, 1.28s/it]
Training 1/1 epoch (loss 2.5902): 61%|βββββββ | 383/625 [09:29<05:09, 1.28s/it]
Training 1/1 epoch (loss 2.5902): 61%|βββββββ | 384/625 [09:29<05:26, 1.36s/it]
Training 1/1 epoch (loss 2.5795): 61%|βββββββ | 384/625 [09:29<05:26, 1.36s/it]
Training 1/1 epoch (loss 2.5795): 62%|βββββββ | 385/625 [09:29<04:41, 1.17s/it]
Training 1/1 epoch (loss 2.7667): 62%|βββββββ | 385/625 [09:31<04:41, 1.17s/it]
Training 1/1 epoch (loss 2.7667): 62%|βββββββ | 386/625 [09:31<05:25, 1.36s/it]
Training 1/1 epoch (loss 2.7636): 62%|βββββββ | 386/625 [09:32<05:25, 1.36s/it]
Training 1/1 epoch (loss 2.7636): 62%|βββββββ | 387/625 [09:32<05:17, 1.33s/it]
Training 1/1 epoch (loss 2.6971): 62%|βββββββ | 387/625 [09:33<05:17, 1.33s/it]
Training 1/1 epoch (loss 2.6971): 62%|βββββββ | 388/625 [09:33<04:19, 1.09s/it]
Training 1/1 epoch (loss 2.5028): 62%|βββββββ | 388/625 [09:34<04:19, 1.09s/it]
Training 1/1 epoch (loss 2.5028): 62%|βββββββ | 389/625 [09:34<04:48, 1.22s/it]
Training 1/1 epoch (loss 2.7182): 62%|βββββββ | 389/625 [09:36<04:48, 1.22s/it]
Training 1/1 epoch (loss 2.7182): 62%|βββββββ | 390/625 [09:36<04:55, 1.26s/it]
Training 1/1 epoch (loss 2.6782): 62%|βββββββ | 390/625 [09:37<04:55, 1.26s/it]
Training 1/1 epoch (loss 2.6782): 63%|βββββββ | 391/625 [09:37<04:40, 1.20s/it]
Training 1/1 epoch (loss 2.8603): 63%|βββββββ | 391/625 [09:40<04:40, 1.20s/it]
Training 1/1 epoch (loss 2.8603): 63%|βββββββ | 392/625 [09:40<06:27, 1.66s/it]
Training 1/1 epoch (loss 2.6467): 63%|βββββββ | 392/625 [09:40<06:27, 1.66s/it]
Training 1/1 epoch (loss 2.6467): 63%|βββββββ | 393/625 [09:40<05:06, 1.32s/it]
Training 1/1 epoch (loss 2.6638): 63%|βββββββ | 393/625 [09:42<05:06, 1.32s/it]
Training 1/1 epoch (loss 2.6638): 63%|βββββββ | 394/625 [09:42<06:13, 1.62s/it]
Training 1/1 epoch (loss 2.7325): 63%|βββββββ | 394/625 [09:45<06:13, 1.62s/it]
Training 1/1 epoch (loss 2.7325): 63%|βββββββ | 395/625 [09:45<07:00, 1.83s/it]
Training 1/1 epoch (loss 2.5307): 63%|βββββββ | 395/625 [09:45<07:00, 1.83s/it]
Training 1/1 epoch (loss 2.5307): 63%|βββββββ | 396/625 [09:45<05:27, 1.43s/it]
Training 1/1 epoch (loss 2.6794): 63%|βββββββ | 396/625 [09:47<05:27, 1.43s/it]
Training 1/1 epoch (loss 2.6794): 64%|βββββββ | 397/625 [09:47<05:42, 1.50s/it]
Training 1/1 epoch (loss 2.6760): 64%|βββββββ | 397/625 [09:49<05:42, 1.50s/it]
Training 1/1 epoch (loss 2.6760): 64%|βββββββ | 398/625 [09:49<06:16, 1.66s/it]
Training 1/1 epoch (loss 2.6691): 64%|βββββββ | 398/625 [09:50<06:16, 1.66s/it]
Training 1/1 epoch (loss 2.6691): 64%|βββββββ | 399/625 [09:50<05:34, 1.48s/it]
Training 1/1 epoch (loss 2.5693): 64%|βββββββ | 399/625 [09:52<05:34, 1.48s/it]
Training 1/1 epoch (loss 2.5693): 64%|βββββββ | 400/625 [09:52<06:10, 1.65s/it]
Training 1/1 epoch (loss 2.6850): 64%|βββββββ | 400/625 [09:53<06:10, 1.65s/it]
Training 1/1 epoch (loss 2.6850): 64%|βββββββ | 401/625 [09:53<05:30, 1.48s/it]
Training 1/1 epoch (loss 2.7215): 64%|βββββββ | 401/625 [09:54<05:30, 1.48s/it]
Training 1/1 epoch (loss 2.7215): 64%|βββββββ | 402/625 [09:54<05:19, 1.43s/it]
Training 1/1 epoch (loss 2.9235): 64%|βββββββ | 402/625 [09:56<05:19, 1.43s/it]
Training 1/1 epoch (loss 2.9235): 64%|βββββββ | 403/625 [09:56<05:35, 1.51s/it]
Training 1/1 epoch (loss 2.7086): 64%|βββββββ | 403/625 [09:57<05:35, 1.51s/it]
Training 1/1 epoch (loss 2.7086): 65%|βββββββ | 404/625 [09:57<05:20, 1.45s/it]
Training 1/1 epoch (loss 2.6475): 65%|βββββββ | 404/625 [09:58<05:20, 1.45s/it]
Training 1/1 epoch (loss 2.6475): 65%|βββββββ | 405/625 [09:58<04:23, 1.20s/it]
Training 1/1 epoch (loss 2.6413): 65%|βββββββ | 405/625 [10:00<04:23, 1.20s/it]
Training 1/1 epoch (loss 2.6413): 65%|βββββββ | 406/625 [10:00<04:59, 1.37s/it]
Training 1/1 epoch (loss 2.7599): 65%|βββββββ | 406/625 [10:01<04:59, 1.37s/it]
Training 1/1 epoch (loss 2.7599): 65%|βββββββ | 407/625 [10:01<05:12, 1.44s/it]
Training 1/1 epoch (loss 2.7005): 65%|βββββββ | 407/625 [10:03<05:12, 1.44s/it]
Training 1/1 epoch (loss 2.7005): 65%|βββββββ | 408/625 [10:03<05:06, 1.41s/it]
Training 1/1 epoch (loss 2.8355): 65%|βββββββ | 408/625 [10:04<05:06, 1.41s/it]
Training 1/1 epoch (loss 2.8355): 65%|βββββββ | 409/625 [10:04<05:13, 1.45s/it]
Training 1/1 epoch (loss 2.9383): 65%|βββββββ | 409/625 [10:05<05:13, 1.45s/it]
Training 1/1 epoch (loss 2.9383): 66%|βββββββ | 410/625 [10:05<04:08, 1.16s/it]
Training 1/1 epoch (loss 2.4043): 66%|βββββββ | 410/625 [10:07<04:08, 1.16s/it]
Training 1/1 epoch (loss 2.4043): 66%|βββββββ | 411/625 [10:07<04:49, 1.35s/it]
Training 1/1 epoch (loss 2.7290): 66%|βββββββ | 411/625 [10:08<04:49, 1.35s/it]
Training 1/1 epoch (loss 2.7290): 66%|βββββββ | 412/625 [10:08<04:53, 1.38s/it]
Training 1/1 epoch (loss 2.7223): 66%|βββββββ | 412/625 [10:09<04:53, 1.38s/it]
Training 1/1 epoch (loss 2.7223): 66%|βββββββ | 413/625 [10:09<04:27, 1.26s/it]
Training 1/1 epoch (loss 2.8465): 66%|βββββββ | 413/625 [10:11<04:27, 1.26s/it]
Training 1/1 epoch (loss 2.8465): 66%|βββββββ | 414/625 [10:11<05:01, 1.43s/it]
Training 1/1 epoch (loss 2.6617): 66%|βββββββ | 414/625 [10:13<05:01, 1.43s/it]
Training 1/1 epoch (loss 2.6617): 66%|βββββββ | 415/625 [10:13<05:17, 1.51s/it]
Training 1/1 epoch (loss 2.5177): 66%|βββββββ | 415/625 [10:14<05:17, 1.51s/it]
Training 1/1 epoch (loss 2.5177): 67%|βββββββ | 416/625 [10:14<05:32, 1.59s/it]
Training 1/1 epoch (loss 2.8188): 67%|βββββββ | 416/625 [10:16<05:32, 1.59s/it]
Training 1/1 epoch (loss 2.8188): 67%|βββββββ | 417/625 [10:16<05:43, 1.65s/it]
Training 1/1 epoch (loss 2.6921): 67%|βββββββ | 417/625 [10:17<05:43, 1.65s/it]
Training 1/1 epoch (loss 2.6921): 67%|βββββββ | 418/625 [10:17<04:31, 1.31s/it]
Training 1/1 epoch (loss 2.7872): 67%|βββββββ | 418/625 [10:18<04:31, 1.31s/it]
Training 1/1 epoch (loss 2.7872): 67%|βββββββ | 419/625 [10:18<04:58, 1.45s/it]
Training 1/1 epoch (loss 2.7335): 67%|βββββββ | 419/625 [10:21<04:58, 1.45s/it]
Training 1/1 epoch (loss 2.7335): 67%|βββββββ | 420/625 [10:21<05:53, 1.72s/it]
Training 1/1 epoch (loss 2.5345): 67%|βββββββ | 420/625 [10:21<05:53, 1.72s/it]
Training 1/1 epoch (loss 2.5345): 67%|βββββββ | 421/625 [10:21<04:43, 1.39s/it]
Training 1/1 epoch (loss 2.6944): 67%|βββββββ | 421/625 [10:23<04:43, 1.39s/it]
Training 1/1 epoch (loss 2.6944): 68%|βββββββ | 422/625 [10:23<05:02, 1.49s/it]
Training 1/1 epoch (loss 2.7067): 68%|βββββββ | 422/625 [10:25<05:02, 1.49s/it]
Training 1/1 epoch (loss 2.7067): 68%|βββββββ | 423/625 [10:25<05:17, 1.57s/it]
Training 1/1 epoch (loss 2.8695): 68%|βββββββ | 423/625 [10:26<05:17, 1.57s/it]
Training 1/1 epoch (loss 2.8695): 68%|βββββββ | 424/625 [10:26<05:08, 1.53s/it]
Training 1/1 epoch (loss 2.7190): 68%|βββββββ | 424/625 [10:28<05:08, 1.53s/it]
Training 1/1 epoch (loss 2.7190): 68%|βββββββ | 425/625 [10:28<04:59, 1.50s/it]
Training 1/1 epoch (loss 2.6897): 68%|βββββββ | 425/625 [10:29<04:59, 1.50s/it]
Training 1/1 epoch (loss 2.6897): 68%|βββββββ | 426/625 [10:29<04:41, 1.42s/it]
Training 1/1 epoch (loss 2.6889): 68%|βββββββ | 426/625 [10:30<04:41, 1.42s/it]
Training 1/1 epoch (loss 2.6889): 68%|βββββββ | 427/625 [10:30<04:40, 1.42s/it]
Training 1/1 epoch (loss 2.5913): 68%|βββββββ | 427/625 [10:33<04:40, 1.42s/it]
Training 1/1 epoch (loss 2.5913): 68%|βββββββ | 428/625 [10:33<05:40, 1.73s/it]
Training 1/1 epoch (loss 2.6868): 68%|βββββββ | 428/625 [10:34<05:40, 1.73s/it]
Training 1/1 epoch (loss 2.6868): 69%|βββββββ | 429/625 [10:34<04:42, 1.44s/it]
Training 1/1 epoch (loss 2.9564): 69%|βββββββ | 429/625 [10:35<04:42, 1.44s/it]
Training 1/1 epoch (loss 2.9564): 69%|βββββββ | 430/625 [10:35<04:58, 1.53s/it]
Training 1/1 epoch (loss 2.7001): 69%|βββββββ | 430/625 [10:37<04:58, 1.53s/it]
Training 1/1 epoch (loss 2.7001): 69%|βββββββ | 431/625 [10:37<05:35, 1.73s/it]
Training 1/1 epoch (loss 2.3958): 69%|βββββββ | 431/625 [10:38<05:35, 1.73s/it]
Training 1/1 epoch (loss 2.3958): 69%|βββββββ | 432/625 [10:38<04:40, 1.45s/it]
Training 1/1 epoch (loss 2.3371): 69%|βββββββ | 432/625 [10:40<04:40, 1.45s/it]
Training 1/1 epoch (loss 2.3371): 69%|βββββββ | 433/625 [10:40<04:55, 1.54s/it]
Training 1/1 epoch (loss 2.6703): 69%|βββββββ | 433/625 [10:42<04:55, 1.54s/it]
Training 1/1 epoch (loss 2.6703): 69%|βββββββ | 434/625 [10:42<04:58, 1.56s/it]
Training 1/1 epoch (loss 2.8382): 69%|βββββββ | 434/625 [10:42<04:58, 1.56s/it]
Training 1/1 epoch (loss 2.8382): 70%|βββββββ | 435/625 [10:42<04:06, 1.30s/it]
Training 1/1 epoch (loss 2.7424): 70%|βββββββ | 435/625 [10:44<04:06, 1.30s/it]
Training 1/1 epoch (loss 2.7424): 70%|βββββββ | 436/625 [10:44<04:49, 1.53s/it]
Training 1/1 epoch (loss 2.6149): 70%|βββββββ | 436/625 [10:46<04:49, 1.53s/it]
Training 1/1 epoch (loss 2.6149): 70%|βββββββ | 437/625 [10:46<04:40, 1.49s/it]
Training 1/1 epoch (loss 2.9793): 70%|βββββββ | 437/625 [10:47<04:40, 1.49s/it]
Training 1/1 epoch (loss 2.9793): 70%|βββββββ | 438/625 [10:47<04:11, 1.35s/it]
Training 1/1 epoch (loss 2.7343): 70%|βββββββ | 438/625 [10:49<04:11, 1.35s/it]
Training 1/1 epoch (loss 2.7343): 70%|βββββββ | 439/625 [10:49<05:07, 1.65s/it]
Training 1/1 epoch (loss 2.8354): 70%|βββββββ | 439/625 [10:50<05:07, 1.65s/it]
Training 1/1 epoch (loss 2.8354): 70%|βββββββ | 440/625 [10:50<04:30, 1.46s/it]
Training 1/1 epoch (loss 2.6836): 70%|βββββββ | 440/625 [10:52<04:30, 1.46s/it]
Training 1/1 epoch (loss 2.6836): 71%|βββββββ | 441/625 [10:52<04:24, 1.44s/it]
Training 1/1 epoch (loss 2.7549): 71%|βββββββ | 441/625 [10:54<04:24, 1.44s/it]
Training 1/1 epoch (loss 2.7549): 71%|βββββββ | 442/625 [10:54<04:51, 1.59s/it]
Training 1/1 epoch (loss 2.6887): 71%|βββββββ | 442/625 [10:54<04:51, 1.59s/it]
Training 1/1 epoch (loss 2.6887): 71%|βββββββ | 443/625 [10:54<04:03, 1.34s/it]
Training 1/1 epoch (loss 2.6418): 71%|βββββββ | 443/625 [10:56<04:03, 1.34s/it]
Training 1/1 epoch (loss 2.6418): 71%|βββββββ | 444/625 [10:56<04:30, 1.49s/it]
Training 1/1 epoch (loss 2.6393): 71%|βββββββ | 444/625 [10:59<04:30, 1.49s/it]
Training 1/1 epoch (loss 2.6393): 71%|βββββββ | 445/625 [10:59<05:23, 1.80s/it]
Training 1/1 epoch (loss 2.8016): 71%|βββββββ | 445/625 [10:59<05:23, 1.80s/it]
Training 1/1 epoch (loss 2.8016): 71%|ββββββββ | 446/625 [10:59<04:09, 1.39s/it]
Training 1/1 epoch (loss 2.7936): 71%|ββββββββ | 446/625 [11:01<04:09, 1.39s/it]
Training 1/1 epoch (loss 2.7936): 72%|ββββββββ | 447/625 [11:01<04:44, 1.60s/it]
Training 1/1 epoch (loss 2.6837): 72%|ββββββββ | 447/625 [11:03<04:44, 1.60s/it]
Training 1/1 epoch (loss 2.6837): 72%|ββββββββ | 448/625 [11:03<04:42, 1.59s/it]
Training 1/1 epoch (loss 2.8586): 72%|ββββββββ | 448/625 [11:04<04:42, 1.59s/it]
Training 1/1 epoch (loss 2.8586): 72%|ββββββββ | 449/625 [11:04<04:12, 1.43s/it]
Training 1/1 epoch (loss 2.6336): 72%|ββββββββ | 449/625 [11:05<04:12, 1.43s/it]
Training 1/1 epoch (loss 2.6336): 72%|ββββββββ | 450/625 [11:05<04:04, 1.40s/it]
Training 1/1 epoch (loss 2.4979): 72%|ββββββββ | 450/625 [11:07<04:04, 1.40s/it]
Training 1/1 epoch (loss 2.4979): 72%|ββββββββ | 451/625 [11:07<04:26, 1.53s/it]
Training 1/1 epoch (loss 2.6253): 72%|ββββββββ | 451/625 [11:09<04:26, 1.53s/it]
Training 1/1 epoch (loss 2.6253): 72%|ββββββββ | 452/625 [11:09<04:26, 1.54s/it]
Training 1/1 epoch (loss 2.4846): 72%|ββββββββ | 452/625 [11:10<04:26, 1.54s/it]
Training 1/1 epoch (loss 2.4846): 72%|ββββββββ | 453/625 [11:10<04:38, 1.62s/it]
Training 1/1 epoch (loss 2.5169): 72%|ββββββββ | 453/625 [11:11<04:38, 1.62s/it]
Training 1/1 epoch (loss 2.5169): 73%|ββββββββ | 454/625 [11:11<03:51, 1.35s/it]
Training 1/1 epoch (loss 2.7643): 73%|ββββββββ | 454/625 [11:12<03:51, 1.35s/it]
Training 1/1 epoch (loss 2.7643): 73%|ββββββββ | 455/625 [11:12<03:52, 1.37s/it]
Training 1/1 epoch (loss 2.7253): 73%|ββββββββ | 455/625 [11:14<03:52, 1.37s/it]
Training 1/1 epoch (loss 2.7253): 73%|ββββββββ | 456/625 [11:14<04:03, 1.44s/it]
Training 1/1 epoch (loss 2.8312): 73%|ββββββββ | 456/625 [11:15<04:03, 1.44s/it]
Training 1/1 epoch (loss 2.8312): 73%|ββββββββ | 457/625 [11:15<03:24, 1.22s/it]
Training 1/1 epoch (loss 2.7637): 73%|ββββββββ | 457/625 [11:16<03:24, 1.22s/it]
Training 1/1 epoch (loss 2.7637): 73%|ββββββββ | 458/625 [11:16<03:29, 1.25s/it]
Training 1/1 epoch (loss 2.6730): 73%|ββββββββ | 458/625 [11:17<03:29, 1.25s/it]
Training 1/1 epoch (loss 2.6730): 73%|ββββββββ | 459/625 [11:17<03:22, 1.22s/it]
Training 1/1 epoch (loss 2.4605): 73%|ββββββββ | 459/625 [11:19<03:22, 1.22s/it]
Training 1/1 epoch (loss 2.4605): 74%|ββββββββ | 460/625 [11:19<03:24, 1.24s/it]
Training 1/1 epoch (loss 2.8656): 74%|ββββββββ | 460/625 [11:21<03:24, 1.24s/it]
Training 1/1 epoch (loss 2.8656): 74%|ββββββββ | 461/625 [11:21<04:16, 1.56s/it]
Training 1/1 epoch (loss 2.8255): 74%|ββββββββ | 461/625 [11:22<04:16, 1.56s/it]
Training 1/1 epoch (loss 2.8255): 74%|ββββββββ | 462/625 [11:22<04:08, 1.52s/it]
Training 1/1 epoch (loss 2.6536): 74%|ββββββββ | 462/625 [11:24<04:08, 1.52s/it]
Training 1/1 epoch (loss 2.6536): 74%|ββββββββ | 463/625 [11:24<03:55, 1.46s/it]
Training 1/1 epoch (loss 2.8916): 74%|ββββββββ | 463/625 [11:25<03:55, 1.46s/it]
Training 1/1 epoch (loss 2.8916): 74%|ββββββββ | 464/625 [11:25<03:54, 1.46s/it]
Training 1/1 epoch (loss 2.7124): 74%|ββββββββ | 464/625 [11:26<03:54, 1.46s/it]
Training 1/1 epoch (loss 2.7124): 74%|ββββββββ | 465/625 [11:26<03:41, 1.38s/it]
Training 1/1 epoch (loss 2.8760): 74%|ββββββββ | 465/625 [11:29<03:41, 1.38s/it]
Training 1/1 epoch (loss 2.8760): 75%|ββββββββ | 466/625 [11:29<04:23, 1.65s/it]
Training 1/1 epoch (loss 2.6222): 75%|ββββββββ | 466/625 [11:29<04:23, 1.65s/it]
Training 1/1 epoch (loss 2.6222): 75%|ββββββββ | 467/625 [11:29<03:43, 1.41s/it]
Training 1/1 epoch (loss 2.6802): 75%|ββββββββ | 467/625 [11:30<03:43, 1.41s/it]
Training 1/1 epoch (loss 2.6802): 75%|ββββββββ | 468/625 [11:30<03:17, 1.26s/it]
Training 1/1 epoch (loss 2.9459): 75%|ββββββββ | 468/625 [11:32<03:17, 1.26s/it]
Training 1/1 epoch (loss 2.9459): 75%|ββββββββ | 469/625 [11:32<03:28, 1.33s/it]
Training 1/1 epoch (loss 2.7033): 75%|ββββββββ | 469/625 [11:34<03:28, 1.33s/it]
Training 1/1 epoch (loss 2.7033): 75%|ββββββββ | 470/625 [11:34<04:09, 1.61s/it]
Training 1/1 epoch (loss 2.6446): 75%|ββββββββ | 470/625 [11:36<04:09, 1.61s/it]
Training 1/1 epoch (loss 2.6446): 75%|ββββββββ | 471/625 [11:36<04:02, 1.57s/it]
Training 1/1 epoch (loss 2.8836): 75%|ββββββββ | 471/625 [11:37<04:02, 1.57s/it]
Training 1/1 epoch (loss 2.8836): 76%|ββββββββ | 472/625 [11:37<04:10, 1.64s/it]
Training 1/1 epoch (loss 2.4409): 76%|ββββββββ | 472/625 [11:40<04:10, 1.64s/it]
Training 1/1 epoch (loss 2.4409): 76%|ββββββββ | 473/625 [11:40<04:46, 1.89s/it]
Training 1/1 epoch (loss 2.8617): 76%|ββββββββ | 473/625 [11:40<04:46, 1.89s/it]
Training 1/1 epoch (loss 2.8617): 76%|ββββββββ | 474/625 [11:40<03:40, 1.46s/it]
Training 1/1 epoch (loss 2.8812): 76%|ββββββββ | 474/625 [11:42<03:40, 1.46s/it]
Training 1/1 epoch (loss 2.8812): 76%|ββββββββ | 475/625 [11:42<03:36, 1.44s/it]
Training 1/1 epoch (loss 2.9336): 76%|ββββββββ | 475/625 [11:43<03:36, 1.44s/it]
Training 1/1 epoch (loss 2.9336): 76%|ββββββββ | 476/625 [11:43<03:26, 1.38s/it]
Training 1/1 epoch (loss 2.6703): 76%|ββββββββ | 476/625 [11:43<03:26, 1.38s/it]
Training 1/1 epoch (loss 2.6703): 76%|ββββββββ | 477/625 [11:43<02:44, 1.11s/it]
Training 1/1 epoch (loss 2.6025): 76%|ββββββββ | 477/625 [11:45<02:44, 1.11s/it]
Training 1/1 epoch (loss 2.6025): 76%|ββββββββ | 478/625 [11:45<03:15, 1.33s/it]
Training 1/1 epoch (loss 2.6672): 76%|ββββββββ | 478/625 [11:48<03:15, 1.33s/it]
Training 1/1 epoch (loss 2.6672): 77%|ββββββββ | 479/625 [11:48<03:56, 1.62s/it]
Training 1/1 epoch (loss 2.7465): 77%|ββββββββ | 479/625 [11:49<03:56, 1.62s/it]
Training 1/1 epoch (loss 2.7465): 77%|ββββββββ | 480/625 [11:49<03:30, 1.45s/it]
Training 1/1 epoch (loss 2.6782): 77%|ββββββββ | 480/625 [11:50<03:30, 1.45s/it]
Training 1/1 epoch (loss 2.6782): 77%|ββββββββ | 481/625 [11:50<03:20, 1.39s/it]
Training 1/1 epoch (loss 2.4742): 77%|ββββββββ | 481/625 [11:51<03:20, 1.39s/it]
Training 1/1 epoch (loss 2.4742): 77%|ββββββββ | 482/625 [11:51<03:27, 1.45s/it]
Training 1/1 epoch (loss 2.6082): 77%|ββββββββ | 482/625 [11:53<03:27, 1.45s/it]
Training 1/1 epoch (loss 2.6082): 77%|ββββββββ | 483/625 [11:53<03:24, 1.44s/it]
Training 1/1 epoch (loss 2.6283): 77%|ββββββββ | 483/625 [11:55<03:24, 1.44s/it]
Training 1/1 epoch (loss 2.6283): 77%|ββββββββ | 484/625 [11:55<03:47, 1.61s/it]
Training 1/1 epoch (loss 2.6419): 77%|ββββββββ | 484/625 [11:56<03:47, 1.61s/it]
Training 1/1 epoch (loss 2.6419): 78%|ββββββββ | 485/625 [11:56<03:07, 1.34s/it]
Training 1/1 epoch (loss 2.6546): 78%|ββββββββ | 485/625 [11:57<03:07, 1.34s/it]
Training 1/1 epoch (loss 2.6546): 78%|ββββββββ | 486/625 [11:57<03:06, 1.34s/it]
Training 1/1 epoch (loss 2.8046): 78%|ββββββββ | 486/625 [11:59<03:06, 1.34s/it]
Training 1/1 epoch (loss 2.8046): 78%|ββββββββ | 487/625 [11:59<03:30, 1.53s/it]
Training 1/1 epoch (loss 2.6210): 78%|ββββββββ | 487/625 [12:00<03:30, 1.53s/it]
Training 1/1 epoch (loss 2.6210): 78%|ββββββββ | 488/625 [12:00<02:56, 1.29s/it]
Training 1/1 epoch (loss 2.6642): 78%|ββββββββ | 488/625 [12:02<02:56, 1.29s/it]
Training 1/1 epoch (loss 2.6642): 78%|ββββββββ | 489/625 [12:02<03:23, 1.50s/it]
Training 1/1 epoch (loss 2.7480): 78%|ββββββββ | 489/625 [12:03<03:23, 1.50s/it]
Training 1/1 epoch (loss 2.7480): 78%|ββββββββ | 490/625 [12:03<03:22, 1.50s/it]
Training 1/1 epoch (loss 2.7254): 78%|ββββββββ | 490/625 [12:05<03:22, 1.50s/it]
Training 1/1 epoch (loss 2.7254): 79%|ββββββββ | 491/625 [12:05<03:17, 1.47s/it]
Training 1/1 epoch (loss 2.5732): 79%|ββββββββ | 491/625 [12:06<03:17, 1.47s/it]
Training 1/1 epoch (loss 2.5732): 79%|ββββββββ | 492/625 [12:06<03:18, 1.49s/it]
Training 1/1 epoch (loss 2.5868): 79%|ββββββββ | 492/625 [12:07<03:18, 1.49s/it]
Training 1/1 epoch (loss 2.5868): 79%|ββββββββ | 493/625 [12:07<03:08, 1.43s/it]
Training 1/1 epoch (loss 2.4749): 79%|ββββββββ | 493/625 [12:08<03:08, 1.43s/it]
Training 1/1 epoch (loss 2.4749): 79%|ββββββββ | 494/625 [12:08<02:50, 1.30s/it]
Training 1/1 epoch (loss 2.4451): 79%|ββββββββ | 494/625 [12:11<02:50, 1.30s/it]
Training 1/1 epoch (loss 2.4451): 79%|ββββββββ | 495/625 [12:11<03:28, 1.61s/it]
Training 1/1 epoch (loss 2.6728): 79%|ββββββββ | 495/625 [12:12<03:28, 1.61s/it]
Training 1/1 epoch (loss 2.6728): 79%|ββββββββ | 496/625 [12:12<03:06, 1.44s/it]
Training 1/1 epoch (loss 2.7716): 79%|ββββββββ | 496/625 [12:14<03:06, 1.44s/it]
Training 1/1 epoch (loss 2.7716): 80%|ββββββββ | 497/625 [12:14<03:25, 1.61s/it]
Training 1/1 epoch (loss 2.6192): 80%|ββββββββ | 497/625 [12:15<03:25, 1.61s/it]
Training 1/1 epoch (loss 2.6192): 80%|ββββββββ | 498/625 [12:15<03:12, 1.52s/it]
Training 1/1 epoch (loss 2.9054): 80%|ββββββββ | 498/625 [12:16<03:12, 1.52s/it]
Training 1/1 epoch (loss 2.9054): 80%|ββββββββ | 499/625 [12:16<02:38, 1.26s/it]
Training 1/1 epoch (loss 2.5493): 80%|ββββββββ | 499/625 [12:17<02:38, 1.26s/it]
Training 1/1 epoch (loss 2.5493): 80%|ββββββββ | 500/625 [12:17<02:57, 1.42s/it]
Training 1/1 epoch (loss 2.6556): 80%|ββββββββ | 500/625 [12:19<02:57, 1.42s/it]
Training 1/1 epoch (loss 2.6556): 80%|ββββββββ | 501/625 [12:19<02:59, 1.45s/it]
Training 1/1 epoch (loss 2.6945): 80%|ββββββββ | 501/625 [12:20<02:59, 1.45s/it]
Training 1/1 epoch (loss 2.6945): 80%|ββββββββ | 502/625 [12:20<02:37, 1.28s/it]
Training 1/1 epoch (loss 2.6503): 80%|ββββββββ | 502/625 [12:22<02:37, 1.28s/it]
Training 1/1 epoch (loss 2.6503): 80%|ββββββββ | 503/625 [12:22<03:06, 1.53s/it]
Training 1/1 epoch (loss 2.8068): 80%|ββββββββ | 503/625 [12:23<03:06, 1.53s/it]
Training 1/1 epoch (loss 2.8068): 81%|ββββββββ | 504/625 [12:23<02:42, 1.35s/it]
Training 1/1 epoch (loss 2.7201): 81%|ββββββββ | 504/625 [12:25<02:42, 1.35s/it]
Training 1/1 epoch (loss 2.7201): 81%|ββββββββ | 505/625 [12:25<02:59, 1.49s/it]
Training 1/1 epoch (loss 2.7836): 81%|ββββββββ | 505/625 [12:27<02:59, 1.49s/it]
Training 1/1 epoch (loss 2.7836): 81%|ββββββββ | 506/625 [12:27<03:11, 1.61s/it]
Training 1/1 epoch (loss 2.5287): 81%|ββββββββ | 506/625 [12:27<03:11, 1.61s/it]
Training 1/1 epoch (loss 2.5287): 81%|ββββββββ | 507/625 [12:27<02:31, 1.28s/it]
Training 1/1 epoch (loss 2.5359): 81%|ββββββββ | 507/625 [12:29<02:31, 1.28s/it]
Training 1/1 epoch (loss 2.5359): 81%|βββββββββ | 508/625 [12:29<02:37, 1.34s/it]
Training 1/1 epoch (loss 2.6375): 81%|βββββββββ | 508/625 [12:30<02:37, 1.34s/it]
Training 1/1 epoch (loss 2.6375): 81%|βββββββββ | 509/625 [12:30<02:46, 1.43s/it]
Training 1/1 epoch (loss 2.6527): 81%|βββββββββ | 509/625 [12:31<02:46, 1.43s/it]
Training 1/1 epoch (loss 2.6527): 82%|βββββββββ | 510/625 [12:31<02:11, 1.14s/it]
Training 1/1 epoch (loss 2.8590): 82%|βββββββββ | 510/625 [12:32<02:11, 1.14s/it]
Training 1/1 epoch (loss 2.8590): 82%|βββββββββ | 511/625 [12:32<01:59, 1.05s/it]
Training 1/1 epoch (loss 2.4422): 82%|βββββββββ | 511/625 [12:34<01:59, 1.05s/it]
Training 1/1 epoch (loss 2.4422): 82%|βββββββββ | 512/625 [12:34<02:39, 1.41s/it]
Training 1/1 epoch (loss 2.5137): 82%|βββββββββ | 512/625 [12:34<02:39, 1.41s/it]
Training 1/1 epoch (loss 2.5137): 82%|βββββββββ | 513/625 [12:34<02:13, 1.19s/it]
Training 1/1 epoch (loss 2.6581): 82%|βββββββββ | 513/625 [12:36<02:13, 1.19s/it]
Training 1/1 epoch (loss 2.6581): 82%|βββββββββ | 514/625 [12:36<02:36, 1.41s/it]
Training 1/1 epoch (loss 2.6174): 82%|βββββββββ | 514/625 [12:38<02:36, 1.41s/it]
Training 1/1 epoch (loss 2.6174): 82%|βββββββββ | 515/625 [12:38<02:36, 1.42s/it]
Training 1/1 epoch (loss 2.7969): 82%|βββββββββ | 515/625 [12:38<02:36, 1.42s/it]
Training 1/1 epoch (loss 2.7969): 83%|βββββββββ | 516/625 [12:38<02:08, 1.18s/it]
Training 1/1 epoch (loss 2.6267): 83%|βββββββββ | 516/625 [12:40<02:08, 1.18s/it]
Training 1/1 epoch (loss 2.6267): 83%|βββββββββ | 517/625 [12:40<02:12, 1.22s/it]
Training 1/1 epoch (loss 2.8309): 83%|βββββββββ | 517/625 [12:41<02:12, 1.22s/it]
Training 1/1 epoch (loss 2.8309): 83%|βββββββββ | 518/625 [12:41<02:18, 1.29s/it]
Training 1/1 epoch (loss 2.8336): 83%|βββββββββ | 518/625 [12:43<02:18, 1.29s/it]
Training 1/1 epoch (loss 2.8336): 83%|βββββββββ | 519/625 [12:43<02:22, 1.35s/it]
Training 1/1 epoch (loss 2.8400): 83%|βββββββββ | 519/625 [12:44<02:22, 1.35s/it]
Training 1/1 epoch (loss 2.8400): 83%|βββββββββ | 520/625 [12:44<02:28, 1.41s/it]
Training 1/1 epoch (loss 2.8379): 83%|βββββββββ | 520/625 [12:45<02:28, 1.41s/it]
Training 1/1 epoch (loss 2.8379): 83%|βββββββββ | 521/625 [12:45<02:01, 1.17s/it]
Training 1/1 epoch (loss 2.6582): 83%|βββββββββ | 521/625 [12:47<02:01, 1.17s/it]
Training 1/1 epoch (loss 2.6582): 84%|βββββββββ | 522/625 [12:47<02:25, 1.42s/it]
Training 1/1 epoch (loss 2.7319): 84%|βββββββββ | 522/625 [12:49<02:25, 1.42s/it]
Training 1/1 epoch (loss 2.7319): 84%|βββββββββ | 523/625 [12:49<02:42, 1.59s/it]
Training 1/1 epoch (loss 2.6043): 84%|βββββββββ | 523/625 [12:50<02:42, 1.59s/it]
Training 1/1 epoch (loss 2.6043): 84%|βββββββββ | 524/625 [12:50<02:23, 1.43s/it]
Training 1/1 epoch (loss 2.7414): 84%|βββββββββ | 524/625 [12:52<02:23, 1.43s/it]
Training 1/1 epoch (loss 2.7414): 84%|βββββββββ | 525/625 [12:52<02:37, 1.57s/it]
Training 1/1 epoch (loss 2.9143): 84%|βββββββββ | 525/625 [12:53<02:37, 1.57s/it]
Training 1/1 epoch (loss 2.9143): 84%|βββββββββ | 526/625 [12:53<02:30, 1.52s/it]
Training 1/1 epoch (loss 2.7462): 84%|βββββββββ | 526/625 [12:54<02:30, 1.52s/it]
Training 1/1 epoch (loss 2.7462): 84%|βββββββββ | 527/625 [12:54<02:20, 1.43s/it]
Training 1/1 epoch (loss 2.6541): 84%|βββββββββ | 527/625 [12:56<02:20, 1.43s/it]
Training 1/1 epoch (loss 2.6541): 84%|βββββββββ | 528/625 [12:56<02:20, 1.45s/it]
Training 1/1 epoch (loss 2.6840): 84%|βββββββββ | 528/625 [12:57<02:20, 1.45s/it]
Training 1/1 epoch (loss 2.6840): 85%|βββββββββ | 529/625 [12:57<02:07, 1.33s/it]
Training 1/1 epoch (loss 2.5914): 85%|βββββββββ | 529/625 [12:59<02:07, 1.33s/it]
Training 1/1 epoch (loss 2.5914): 85%|βββββββββ | 530/625 [12:59<02:21, 1.49s/it]
Training 1/1 epoch (loss 2.8261): 85%|βββββββββ | 530/625 [13:01<02:21, 1.49s/it]
Training 1/1 epoch (loss 2.8261): 85%|βββββββββ | 531/625 [13:01<02:34, 1.64s/it]
Training 1/1 epoch (loss 2.6155): 85%|βββββββββ | 531/625 [13:01<02:34, 1.64s/it]
Training 1/1 epoch (loss 2.6155): 85%|βββββββββ | 532/625 [13:01<01:59, 1.28s/it]
Training 1/1 epoch (loss 2.7735): 85%|βββββββββ | 532/625 [13:04<01:59, 1.28s/it]
Training 1/1 epoch (loss 2.7735): 85%|βββββββββ | 533/625 [13:04<02:27, 1.60s/it]
Training 1/1 epoch (loss 2.8644): 85%|βββββββββ | 533/625 [13:05<02:27, 1.60s/it]
Training 1/1 epoch (loss 2.8644): 85%|βββββββββ | 534/625 [13:05<02:31, 1.67s/it]
Training 1/1 epoch (loss 2.6011): 85%|βββββββββ | 534/625 [13:06<02:31, 1.67s/it]
Training 1/1 epoch (loss 2.6011): 86%|βββββββββ | 535/625 [13:06<02:01, 1.35s/it]
Training 1/1 epoch (loss 2.7114): 86%|βββββββββ | 535/625 [13:08<02:01, 1.35s/it]
Training 1/1 epoch (loss 2.7114): 86%|βββββββββ | 536/625 [13:08<02:02, 1.37s/it]
Training 1/1 epoch (loss 2.8042): 86%|βββββββββ | 536/625 [13:09<02:02, 1.37s/it]
Training 1/1 epoch (loss 2.8042): 86%|βββββββββ | 537/625 [13:09<01:59, 1.36s/it]
Training 1/1 epoch (loss 2.6035): 86%|βββββββββ | 537/625 [13:10<01:59, 1.36s/it]
Training 1/1 epoch (loss 2.6035): 86%|βββββββββ | 538/625 [13:10<01:46, 1.22s/it]
Training 1/1 epoch (loss 2.6631): 86%|βββββββββ | 538/625 [13:12<01:46, 1.22s/it]
Training 1/1 epoch (loss 2.6631): 86%|βββββββββ | 539/625 [13:12<02:08, 1.50s/it]
Training 1/1 epoch (loss 2.5412): 86%|βββββββββ | 539/625 [13:13<02:08, 1.50s/it]
Training 1/1 epoch (loss 2.5412): 86%|βββββββββ | 540/625 [13:13<02:04, 1.47s/it]
Training 1/1 epoch (loss 2.5186): 86%|βββββββββ | 540/625 [13:15<02:04, 1.47s/it]
Training 1/1 epoch (loss 2.5186): 87%|βββββββββ | 541/625 [13:15<02:16, 1.62s/it]
Training 1/1 epoch (loss 2.6552): 87%|βββββββββ | 541/625 [13:17<02:16, 1.62s/it]
Training 1/1 epoch (loss 2.6552): 87%|βββββββββ | 542/625 [13:17<02:28, 1.79s/it]
Training 1/1 epoch (loss 2.5979): 87%|βββββββββ | 542/625 [13:18<02:28, 1.79s/it]
Training 1/1 epoch (loss 2.5979): 87%|βββββββββ | 543/625 [13:18<01:53, 1.39s/it]
Training 1/1 epoch (loss 2.8029): 87%|βββββββββ | 543/625 [13:20<01:53, 1.39s/it]
Training 1/1 epoch (loss 2.8029): 87%|βββββββββ | 544/625 [13:20<01:58, 1.46s/it]
Training 1/1 epoch (loss 2.9240): 87%|βββββββββ | 544/625 [13:20<01:58, 1.46s/it]
Training 1/1 epoch (loss 2.9240): 87%|βββββββββ | 545/625 [13:20<01:43, 1.30s/it]
Training 1/1 epoch (loss 2.6574): 87%|βββββββββ | 545/625 [13:21<01:43, 1.30s/it]
Training 1/1 epoch (loss 2.6574): 87%|βββββββββ | 546/625 [13:21<01:29, 1.13s/it]
Training 1/1 epoch (loss 2.5855): 87%|βββββββββ | 546/625 [13:23<01:29, 1.13s/it]
Training 1/1 epoch (loss 2.5855): 88%|βββββββββ | 547/625 [13:23<01:53, 1.45s/it]
Training 1/1 epoch (loss 2.4278): 88%|βββββββββ | 547/625 [13:24<01:53, 1.45s/it]
Training 1/1 epoch (loss 2.4278): 88%|βββββββββ | 548/625 [13:24<01:36, 1.25s/it]
Training 1/1 epoch (loss 2.6733): 88%|βββββββββ | 548/625 [13:25<01:36, 1.25s/it]
Training 1/1 epoch (loss 2.6733): 88%|βββββββββ | 549/625 [13:25<01:16, 1.01s/it]
Training 1/1 epoch (loss 2.5193): 88%|βββββββββ | 549/625 [13:27<01:16, 1.01s/it]
Training 1/1 epoch (loss 2.5193): 88%|βββββββββ | 550/625 [13:27<01:47, 1.44s/it]
Training 1/1 epoch (loss 2.8316): 88%|βββββββββ | 550/625 [13:28<01:47, 1.44s/it]
Training 1/1 epoch (loss 2.8316): 88%|βββββββββ | 551/625 [13:28<01:41, 1.37s/it]
Training 1/1 epoch (loss 2.5789): 88%|βββββββββ | 551/625 [13:29<01:41, 1.37s/it]
Training 1/1 epoch (loss 2.5789): 88%|βββββββββ | 552/625 [13:29<01:24, 1.15s/it]
Training 1/1 epoch (loss 2.6127): 88%|βββββββββ | 552/625 [13:31<01:24, 1.15s/it]
Training 1/1 epoch (loss 2.6127): 88%|βββββββββ | 553/625 [13:31<01:36, 1.33s/it]
Training 1/1 epoch (loss 2.6665): 88%|βββββββββ | 553/625 [13:32<01:36, 1.33s/it]
Training 1/1 epoch (loss 2.6665): 89%|βββββββββ | 554/625 [13:32<01:29, 1.26s/it]
Training 1/1 epoch (loss 2.6984): 89%|βββββββββ | 554/625 [13:33<01:29, 1.26s/it]
Training 1/1 epoch (loss 2.6984): 89%|βββββββββ | 555/625 [13:33<01:27, 1.25s/it]
Training 1/1 epoch (loss 2.5226): 89%|βββββββββ | 555/625 [13:34<01:27, 1.25s/it]
Training 1/1 epoch (loss 2.5226): 89%|βββββββββ | 556/625 [13:34<01:31, 1.33s/it]
Training 1/1 epoch (loss 2.6419): 89%|βββββββββ | 556/625 [13:36<01:31, 1.33s/it]
Training 1/1 epoch (loss 2.6419): 89%|βββββββββ | 557/625 [13:36<01:27, 1.29s/it]
Training 1/1 epoch (loss 2.7361): 89%|βββββββββ | 557/625 [13:38<01:27, 1.29s/it]
Training 1/1 epoch (loss 2.7361): 89%|βββββββββ | 558/625 [13:38<01:38, 1.47s/it]
Training 1/1 epoch (loss 2.5239): 89%|βββββββββ | 558/625 [13:39<01:38, 1.47s/it]
Training 1/1 epoch (loss 2.5239): 89%|βββββββββ | 559/625 [13:39<01:41, 1.54s/it]
Training 1/1 epoch (loss 2.7180): 89%|βββββββββ | 559/625 [13:40<01:41, 1.54s/it]
Training 1/1 epoch (loss 2.7180): 90%|βββββββββ | 560/625 [13:40<01:32, 1.43s/it]
Training 1/1 epoch (loss 2.7451): 90%|βββββββββ | 560/625 [13:42<01:32, 1.43s/it]
Training 1/1 epoch (loss 2.7451): 90%|βββββββββ | 561/625 [13:42<01:38, 1.54s/it]
Training 1/1 epoch (loss 2.8230): 90%|βββββββββ | 561/625 [13:43<01:38, 1.54s/it]
Training 1/1 epoch (loss 2.8230): 90%|βββββββββ | 562/625 [13:43<01:25, 1.36s/it]
Training 1/1 epoch (loss 2.6901): 90%|βββββββββ | 562/625 [13:44<01:25, 1.36s/it]
Training 1/1 epoch (loss 2.6901): 90%|βββββββββ | 563/625 [13:44<01:16, 1.23s/it]
Training 1/1 epoch (loss 2.4369): 90%|βββββββββ | 563/625 [13:46<01:16, 1.23s/it]
Training 1/1 epoch (loss 2.4369): 90%|βββββββββ | 564/625 [13:46<01:25, 1.40s/it]
Training 1/1 epoch (loss 2.6236): 90%|βββββββββ | 564/625 [13:47<01:25, 1.40s/it]
Training 1/1 epoch (loss 2.6236): 90%|βββββββββ | 565/625 [13:47<01:10, 1.18s/it]
Training 1/1 epoch (loss 2.6408): 90%|βββββββββ | 565/625 [13:47<01:10, 1.18s/it]
Training 1/1 epoch (loss 2.6408): 91%|βββββββββ | 566/625 [13:47<01:03, 1.08s/it]
Training 1/1 epoch (loss 2.5399): 91%|βββββββββ | 566/625 [13:49<01:03, 1.08s/it]
Training 1/1 epoch (loss 2.5399): 91%|βββββββββ | 567/625 [13:49<01:06, 1.15s/it]
Training 1/1 epoch (loss 2.6092): 91%|βββββββββ | 567/625 [13:50<01:06, 1.15s/it]
Training 1/1 epoch (loss 2.6092): 91%|βββββββββ | 568/625 [13:50<01:06, 1.17s/it]
Training 1/1 epoch (loss 2.8657): 91%|βββββββββ | 568/625 [13:51<01:06, 1.17s/it]
Training 1/1 epoch (loss 2.8657): 91%|βββββββββ | 569/625 [13:51<01:09, 1.24s/it]
Training 1/1 epoch (loss 2.8378): 91%|βββββββββ | 569/625 [13:53<01:09, 1.24s/it]
Training 1/1 epoch (loss 2.8378): 91%|βββββββββ | 570/625 [13:53<01:18, 1.42s/it]
Training 1/1 epoch (loss 2.8400): 91%|βββββββββ | 570/625 [13:54<01:18, 1.42s/it]
Training 1/1 epoch (loss 2.8400): 91%|ββββββββββ| 571/625 [13:54<01:04, 1.20s/it]
Training 1/1 epoch (loss 2.8552): 91%|ββββββββββ| 571/625 [13:55<01:04, 1.20s/it]
Training 1/1 epoch (loss 2.8552): 92%|ββββββββββ| 572/625 [13:55<01:09, 1.30s/it]
Training 1/1 epoch (loss 2.6271): 92%|ββββββββββ| 572/625 [13:57<01:09, 1.30s/it]
Training 1/1 epoch (loss 2.6271): 92%|ββββββββββ| 573/625 [13:57<01:10, 1.36s/it]
Training 1/1 epoch (loss 2.4922): 92%|ββββββββββ| 573/625 [13:58<01:10, 1.36s/it]
Training 1/1 epoch (loss 2.4922): 92%|ββββββββββ| 574/625 [13:58<01:12, 1.42s/it]
Training 1/1 epoch (loss 2.4763): 92%|ββββββββββ| 574/625 [14:00<01:12, 1.42s/it]
Training 1/1 epoch (loss 2.4763): 92%|ββββββββββ| 575/625 [14:00<01:16, 1.53s/it]
Training 1/1 epoch (loss 2.4898): 92%|ββββββββββ| 575/625 [14:01<01:16, 1.53s/it]
Training 1/1 epoch (loss 2.4898): 92%|ββββββββββ| 576/625 [14:01<01:06, 1.35s/it]
Training 1/1 epoch (loss 2.6648): 92%|ββββββββββ| 576/625 [14:03<01:06, 1.35s/it]
Training 1/1 epoch (loss 2.6648): 92%|ββββββββββ| 577/625 [14:03<01:13, 1.52s/it]
Training 1/1 epoch (loss 2.6563): 92%|ββββββββββ| 577/625 [14:04<01:13, 1.52s/it]
Training 1/1 epoch (loss 2.6563): 92%|ββββββββββ| 578/625 [14:04<01:05, 1.40s/it]
Training 1/1 epoch (loss 2.9191): 92%|ββββββββββ| 578/625 [14:06<01:05, 1.40s/it]
Training 1/1 epoch (loss 2.9191): 93%|ββββββββββ| 579/625 [14:06<01:03, 1.38s/it]
Training 1/1 epoch (loss 2.6404): 93%|ββββββββββ| 579/625 [14:07<01:03, 1.38s/it]
Training 1/1 epoch (loss 2.6404): 93%|ββββββββββ| 580/625 [14:07<01:08, 1.52s/it]
Training 1/1 epoch (loss 2.6309): 93%|ββββββββββ| 580/625 [14:09<01:08, 1.52s/it]
Training 1/1 epoch (loss 2.6309): 93%|ββββββββββ| 581/625 [14:09<01:03, 1.43s/it]
Training 1/1 epoch (loss 2.7193): 93%|ββββββββββ| 581/625 [14:10<01:03, 1.43s/it]
Training 1/1 epoch (loss 2.7193): 93%|ββββββββββ| 582/625 [14:10<00:56, 1.32s/it]
Training 1/1 epoch (loss 2.8206): 93%|ββββββββββ| 582/625 [14:11<00:56, 1.32s/it]
Training 1/1 epoch (loss 2.8206): 93%|ββββββββββ| 583/625 [14:11<00:56, 1.35s/it]
Training 1/1 epoch (loss 2.4461): 93%|ββββββββββ| 583/625 [14:13<00:56, 1.35s/it]
Training 1/1 epoch (loss 2.4461): 93%|ββββββββββ| 584/625 [14:13<00:58, 1.43s/it]
Training 1/1 epoch (loss 2.6287): 93%|ββββββββββ| 584/625 [14:15<00:58, 1.43s/it]
Training 1/1 epoch (loss 2.6287): 94%|ββββββββββ| 585/625 [14:15<01:01, 1.53s/it]
Training 1/1 epoch (loss 2.5113): 94%|ββββββββββ| 585/625 [14:16<01:01, 1.53s/it]
Training 1/1 epoch (loss 2.5113): 94%|ββββββββββ| 586/625 [14:16<01:03, 1.62s/it]
Training 1/1 epoch (loss 2.8413): 94%|ββββββββββ| 586/625 [14:17<01:03, 1.62s/it]
Training 1/1 epoch (loss 2.8413): 94%|ββββββββββ| 587/625 [14:17<00:54, 1.43s/it]
Training 1/1 epoch (loss 2.6762): 94%|ββββββββββ| 587/625 [14:19<00:54, 1.43s/it]
Training 1/1 epoch (loss 2.6762): 94%|ββββββββββ| 588/625 [14:19<00:58, 1.59s/it]
Training 1/1 epoch (loss 2.6679): 94%|ββββββββββ| 588/625 [14:22<00:58, 1.59s/it]
Training 1/1 epoch (loss 2.6679): 94%|ββββββββββ| 589/625 [14:22<01:06, 1.85s/it]
Training 1/1 epoch (loss 2.5722): 94%|ββββββββββ| 589/625 [14:22<01:06, 1.85s/it]
Training 1/1 epoch (loss 2.5722): 94%|ββββββββββ| 590/625 [14:22<00:50, 1.44s/it]
Training 1/1 epoch (loss 2.9428): 94%|ββββββββββ| 590/625 [14:24<00:50, 1.44s/it]
Training 1/1 epoch (loss 2.9428): 95%|ββββββββββ| 591/625 [14:24<00:53, 1.58s/it]
Training 1/1 epoch (loss 2.5571): 95%|ββββββββββ| 591/625 [14:26<00:53, 1.58s/it]
Training 1/1 epoch (loss 2.5571): 95%|ββββββββββ| 592/625 [14:26<00:53, 1.63s/it]
Training 1/1 epoch (loss 2.6722): 95%|ββββββββββ| 592/625 [14:27<00:53, 1.63s/it]
Training 1/1 epoch (loss 2.6722): 95%|ββββββββββ| 593/625 [14:27<00:47, 1.50s/it]
Training 1/1 epoch (loss 2.7136): 95%|ββββββββββ| 593/625 [14:29<00:47, 1.50s/it]
Training 1/1 epoch (loss 2.7136): 95%|ββββββββββ| 594/625 [14:29<00:50, 1.63s/it]
Training 1/1 epoch (loss 2.5325): 95%|ββββββββββ| 594/625 [14:30<00:50, 1.63s/it]
Training 1/1 epoch (loss 2.5325): 95%|ββββββββββ| 595/625 [14:30<00:44, 1.49s/it]
Training 1/1 epoch (loss 2.5843): 95%|ββββββββββ| 595/625 [14:32<00:44, 1.49s/it]
Training 1/1 epoch (loss 2.5843): 95%|ββββββββββ| 596/625 [14:32<00:41, 1.44s/it]
Training 1/1 epoch (loss 2.5613): 95%|ββββββββββ| 596/625 [14:33<00:41, 1.44s/it]
Training 1/1 epoch (loss 2.5613): 96%|ββββββββββ| 597/625 [14:33<00:43, 1.54s/it]
Training 1/1 epoch (loss 2.7103): 96%|ββββββββββ| 597/625 [14:35<00:43, 1.54s/it]
Training 1/1 epoch (loss 2.7103): 96%|ββββββββββ| 598/625 [14:35<00:40, 1.49s/it]
Training 1/1 epoch (loss 2.7207): 96%|ββββββββββ| 598/625 [14:36<00:40, 1.49s/it]
Training 1/1 epoch (loss 2.7207): 96%|ββββββββββ| 599/625 [14:36<00:36, 1.41s/it]
Training 1/1 epoch (loss 2.7044): 96%|ββββββββββ| 599/625 [14:39<00:36, 1.41s/it]
Training 1/1 epoch (loss 2.7044): 96%|ββββββββββ| 600/625 [14:39<00:45, 1.81s/it]
Training 1/1 epoch (loss 2.5593): 96%|ββββββββββ| 600/625 [14:40<00:45, 1.81s/it]
Training 1/1 epoch (loss 2.5593): 96%|ββββββββββ| 601/625 [14:40<00:39, 1.65s/it]
Training 1/1 epoch (loss 2.6698): 96%|ββββββββββ| 601/625 [14:41<00:39, 1.65s/it]
Training 1/1 epoch (loss 2.6698): 96%|ββββββββββ| 602/625 [14:41<00:34, 1.50s/it]
Training 1/1 epoch (loss 2.5622): 96%|ββββββββββ| 602/625 [14:43<00:34, 1.50s/it]
Training 1/1 epoch (loss 2.5622): 96%|ββββββββββ| 603/625 [14:43<00:39, 1.79s/it]
Training 1/1 epoch (loss 2.6432): 96%|ββββββββββ| 603/625 [14:44<00:39, 1.79s/it]
Training 1/1 epoch (loss 2.6432): 97%|ββββββββββ| 604/625 [14:44<00:29, 1.43s/it]
Training 1/1 epoch (loss 2.7479): 97%|ββββββββββ| 604/625 [14:46<00:29, 1.43s/it]
Training 1/1 epoch (loss 2.7479): 97%|ββββββββββ| 605/625 [14:46<00:32, 1.64s/it]
Training 1/1 epoch (loss 2.6479): 97%|ββββββββββ| 605/625 [14:48<00:32, 1.64s/it]
Training 1/1 epoch (loss 2.6479): 97%|ββββββββββ| 606/625 [14:48<00:31, 1.64s/it]
Training 1/1 epoch (loss 2.6513): 97%|ββββββββββ| 606/625 [14:49<00:31, 1.64s/it]
Training 1/1 epoch (loss 2.6513): 97%|ββββββββββ| 607/625 [14:49<00:27, 1.53s/it]
Training 1/1 epoch (loss 2.8008): 97%|ββββββββββ| 607/625 [14:52<00:27, 1.53s/it]
Training 1/1 epoch (loss 2.8008): 97%|ββββββββββ| 608/625 [14:52<00:32, 1.90s/it]
Training 1/1 epoch (loss 2.7124): 97%|ββββββββββ| 608/625 [14:53<00:32, 1.90s/it]
Training 1/1 epoch (loss 2.7124): 97%|ββββββββββ| 609/625 [14:53<00:26, 1.67s/it]
Training 1/1 epoch (loss 2.7556): 97%|ββββββββββ| 609/625 [14:55<00:26, 1.67s/it]
Training 1/1 epoch (loss 2.7556): 98%|ββββββββββ| 610/625 [14:55<00:28, 1.90s/it]
Training 1/1 epoch (loss 2.5810): 98%|ββββββββββ| 610/625 [14:57<00:28, 1.90s/it]
Training 1/1 epoch (loss 2.5810): 98%|ββββββββββ| 611/625 [14:57<00:23, 1.69s/it]
Training 1/1 epoch (loss 2.9056): 98%|ββββββββββ| 611/625 [14:57<00:23, 1.69s/it]
Training 1/1 epoch (loss 2.9056): 98%|ββββββββββ| 612/625 [14:57<00:17, 1.35s/it]
Training 1/1 epoch (loss 2.5708): 98%|ββββββββββ| 612/625 [14:59<00:17, 1.35s/it]
Training 1/1 epoch (loss 2.5708): 98%|ββββββββββ| 613/625 [14:59<00:16, 1.35s/it]
Training 1/1 epoch (loss 2.7470): 98%|ββββββββββ| 613/625 [15:00<00:16, 1.35s/it]
Training 1/1 epoch (loss 2.7470): 98%|ββββββββββ| 614/625 [15:00<00:15, 1.40s/it]
Training 1/1 epoch (loss 2.5741): 98%|ββββββββββ| 614/625 [15:01<00:15, 1.40s/it]
Training 1/1 epoch (loss 2.5741): 98%|ββββββββββ| 615/625 [15:01<00:11, 1.17s/it]
Training 1/1 epoch (loss 2.5535): 98%|ββββββββββ| 615/625 [15:03<00:11, 1.17s/it]
Training 1/1 epoch (loss 2.5535): 99%|ββββββββββ| 616/625 [15:03<00:12, 1.42s/it]
Training 1/1 epoch (loss 2.7749): 99%|ββββββββββ| 616/625 [15:04<00:12, 1.42s/it]
Training 1/1 epoch (loss 2.7749): 99%|ββββββββββ| 617/625 [15:04<00:11, 1.46s/it]
Training 1/1 epoch (loss 2.8078): 99%|ββββββββββ| 617/625 [15:05<00:11, 1.46s/it]
Training 1/1 epoch (loss 2.8078): 99%|ββββββββββ| 618/625 [15:05<00:08, 1.23s/it]
Training 1/1 epoch (loss 2.6477): 99%|ββββββββββ| 618/625 [15:07<00:08, 1.23s/it]
Training 1/1 epoch (loss 2.6477): 99%|ββββββββββ| 619/625 [15:07<00:09, 1.53s/it]
Training 1/1 epoch (loss 2.5277): 99%|ββββββββββ| 619/625 [15:09<00:09, 1.53s/it]
Training 1/1 epoch (loss 2.5277): 99%|ββββββββββ| 620/625 [15:09<00:08, 1.67s/it]
Training 1/1 epoch (loss 2.7598): 99%|ββββββββββ| 620/625 [15:10<00:08, 1.67s/it]
Training 1/1 epoch (loss 2.7598): 99%|ββββββββββ| 621/625 [15:10<00:05, 1.31s/it]
Training 1/1 epoch (loss 2.5991): 99%|ββββββββββ| 621/625 [15:12<00:05, 1.31s/it]
Training 1/1 epoch (loss 2.5991): 100%|ββββββββββ| 622/625 [15:12<00:04, 1.64s/it]
Training 1/1 epoch (loss 2.7529): 100%|ββββββββββ| 622/625 [15:13<00:04, 1.64s/it]
Training 1/1 epoch (loss 2.7529): 100%|ββββββββββ| 623/625 [15:13<00:03, 1.56s/it]
Training 1/1 epoch (loss 2.5256): 100%|ββββββββββ| 623/625 [15:14<00:03, 1.56s/it]
Training 1/1 epoch (loss 2.5256): 100%|ββββββββββ| 624/625 [15:14<00:01, 1.36s/it]
Training 1/1 epoch (loss 2.6628): 100%|ββββββββββ| 624/625 [15:16<00:01, 1.36s/it]
Training 1/1 epoch (loss 2.6628): 100%|ββββββββββ| 625/625 [15:16<00:00, 1.51s/it]
Training 1/1 epoch (loss 2.6628): 100%|ββββββββββ| 625/625 [15:16<00:00, 1.47s/it] |