|
Training 1/1 epoch (loss 2.5427): 0%| | 0/625 [00:10<?, ?it/s]
Training 1/1 epoch (loss 2.5427): 0%| | 1/625 [00:10<1:45:12, 10.12s/it]
Training 1/1 epoch (loss 2.8152): 0%| | 1/625 [00:11<1:45:12, 10.12s/it]
Training 1/1 epoch (loss 2.8152): 0%| | 2/625 [00:11<53:42, 5.17s/it]
Training 1/1 epoch (loss 2.5731): 0%| | 2/625 [00:14<53:42, 5.17s/it]
Training 1/1 epoch (loss 2.5731): 0%| | 3/625 [00:14<39:45, 3.84s/it]
Training 1/1 epoch (loss 2.4448): 0%| | 3/625 [00:15<39:45, 3.84s/it]
Training 1/1 epoch (loss 2.4448): 1%| | 4/625 [00:15<31:53, 3.08s/it]
Training 1/1 epoch (loss 2.8287): 1%| | 4/625 [00:16<31:53, 3.08s/it]
Training 1/1 epoch (loss 2.8287): 1%| | 5/625 [00:16<22:01, 2.13s/it]
Training 1/1 epoch (loss 2.6341): 1%| | 5/625 [00:18<22:01, 2.13s/it]
Training 1/1 epoch (loss 2.6341): 1%| | 6/625 [00:18<21:08, 2.05s/it]
Training 1/1 epoch (loss 2.6672): 1%| | 6/625 [00:20<21:08, 2.05s/it]
Training 1/1 epoch (loss 2.6672): 1%| | 7/625 [00:20<20:08, 1.96s/it]
Training 1/1 epoch (loss 2.9926): 1%| | 7/625 [00:21<20:08, 1.96s/it]
Training 1/1 epoch (loss 2.9926): 1%|β | 8/625 [00:21<16:48, 1.63s/it]
Training 1/1 epoch (loss 2.5593): 1%|β | 8/625 [00:22<16:48, 1.63s/it]
Training 1/1 epoch (loss 2.5593): 1%|β | 9/625 [00:22<15:06, 1.47s/it]
Training 1/1 epoch (loss 2.9267): 1%|β | 9/625 [00:23<15:06, 1.47s/it]
Training 1/1 epoch (loss 2.9267): 2%|β | 10/625 [00:23<13:11, 1.29s/it]
Training 1/1 epoch (loss 2.4823): 2%|β | 10/625 [00:25<13:11, 1.29s/it]
Training 1/1 epoch (loss 2.4823): 2%|β | 11/625 [00:25<16:20, 1.60s/it]
Training 1/1 epoch (loss 2.5900): 2%|β | 11/625 [00:27<16:20, 1.60s/it]
Training 1/1 epoch (loss 2.5900): 2%|β | 12/625 [00:27<19:05, 1.87s/it]
Training 1/1 epoch (loss 2.6741): 2%|β | 12/625 [00:28<19:05, 1.87s/it]
Training 1/1 epoch (loss 2.6741): 2%|β | 13/625 [00:28<15:14, 1.49s/it]
Training 1/1 epoch (loss 2.9819): 2%|β | 13/625 [00:30<15:14, 1.49s/it]
Training 1/1 epoch (loss 2.9819): 2%|β | 14/625 [00:30<18:08, 1.78s/it]
Training 1/1 epoch (loss 2.5660): 2%|β | 14/625 [00:32<18:08, 1.78s/it]
Training 1/1 epoch (loss 2.5660): 2%|β | 15/625 [00:32<17:36, 1.73s/it]
Training 1/1 epoch (loss 2.8358): 2%|β | 15/625 [00:33<17:36, 1.73s/it]
Training 1/1 epoch (loss 2.8358): 3%|β | 16/625 [00:33<14:45, 1.45s/it]
Training 1/1 epoch (loss 2.7763): 3%|β | 16/625 [00:34<14:45, 1.45s/it]
Training 1/1 epoch (loss 2.7763): 3%|β | 17/625 [00:34<15:04, 1.49s/it]
Training 1/1 epoch (loss 2.6999): 3%|β | 17/625 [00:36<15:04, 1.49s/it]
Training 1/1 epoch (loss 2.6999): 3%|β | 18/625 [00:36<14:57, 1.48s/it]
Training 1/1 epoch (loss 2.8074): 3%|β | 18/625 [00:37<14:57, 1.48s/it]
Training 1/1 epoch (loss 2.8074): 3%|β | 19/625 [00:37<14:22, 1.42s/it]
Training 1/1 epoch (loss 2.8524): 3%|β | 19/625 [00:38<14:22, 1.42s/it]
Training 1/1 epoch (loss 2.8524): 3%|β | 20/625 [00:38<13:50, 1.37s/it]
Training 1/1 epoch (loss 2.5800): 3%|β | 20/625 [00:40<13:50, 1.37s/it]
Training 1/1 epoch (loss 2.5800): 3%|β | 21/625 [00:40<13:30, 1.34s/it]
Training 1/1 epoch (loss 2.7558): 3%|β | 21/625 [00:41<13:30, 1.34s/it]
Training 1/1 epoch (loss 2.7558): 4%|β | 22/625 [00:41<13:04, 1.30s/it]
Training 1/1 epoch (loss 2.6838): 4%|β | 22/625 [00:43<13:04, 1.30s/it]
Training 1/1 epoch (loss 2.6838): 4%|β | 23/625 [00:43<16:19, 1.63s/it]
Training 1/1 epoch (loss 2.7209): 4%|β | 23/625 [00:44<16:19, 1.63s/it]
Training 1/1 epoch (loss 2.7209): 4%|β | 24/625 [00:44<14:18, 1.43s/it]
Training 1/1 epoch (loss 2.4768): 4%|β | 24/625 [00:46<14:18, 1.43s/it]
Training 1/1 epoch (loss 2.4768): 4%|β | 25/625 [00:46<16:21, 1.64s/it]
Training 1/1 epoch (loss 2.6394): 4%|β | 25/625 [00:48<16:21, 1.64s/it]
Training 1/1 epoch (loss 2.6394): 4%|β | 26/625 [00:48<16:31, 1.65s/it]
Training 1/1 epoch (loss 2.3664): 4%|β | 26/625 [00:50<16:31, 1.65s/it]
Training 1/1 epoch (loss 2.3664): 4%|β | 27/625 [00:50<15:56, 1.60s/it]
Training 1/1 epoch (loss 2.7167): 4%|β | 27/625 [00:51<15:56, 1.60s/it]
Training 1/1 epoch (loss 2.7167): 4%|β | 28/625 [00:51<17:03, 1.71s/it]
Training 1/1 epoch (loss 2.5326): 4%|β | 28/625 [00:52<17:03, 1.71s/it]
Training 1/1 epoch (loss 2.5326): 5%|β | 29/625 [00:52<13:59, 1.41s/it]
Training 1/1 epoch (loss 2.7069): 5%|β | 29/625 [00:54<13:59, 1.41s/it]
Training 1/1 epoch (loss 2.7069): 5%|β | 30/625 [00:54<15:12, 1.53s/it]
Training 1/1 epoch (loss 2.4367): 5%|β | 30/625 [00:56<15:12, 1.53s/it]
Training 1/1 epoch (loss 2.4367): 5%|β | 31/625 [00:56<15:45, 1.59s/it]
Training 1/1 epoch (loss 2.7917): 5%|β | 31/625 [00:56<15:45, 1.59s/it]
Training 1/1 epoch (loss 2.7917): 5%|β | 32/625 [00:56<13:08, 1.33s/it]
Training 1/1 epoch (loss 2.8058): 5%|β | 32/625 [00:58<13:08, 1.33s/it]
Training 1/1 epoch (loss 2.8058): 5%|β | 33/625 [00:58<13:41, 1.39s/it]
Training 1/1 epoch (loss 2.6292): 5%|β | 33/625 [00:59<13:41, 1.39s/it]
Training 1/1 epoch (loss 2.6292): 5%|β | 34/625 [00:59<13:19, 1.35s/it]
Training 1/1 epoch (loss 2.5362): 5%|β | 34/625 [01:00<13:19, 1.35s/it]
Training 1/1 epoch (loss 2.5362): 6%|β | 35/625 [01:00<12:55, 1.32s/it]
Training 1/1 epoch (loss 2.5890): 6%|β | 35/625 [01:03<12:55, 1.32s/it]
Training 1/1 epoch (loss 2.5890): 6%|β | 36/625 [01:03<16:05, 1.64s/it]
Training 1/1 epoch (loss 2.8380): 6%|β | 36/625 [01:04<16:05, 1.64s/it]
Training 1/1 epoch (loss 2.8380): 6%|β | 37/625 [01:04<15:20, 1.57s/it]
Training 1/1 epoch (loss 2.5890): 6%|β | 37/625 [01:05<15:20, 1.57s/it]
Training 1/1 epoch (loss 2.5890): 6%|β | 38/625 [01:05<12:58, 1.33s/it]
Training 1/1 epoch (loss 2.8955): 6%|β | 38/625 [01:07<12:58, 1.33s/it]
Training 1/1 epoch (loss 2.8955): 6%|β | 39/625 [01:07<14:33, 1.49s/it]
Training 1/1 epoch (loss 2.7131): 6%|β | 39/625 [01:08<14:33, 1.49s/it]
Training 1/1 epoch (loss 2.7131): 6%|β | 40/625 [01:08<13:37, 1.40s/it]
Training 1/1 epoch (loss 2.8202): 6%|β | 40/625 [01:10<13:37, 1.40s/it]
Training 1/1 epoch (loss 2.8202): 7%|β | 41/625 [01:10<14:58, 1.54s/it]
Training 1/1 epoch (loss 2.8835): 7%|β | 41/625 [01:11<14:58, 1.54s/it]
Training 1/1 epoch (loss 2.8835): 7%|β | 42/625 [01:11<14:38, 1.51s/it]
Training 1/1 epoch (loss 2.7267): 7%|β | 42/625 [01:12<14:38, 1.51s/it]
Training 1/1 epoch (loss 2.7267): 7%|β | 43/625 [01:12<11:25, 1.18s/it]
Training 1/1 epoch (loss 2.6761): 7%|β | 43/625 [01:13<11:25, 1.18s/it]
Training 1/1 epoch (loss 2.6761): 7%|β | 44/625 [01:13<12:33, 1.30s/it]
Training 1/1 epoch (loss 2.7755): 7%|β | 44/625 [01:15<12:33, 1.30s/it]
Training 1/1 epoch (loss 2.7755): 7%|β | 45/625 [01:15<13:57, 1.44s/it]
Training 1/1 epoch (loss 2.6753): 7%|β | 45/625 [01:16<13:57, 1.44s/it]
Training 1/1 epoch (loss 2.6753): 7%|β | 46/625 [01:16<11:21, 1.18s/it]
Training 1/1 epoch (loss 2.5320): 7%|β | 46/625 [01:18<11:21, 1.18s/it]
Training 1/1 epoch (loss 2.5320): 8%|β | 47/625 [01:18<13:52, 1.44s/it]
Training 1/1 epoch (loss 2.5086): 8%|β | 47/625 [01:19<13:52, 1.44s/it]
Training 1/1 epoch (loss 2.5086): 8%|β | 48/625 [01:19<13:21, 1.39s/it]
Training 1/1 epoch (loss 2.8480): 8%|β | 48/625 [01:20<13:21, 1.39s/it]
Training 1/1 epoch (loss 2.8480): 8%|β | 49/625 [01:20<13:15, 1.38s/it]
Training 1/1 epoch (loss 3.0284): 8%|β | 49/625 [01:22<13:15, 1.38s/it]
Training 1/1 epoch (loss 3.0284): 8%|β | 50/625 [01:22<13:58, 1.46s/it]
Training 1/1 epoch (loss 2.7826): 8%|β | 50/625 [01:23<13:58, 1.46s/it]
Training 1/1 epoch (loss 2.7826): 8%|β | 51/625 [01:23<12:06, 1.27s/it]
Training 1/1 epoch (loss 2.7458): 8%|β | 51/625 [01:25<12:06, 1.27s/it]
Training 1/1 epoch (loss 2.7458): 8%|β | 52/625 [01:25<15:06, 1.58s/it]
Training 1/1 epoch (loss 2.5782): 8%|β | 52/625 [01:27<15:06, 1.58s/it]
Training 1/1 epoch (loss 2.5782): 8%|β | 53/625 [01:27<14:34, 1.53s/it]
Training 1/1 epoch (loss 2.4045): 8%|β | 53/625 [01:27<14:34, 1.53s/it]
Training 1/1 epoch (loss 2.4045): 9%|β | 54/625 [01:27<12:13, 1.28s/it]
Training 1/1 epoch (loss 2.7853): 9%|β | 54/625 [01:29<12:13, 1.28s/it]
Training 1/1 epoch (loss 2.7853): 9%|β | 55/625 [01:29<13:46, 1.45s/it]
Training 1/1 epoch (loss 2.6285): 9%|β | 55/625 [01:31<13:46, 1.45s/it]
Training 1/1 epoch (loss 2.6285): 9%|β | 56/625 [01:31<15:02, 1.59s/it]
Training 1/1 epoch (loss 2.6183): 9%|β | 56/625 [01:32<15:02, 1.59s/it]
Training 1/1 epoch (loss 2.6183): 9%|β | 57/625 [01:32<12:37, 1.33s/it]
Training 1/1 epoch (loss 2.6726): 9%|β | 57/625 [01:33<12:37, 1.33s/it]
Training 1/1 epoch (loss 2.6726): 9%|β | 58/625 [01:33<13:31, 1.43s/it]
Training 1/1 epoch (loss 2.6436): 9%|β | 58/625 [01:35<13:31, 1.43s/it]
Training 1/1 epoch (loss 2.6436): 9%|β | 59/625 [01:35<14:20, 1.52s/it]
Training 1/1 epoch (loss 2.6157): 9%|β | 59/625 [01:36<14:20, 1.52s/it]
Training 1/1 epoch (loss 2.6157): 10%|β | 60/625 [01:36<12:30, 1.33s/it]
Training 1/1 epoch (loss 2.5594): 10%|β | 60/625 [01:39<12:30, 1.33s/it]
Training 1/1 epoch (loss 2.5594): 10%|β | 61/625 [01:39<15:45, 1.68s/it]
Training 1/1 epoch (loss 2.7368): 10%|β | 61/625 [01:40<15:45, 1.68s/it]
Training 1/1 epoch (loss 2.7368): 10%|β | 62/625 [01:40<15:36, 1.66s/it]
Training 1/1 epoch (loss 2.6734): 10%|β | 62/625 [01:41<15:36, 1.66s/it]
Training 1/1 epoch (loss 2.6734): 10%|β | 63/625 [01:41<13:21, 1.43s/it]
Training 1/1 epoch (loss 2.5592): 10%|β | 63/625 [01:43<13:21, 1.43s/it]
Training 1/1 epoch (loss 2.5592): 10%|β | 64/625 [01:43<14:53, 1.59s/it]
Training 1/1 epoch (loss 2.5592): 10%|β | 64/625 [01:44<14:53, 1.59s/it]
Training 1/1 epoch (loss 2.5592): 10%|β | 65/625 [01:44<12:04, 1.29s/it]
Training 1/1 epoch (loss 2.7259): 10%|β | 65/625 [01:45<12:04, 1.29s/it]
Training 1/1 epoch (loss 2.7259): 11%|β | 66/625 [01:45<13:18, 1.43s/it]
Training 1/1 epoch (loss 2.6960): 11%|β | 66/625 [01:47<13:18, 1.43s/it]
Training 1/1 epoch (loss 2.6960): 11%|β | 67/625 [01:47<13:29, 1.45s/it]
Training 1/1 epoch (loss 2.5908): 11%|β | 67/625 [01:48<13:29, 1.45s/it]
Training 1/1 epoch (loss 2.5908): 11%|β | 68/625 [01:48<11:22, 1.22s/it]
Training 1/1 epoch (loss 2.8130): 11%|β | 68/625 [01:49<11:22, 1.22s/it]
Training 1/1 epoch (loss 2.8130): 11%|β | 69/625 [01:49<11:26, 1.23s/it]
Training 1/1 epoch (loss 2.5569): 11%|β | 69/625 [01:51<11:26, 1.23s/it]
Training 1/1 epoch (loss 2.5569): 11%|β | 70/625 [01:51<12:59, 1.40s/it]
Training 1/1 epoch (loss 2.6406): 11%|β | 70/625 [01:51<12:59, 1.40s/it]
Training 1/1 epoch (loss 2.6406): 11%|ββ | 71/625 [01:51<11:05, 1.20s/it]
Training 1/1 epoch (loss 2.5109): 11%|ββ | 71/625 [01:54<11:05, 1.20s/it]
Training 1/1 epoch (loss 2.5109): 12%|ββ | 72/625 [01:54<13:43, 1.49s/it]
Training 1/1 epoch (loss 2.5735): 12%|ββ | 72/625 [01:55<13:43, 1.49s/it]
Training 1/1 epoch (loss 2.5735): 12%|ββ | 73/625 [01:55<14:21, 1.56s/it]
Training 1/1 epoch (loss 2.7697): 12%|ββ | 73/625 [01:56<14:21, 1.56s/it]
Training 1/1 epoch (loss 2.7697): 12%|ββ | 74/625 [01:56<12:11, 1.33s/it]
Training 1/1 epoch (loss 2.3873): 12%|ββ | 74/625 [01:58<12:11, 1.33s/it]
Training 1/1 epoch (loss 2.3873): 12%|ββ | 75/625 [01:58<15:06, 1.65s/it]
Training 1/1 epoch (loss 2.5503): 12%|ββ | 75/625 [01:59<15:06, 1.65s/it]
Training 1/1 epoch (loss 2.5503): 12%|ββ | 76/625 [01:59<13:01, 1.42s/it]
Training 1/1 epoch (loss 2.5729): 12%|ββ | 76/625 [02:01<13:01, 1.42s/it]
Training 1/1 epoch (loss 2.5729): 12%|ββ | 77/625 [02:01<13:12, 1.45s/it]
Training 1/1 epoch (loss 2.6222): 12%|ββ | 77/625 [02:03<13:12, 1.45s/it]
Training 1/1 epoch (loss 2.6222): 12%|ββ | 78/625 [02:03<15:42, 1.72s/it]
Training 1/1 epoch (loss 2.7347): 12%|ββ | 78/625 [02:04<15:42, 1.72s/it]
Training 1/1 epoch (loss 2.7347): 13%|ββ | 79/625 [02:04<12:32, 1.38s/it]
Training 1/1 epoch (loss 2.6833): 13%|ββ | 79/625 [02:06<12:32, 1.38s/it]
Training 1/1 epoch (loss 2.6833): 13%|ββ | 80/625 [02:06<14:35, 1.61s/it]
Training 1/1 epoch (loss 2.7327): 13%|ββ | 80/625 [02:08<14:35, 1.61s/it]
Training 1/1 epoch (loss 2.7327): 13%|ββ | 81/625 [02:08<16:51, 1.86s/it]
Training 1/1 epoch (loss 2.6278): 13%|ββ | 81/625 [02:09<16:51, 1.86s/it]
Training 1/1 epoch (loss 2.6278): 13%|ββ | 82/625 [02:09<14:02, 1.55s/it]
Training 1/1 epoch (loss 2.5696): 13%|ββ | 82/625 [02:11<14:02, 1.55s/it]
Training 1/1 epoch (loss 2.5696): 13%|ββ | 83/625 [02:11<14:34, 1.61s/it]
Training 1/1 epoch (loss 2.3577): 13%|ββ | 83/625 [02:12<14:34, 1.61s/it]
Training 1/1 epoch (loss 2.3577): 13%|ββ | 84/625 [02:12<13:30, 1.50s/it]
Training 1/1 epoch (loss 2.7355): 13%|ββ | 84/625 [02:13<13:30, 1.50s/it]
Training 1/1 epoch (loss 2.7355): 14%|ββ | 85/625 [02:13<12:53, 1.43s/it]
Training 1/1 epoch (loss 2.6788): 14%|ββ | 85/625 [02:15<12:53, 1.43s/it]
Training 1/1 epoch (loss 2.6788): 14%|ββ | 86/625 [02:15<12:33, 1.40s/it]
Training 1/1 epoch (loss 2.7280): 14%|ββ | 86/625 [02:16<12:33, 1.40s/it]
Training 1/1 epoch (loss 2.7280): 14%|ββ | 87/625 [02:16<10:55, 1.22s/it]
Training 1/1 epoch (loss 2.7886): 14%|ββ | 87/625 [02:16<10:55, 1.22s/it]
Training 1/1 epoch (loss 2.7886): 14%|ββ | 88/625 [02:16<09:47, 1.09s/it]
Training 1/1 epoch (loss 2.7374): 14%|ββ | 88/625 [02:18<09:47, 1.09s/it]
Training 1/1 epoch (loss 2.7374): 14%|ββ | 89/625 [02:18<11:49, 1.32s/it]
Training 1/1 epoch (loss 2.7939): 14%|ββ | 89/625 [02:19<11:49, 1.32s/it]
Training 1/1 epoch (loss 2.7939): 14%|ββ | 90/625 [02:19<10:52, 1.22s/it]
Training 1/1 epoch (loss 2.5396): 14%|ββ | 90/625 [02:21<10:52, 1.22s/it]
Training 1/1 epoch (loss 2.5396): 15%|ββ | 91/625 [02:21<12:42, 1.43s/it]
Training 1/1 epoch (loss 2.6463): 15%|ββ | 91/625 [02:23<12:42, 1.43s/it]
Training 1/1 epoch (loss 2.6463): 15%|ββ | 92/625 [02:23<13:48, 1.55s/it]
Training 1/1 epoch (loss 2.6070): 15%|ββ | 92/625 [02:24<13:48, 1.55s/it]
Training 1/1 epoch (loss 2.6070): 15%|ββ | 93/625 [02:24<11:08, 1.26s/it]
Training 1/1 epoch (loss 2.7010): 15%|ββ | 93/625 [02:25<11:08, 1.26s/it]
Training 1/1 epoch (loss 2.7010): 15%|ββ | 94/625 [02:25<12:55, 1.46s/it]
Training 1/1 epoch (loss 2.8116): 15%|ββ | 94/625 [02:27<12:55, 1.46s/it]
Training 1/1 epoch (loss 2.8116): 15%|ββ | 95/625 [02:27<13:06, 1.48s/it]
Training 1/1 epoch (loss 2.4975): 15%|ββ | 95/625 [02:28<13:06, 1.48s/it]
Training 1/1 epoch (loss 2.4975): 15%|ββ | 96/625 [02:28<11:24, 1.29s/it]
Training 1/1 epoch (loss 2.6418): 15%|ββ | 96/625 [02:30<11:24, 1.29s/it]
Training 1/1 epoch (loss 2.6418): 16%|ββ | 97/625 [02:30<13:06, 1.49s/it]
Training 1/1 epoch (loss 2.5772): 16%|ββ | 97/625 [02:31<13:06, 1.49s/it]
Training 1/1 epoch (loss 2.5772): 16%|ββ | 98/625 [02:31<11:12, 1.28s/it]
Training 1/1 epoch (loss 2.7551): 16%|ββ | 98/625 [02:33<11:12, 1.28s/it]
Training 1/1 epoch (loss 2.7551): 16%|ββ | 99/625 [02:33<14:10, 1.62s/it]
Training 1/1 epoch (loss 2.6961): 16%|ββ | 99/625 [02:34<14:10, 1.62s/it]
Training 1/1 epoch (loss 2.6961): 16%|ββ | 100/625 [02:34<13:19, 1.52s/it]
Training 1/1 epoch (loss 2.5780): 16%|ββ | 100/625 [02:35<13:19, 1.52s/it]
Training 1/1 epoch (loss 2.5780): 16%|ββ | 101/625 [02:35<11:13, 1.29s/it]
Training 1/1 epoch (loss 2.6321): 16%|ββ | 101/625 [02:37<11:13, 1.29s/it]
Training 1/1 epoch (loss 2.6321): 16%|ββ | 102/625 [02:37<12:50, 1.47s/it]
Training 1/1 epoch (loss 2.6405): 16%|ββ | 102/625 [02:39<12:50, 1.47s/it]
Training 1/1 epoch (loss 2.6405): 16%|ββ | 103/625 [02:39<14:08, 1.63s/it]
Training 1/1 epoch (loss 2.6105): 16%|ββ | 103/625 [02:40<14:08, 1.63s/it]
Training 1/1 epoch (loss 2.6105): 17%|ββ | 104/625 [02:40<11:42, 1.35s/it]
Training 1/1 epoch (loss 2.6372): 17%|ββ | 104/625 [02:42<11:42, 1.35s/it]
Training 1/1 epoch (loss 2.6372): 17%|ββ | 105/625 [02:42<13:21, 1.54s/it]
Training 1/1 epoch (loss 2.4548): 17%|ββ | 105/625 [02:43<13:21, 1.54s/it]
Training 1/1 epoch (loss 2.4548): 17%|ββ | 106/625 [02:43<12:08, 1.40s/it]
Training 1/1 epoch (loss 2.9583): 17%|ββ | 106/625 [02:44<12:08, 1.40s/it]
Training 1/1 epoch (loss 2.9583): 17%|ββ | 107/625 [02:44<11:09, 1.29s/it]
Training 1/1 epoch (loss 2.6898): 17%|ββ | 107/625 [02:46<11:09, 1.29s/it]
Training 1/1 epoch (loss 2.6898): 17%|ββ | 108/625 [02:46<14:09, 1.64s/it]
Training 1/1 epoch (loss 2.7766): 17%|ββ | 108/625 [02:48<14:09, 1.64s/it]
Training 1/1 epoch (loss 2.7766): 17%|ββ | 109/625 [02:48<14:10, 1.65s/it]
Training 1/1 epoch (loss 2.8715): 17%|ββ | 109/625 [02:48<14:10, 1.65s/it]
Training 1/1 epoch (loss 2.8715): 18%|ββ | 110/625 [02:48<11:12, 1.31s/it]
Training 1/1 epoch (loss 2.7501): 18%|ββ | 110/625 [02:50<11:12, 1.31s/it]
Training 1/1 epoch (loss 2.7501): 18%|ββ | 111/625 [02:50<12:01, 1.40s/it]
Training 1/1 epoch (loss 2.7436): 18%|ββ | 111/625 [02:52<12:01, 1.40s/it]
Training 1/1 epoch (loss 2.7436): 18%|ββ | 112/625 [02:52<14:00, 1.64s/it]
Training 1/1 epoch (loss 2.5454): 18%|ββ | 112/625 [02:53<14:00, 1.64s/it]
Training 1/1 epoch (loss 2.5454): 18%|ββ | 113/625 [02:53<12:20, 1.45s/it]
Training 1/1 epoch (loss 2.6927): 18%|ββ | 113/625 [02:55<12:20, 1.45s/it]
Training 1/1 epoch (loss 2.6927): 18%|ββ | 114/625 [02:55<12:40, 1.49s/it]
Training 1/1 epoch (loss 2.7644): 18%|ββ | 114/625 [02:56<12:40, 1.49s/it]
Training 1/1 epoch (loss 2.7644): 18%|ββ | 115/625 [02:56<12:14, 1.44s/it]
Training 1/1 epoch (loss 2.8205): 18%|ββ | 115/625 [02:57<12:14, 1.44s/it]
Training 1/1 epoch (loss 2.8205): 19%|ββ | 116/625 [02:57<11:15, 1.33s/it]
Training 1/1 epoch (loss 2.7826): 19%|ββ | 116/625 [02:59<11:15, 1.33s/it]
Training 1/1 epoch (loss 2.7826): 19%|ββ | 117/625 [02:59<13:17, 1.57s/it]
Training 1/1 epoch (loss 2.3700): 19%|ββ | 117/625 [03:00<13:17, 1.57s/it]
Training 1/1 epoch (loss 2.3700): 19%|ββ | 118/625 [03:00<12:18, 1.46s/it]
Training 1/1 epoch (loss 2.4724): 19%|ββ | 118/625 [03:03<12:18, 1.46s/it]
Training 1/1 epoch (loss 2.4724): 19%|ββ | 119/625 [03:03<14:25, 1.71s/it]
Training 1/1 epoch (loss 2.5321): 19%|ββ | 119/625 [03:04<14:25, 1.71s/it]
Training 1/1 epoch (loss 2.5321): 19%|ββ | 120/625 [03:04<13:48, 1.64s/it]
Training 1/1 epoch (loss 2.5152): 19%|ββ | 120/625 [03:05<13:48, 1.64s/it]
Training 1/1 epoch (loss 2.5152): 19%|ββ | 121/625 [03:05<10:58, 1.31s/it]
Training 1/1 epoch (loss 2.5335): 19%|ββ | 121/625 [03:07<10:58, 1.31s/it]
Training 1/1 epoch (loss 2.5335): 20%|ββ | 122/625 [03:07<13:04, 1.56s/it]
Training 1/1 epoch (loss 2.4755): 20%|ββ | 122/625 [03:09<13:04, 1.56s/it]
Training 1/1 epoch (loss 2.4755): 20%|ββ | 123/625 [03:09<13:36, 1.63s/it]
Training 1/1 epoch (loss 2.7318): 20%|ββ | 123/625 [03:09<13:36, 1.63s/it]
Training 1/1 epoch (loss 2.7318): 20%|ββ | 124/625 [03:09<11:03, 1.33s/it]
Training 1/1 epoch (loss 2.5409): 20%|ββ | 124/625 [03:11<11:03, 1.33s/it]
Training 1/1 epoch (loss 2.5409): 20%|ββ | 125/625 [03:11<10:37, 1.27s/it]
Training 1/1 epoch (loss 2.7779): 20%|ββ | 125/625 [03:12<10:37, 1.27s/it]
Training 1/1 epoch (loss 2.7779): 20%|ββ | 126/625 [03:12<11:07, 1.34s/it]
Training 1/1 epoch (loss 2.7018): 20%|ββ | 126/625 [03:13<11:07, 1.34s/it]
Training 1/1 epoch (loss 2.7018): 20%|ββ | 127/625 [03:13<09:58, 1.20s/it]
Training 1/1 epoch (loss 2.5727): 20%|ββ | 127/625 [03:15<09:58, 1.20s/it]
Training 1/1 epoch (loss 2.5727): 20%|ββ | 128/625 [03:15<11:34, 1.40s/it]
Training 1/1 epoch (loss 2.4624): 20%|ββ | 128/625 [03:16<11:34, 1.40s/it]
Training 1/1 epoch (loss 2.4624): 21%|ββ | 129/625 [03:16<10:59, 1.33s/it]
Training 1/1 epoch (loss 2.6087): 21%|ββ | 129/625 [03:18<10:59, 1.33s/it]
Training 1/1 epoch (loss 2.6087): 21%|ββ | 130/625 [03:18<13:08, 1.59s/it]
Training 1/1 epoch (loss 2.4374): 21%|ββ | 130/625 [03:20<13:08, 1.59s/it]
Training 1/1 epoch (loss 2.4374): 21%|ββ | 131/625 [03:20<12:53, 1.56s/it]
Training 1/1 epoch (loss 2.7708): 21%|ββ | 131/625 [03:20<12:53, 1.56s/it]
Training 1/1 epoch (loss 2.7708): 21%|ββ | 132/625 [03:20<10:14, 1.25s/it]
Training 1/1 epoch (loss 2.6460): 21%|ββ | 132/625 [03:23<10:14, 1.25s/it]
Training 1/1 epoch (loss 2.6460): 21%|βββ | 133/625 [03:23<13:08, 1.60s/it]
Training 1/1 epoch (loss 2.5751): 21%|βββ | 133/625 [03:24<13:08, 1.60s/it]
Training 1/1 epoch (loss 2.5751): 21%|βββ | 134/625 [03:24<13:25, 1.64s/it]
Training 1/1 epoch (loss 2.6916): 21%|βββ | 134/625 [03:25<13:25, 1.64s/it]
Training 1/1 epoch (loss 2.6916): 22%|βββ | 135/625 [03:25<11:17, 1.38s/it]
Training 1/1 epoch (loss 2.7079): 22%|βββ | 135/625 [03:27<11:17, 1.38s/it]
Training 1/1 epoch (loss 2.7079): 22%|βββ | 136/625 [03:27<12:08, 1.49s/it]
Training 1/1 epoch (loss 2.5144): 22%|βββ | 136/625 [03:29<12:08, 1.49s/it]
Training 1/1 epoch (loss 2.5144): 22%|βββ | 137/625 [03:29<13:26, 1.65s/it]
Training 1/1 epoch (loss 2.6894): 22%|βββ | 137/625 [03:30<13:26, 1.65s/it]
Training 1/1 epoch (loss 2.6894): 22%|βββ | 138/625 [03:30<11:30, 1.42s/it]
Training 1/1 epoch (loss 2.7266): 22%|βββ | 138/625 [03:31<11:30, 1.42s/it]
Training 1/1 epoch (loss 2.7266): 22%|βββ | 139/625 [03:31<11:27, 1.42s/it]
Training 1/1 epoch (loss 2.4824): 22%|βββ | 139/625 [03:32<11:27, 1.42s/it]
Training 1/1 epoch (loss 2.4824): 22%|βββ | 140/625 [03:32<10:19, 1.28s/it]
Training 1/1 epoch (loss 2.4970): 22%|βββ | 140/625 [03:34<10:19, 1.28s/it]
Training 1/1 epoch (loss 2.4970): 23%|βββ | 141/625 [03:34<10:53, 1.35s/it]
Training 1/1 epoch (loss 2.5923): 23%|βββ | 141/625 [03:36<10:53, 1.35s/it]
Training 1/1 epoch (loss 2.5923): 23%|βββ | 142/625 [03:36<12:15, 1.52s/it]
Training 1/1 epoch (loss 2.5751): 23%|βββ | 142/625 [03:36<12:15, 1.52s/it]
Training 1/1 epoch (loss 2.5751): 23%|βββ | 143/625 [03:36<10:41, 1.33s/it]
Training 1/1 epoch (loss 2.6946): 23%|βββ | 143/625 [03:39<10:41, 1.33s/it]
Training 1/1 epoch (loss 2.6946): 23%|βββ | 144/625 [03:39<13:13, 1.65s/it]
Training 1/1 epoch (loss 2.8361): 23%|βββ | 144/625 [03:41<13:13, 1.65s/it]
Training 1/1 epoch (loss 2.8361): 23%|βββ | 145/625 [03:41<14:06, 1.76s/it]
Training 1/1 epoch (loss 2.6740): 23%|βββ | 145/625 [03:41<14:06, 1.76s/it]
Training 1/1 epoch (loss 2.6740): 23%|βββ | 146/625 [03:41<11:11, 1.40s/it]
Training 1/1 epoch (loss 2.5470): 23%|βββ | 146/625 [03:43<11:11, 1.40s/it]
Training 1/1 epoch (loss 2.5470): 24%|βββ | 147/625 [03:43<10:49, 1.36s/it]
Training 1/1 epoch (loss 2.4878): 24%|βββ | 147/625 [03:44<10:49, 1.36s/it]
Training 1/1 epoch (loss 2.4878): 24%|βββ | 148/625 [03:44<11:46, 1.48s/it]
Training 1/1 epoch (loss 2.7065): 24%|βββ | 148/625 [03:45<11:46, 1.48s/it]
Training 1/1 epoch (loss 2.7065): 24%|βββ | 149/625 [03:45<09:17, 1.17s/it]
Training 1/1 epoch (loss 2.6415): 24%|βββ | 149/625 [03:47<09:17, 1.17s/it]
Training 1/1 epoch (loss 2.6415): 24%|βββ | 150/625 [03:47<11:15, 1.42s/it]
Training 1/1 epoch (loss 2.6830): 24%|βββ | 150/625 [03:48<11:15, 1.42s/it]
Training 1/1 epoch (loss 2.6830): 24%|βββ | 151/625 [03:48<11:13, 1.42s/it]
Training 1/1 epoch (loss 2.6356): 24%|βββ | 151/625 [03:50<11:13, 1.42s/it]
Training 1/1 epoch (loss 2.6356): 24%|βββ | 152/625 [03:50<11:17, 1.43s/it]
Training 1/1 epoch (loss 2.6546): 24%|βββ | 152/625 [03:52<11:17, 1.43s/it]
Training 1/1 epoch (loss 2.6546): 24%|βββ | 153/625 [03:52<13:14, 1.68s/it]
Training 1/1 epoch (loss 2.6836): 24%|βββ | 153/625 [03:53<13:14, 1.68s/it]
Training 1/1 epoch (loss 2.6836): 25%|βββ | 154/625 [03:53<11:54, 1.52s/it]
Training 1/1 epoch (loss 2.6786): 25%|βββ | 154/625 [03:55<11:54, 1.52s/it]
Training 1/1 epoch (loss 2.6786): 25%|βββ | 155/625 [03:55<11:36, 1.48s/it]
Training 1/1 epoch (loss 2.5549): 25%|βββ | 155/625 [03:56<11:36, 1.48s/it]
Training 1/1 epoch (loss 2.5549): 25%|βββ | 156/625 [03:56<11:17, 1.44s/it]
Training 1/1 epoch (loss 2.8755): 25%|βββ | 156/625 [03:56<11:17, 1.44s/it]
Training 1/1 epoch (loss 2.8755): 25%|βββ | 157/625 [03:56<08:52, 1.14s/it]
Training 1/1 epoch (loss 2.6122): 25%|βββ | 157/625 [03:58<08:52, 1.14s/it]
Training 1/1 epoch (loss 2.6122): 25%|βββ | 158/625 [03:58<09:00, 1.16s/it]
Training 1/1 epoch (loss 2.6248): 25%|βββ | 158/625 [03:59<09:00, 1.16s/it]
Training 1/1 epoch (loss 2.6248): 25%|βββ | 159/625 [03:59<10:11, 1.31s/it]
Training 1/1 epoch (loss 2.4765): 25%|βββ | 159/625 [04:00<10:11, 1.31s/it]
Training 1/1 epoch (loss 2.4765): 26%|βββ | 160/625 [04:00<08:56, 1.15s/it]
Training 1/1 epoch (loss 2.6742): 26%|βββ | 160/625 [04:01<08:56, 1.15s/it]
Training 1/1 epoch (loss 2.6742): 26%|βββ | 161/625 [04:01<09:23, 1.21s/it]
Training 1/1 epoch (loss 2.6501): 26%|βββ | 161/625 [04:03<09:23, 1.21s/it]
Training 1/1 epoch (loss 2.6501): 26%|βββ | 162/625 [04:03<09:32, 1.24s/it]
Training 1/1 epoch (loss 2.7158): 26%|βββ | 162/625 [04:04<09:32, 1.24s/it]
Training 1/1 epoch (loss 2.7158): 26%|βββ | 163/625 [04:04<10:11, 1.32s/it]
Training 1/1 epoch (loss 2.6408): 26%|βββ | 163/625 [04:06<10:11, 1.32s/it]
Training 1/1 epoch (loss 2.6408): 26%|βββ | 164/625 [04:06<11:00, 1.43s/it]
Training 1/1 epoch (loss 2.6153): 26%|βββ | 164/625 [04:07<11:00, 1.43s/it]
Training 1/1 epoch (loss 2.6153): 26%|βββ | 165/625 [04:07<09:43, 1.27s/it]
Training 1/1 epoch (loss 2.5297): 26%|βββ | 165/625 [04:08<09:43, 1.27s/it]
Training 1/1 epoch (loss 2.5297): 27%|βββ | 166/625 [04:08<10:12, 1.33s/it]
Training 1/1 epoch (loss 2.4681): 27%|βββ | 166/625 [04:09<10:12, 1.33s/it]
Training 1/1 epoch (loss 2.4681): 27%|βββ | 167/625 [04:09<09:54, 1.30s/it]
Training 1/1 epoch (loss 2.5752): 27%|βββ | 167/625 [04:10<09:54, 1.30s/it]
Training 1/1 epoch (loss 2.5752): 27%|βββ | 168/625 [04:10<08:37, 1.13s/it]
Training 1/1 epoch (loss 2.3571): 27%|βββ | 168/625 [04:12<08:37, 1.13s/it]
Training 1/1 epoch (loss 2.3571): 27%|βββ | 169/625 [04:12<09:42, 1.28s/it]
Training 1/1 epoch (loss 2.6415): 27%|βββ | 169/625 [04:14<09:42, 1.28s/it]
Training 1/1 epoch (loss 2.6415): 27%|βββ | 170/625 [04:14<12:09, 1.60s/it]
Training 1/1 epoch (loss 2.5761): 27%|βββ | 170/625 [04:15<12:09, 1.60s/it]
Training 1/1 epoch (loss 2.5761): 27%|βββ | 171/625 [04:15<09:57, 1.32s/it]
Training 1/1 epoch (loss 2.7469): 27%|βββ | 171/625 [04:17<09:57, 1.32s/it]
Training 1/1 epoch (loss 2.7469): 28%|βββ | 172/625 [04:17<12:16, 1.62s/it]
Training 1/1 epoch (loss 2.5196): 28%|βββ | 172/625 [04:19<12:16, 1.62s/it]
Training 1/1 epoch (loss 2.5196): 28%|βββ | 173/625 [04:19<11:48, 1.57s/it]
Training 1/1 epoch (loss 2.4005): 28%|βββ | 173/625 [04:19<11:48, 1.57s/it]
Training 1/1 epoch (loss 2.4005): 28%|βββ | 174/625 [04:19<09:16, 1.23s/it]
Training 1/1 epoch (loss 2.7121): 28%|βββ | 174/625 [04:21<09:16, 1.23s/it]
Training 1/1 epoch (loss 2.7121): 28%|βββ | 175/625 [04:21<10:55, 1.46s/it]
Training 1/1 epoch (loss 2.4954): 28%|βββ | 175/625 [04:23<10:55, 1.46s/it]
Training 1/1 epoch (loss 2.4954): 28%|βββ | 176/625 [04:23<13:07, 1.75s/it]
Training 1/1 epoch (loss 2.6247): 28%|βββ | 176/625 [04:24<13:07, 1.75s/it]
Training 1/1 epoch (loss 2.6247): 28%|βββ | 177/625 [04:24<10:50, 1.45s/it]
Training 1/1 epoch (loss 2.5376): 28%|βββ | 177/625 [04:26<10:50, 1.45s/it]
Training 1/1 epoch (loss 2.5376): 28%|βββ | 178/625 [04:26<11:07, 1.49s/it]
Training 1/1 epoch (loss 2.5385): 28%|βββ | 178/625 [04:27<11:07, 1.49s/it]
Training 1/1 epoch (loss 2.5385): 29%|βββ | 179/625 [04:27<09:46, 1.32s/it]
Training 1/1 epoch (loss 2.5247): 29%|βββ | 179/625 [04:28<09:46, 1.32s/it]
Training 1/1 epoch (loss 2.5247): 29%|βββ | 180/625 [04:28<09:12, 1.24s/it]
Training 1/1 epoch (loss 2.6580): 29%|βββ | 180/625 [04:30<09:12, 1.24s/it]
Training 1/1 epoch (loss 2.6580): 29%|βββ | 181/625 [04:30<10:19, 1.39s/it]
Training 1/1 epoch (loss 2.6075): 29%|βββ | 181/625 [04:31<10:19, 1.39s/it]
Training 1/1 epoch (loss 2.6075): 29%|βββ | 182/625 [04:31<09:54, 1.34s/it]
Training 1/1 epoch (loss 2.7614): 29%|βββ | 182/625 [04:33<09:54, 1.34s/it]
Training 1/1 epoch (loss 2.7614): 29%|βββ | 183/625 [04:33<12:19, 1.67s/it]
Training 1/1 epoch (loss 2.5946): 29%|βββ | 183/625 [04:35<12:19, 1.67s/it]
Training 1/1 epoch (loss 2.5946): 29%|βββ | 184/625 [04:35<12:34, 1.71s/it]
Training 1/1 epoch (loss 2.4979): 29%|βββ | 184/625 [04:36<12:34, 1.71s/it]
Training 1/1 epoch (loss 2.4979): 30%|βββ | 185/625 [04:36<10:52, 1.48s/it]
Training 1/1 epoch (loss 2.4753): 30%|βββ | 185/625 [04:38<10:52, 1.48s/it]
Training 1/1 epoch (loss 2.4753): 30%|βββ | 186/625 [04:38<12:08, 1.66s/it]
Training 1/1 epoch (loss 2.7076): 30%|βββ | 186/625 [04:39<12:08, 1.66s/it]
Training 1/1 epoch (loss 2.7076): 30%|βββ | 187/625 [04:39<11:16, 1.54s/it]
Training 1/1 epoch (loss 2.4280): 30%|βββ | 187/625 [04:41<11:16, 1.54s/it]
Training 1/1 epoch (loss 2.4280): 30%|βββ | 188/625 [04:41<10:54, 1.50s/it]
Training 1/1 epoch (loss 2.5690): 30%|βββ | 188/625 [04:43<10:54, 1.50s/it]
Training 1/1 epoch (loss 2.5690): 30%|βββ | 189/625 [04:43<11:54, 1.64s/it]
Training 1/1 epoch (loss 2.7860): 30%|βββ | 189/625 [04:43<11:54, 1.64s/it]
Training 1/1 epoch (loss 2.7860): 30%|βββ | 190/625 [04:43<10:13, 1.41s/it]
Training 1/1 epoch (loss 2.6086): 30%|βββ | 190/625 [04:46<10:13, 1.41s/it]
Training 1/1 epoch (loss 2.6086): 31%|βββ | 191/625 [04:46<11:32, 1.60s/it]
Training 1/1 epoch (loss 2.6271): 31%|βββ | 191/625 [04:48<11:32, 1.60s/it]
Training 1/1 epoch (loss 2.6271): 31%|βββ | 192/625 [04:48<13:40, 1.90s/it]
Training 1/1 epoch (loss 2.4577): 31%|βββ | 192/625 [04:49<13:40, 1.90s/it]
Training 1/1 epoch (loss 2.4577): 31%|βββ | 193/625 [04:49<10:56, 1.52s/it]
Training 1/1 epoch (loss 2.5984): 31%|βββ | 193/625 [04:51<10:56, 1.52s/it]
Training 1/1 epoch (loss 2.5984): 31%|βββ | 194/625 [04:51<12:13, 1.70s/it]
Training 1/1 epoch (loss 2.4780): 31%|βββ | 194/625 [04:53<12:13, 1.70s/it]
Training 1/1 epoch (loss 2.4780): 31%|βββ | 195/625 [04:53<12:00, 1.67s/it]
Training 1/1 epoch (loss 2.8194): 31%|βββ | 195/625 [04:53<12:00, 1.67s/it]
Training 1/1 epoch (loss 2.8194): 31%|ββββ | 196/625 [04:53<09:24, 1.32s/it]
Training 1/1 epoch (loss 2.8102): 31%|ββββ | 196/625 [04:55<09:24, 1.32s/it]
Training 1/1 epoch (loss 2.8102): 32%|ββββ | 197/625 [04:55<11:51, 1.66s/it]
Training 1/1 epoch (loss 2.5674): 32%|ββββ | 197/625 [04:57<11:51, 1.66s/it]
Training 1/1 epoch (loss 2.5674): 32%|ββββ | 198/625 [04:57<11:52, 1.67s/it]
Training 1/1 epoch (loss 2.6109): 32%|ββββ | 198/625 [04:58<11:52, 1.67s/it]
Training 1/1 epoch (loss 2.6109): 32%|ββββ | 199/625 [04:58<09:43, 1.37s/it]
Training 1/1 epoch (loss 2.6418): 32%|ββββ | 199/625 [05:00<09:43, 1.37s/it]
Training 1/1 epoch (loss 2.6418): 32%|ββββ | 200/625 [05:00<10:24, 1.47s/it]
Training 1/1 epoch (loss 2.7416): 32%|ββββ | 200/625 [05:01<10:24, 1.47s/it]
Training 1/1 epoch (loss 2.7416): 32%|ββββ | 201/625 [05:01<10:11, 1.44s/it]
Training 1/1 epoch (loss 2.5150): 32%|ββββ | 201/625 [05:02<10:11, 1.44s/it]
Training 1/1 epoch (loss 2.5150): 32%|ββββ | 202/625 [05:02<09:45, 1.38s/it]
Training 1/1 epoch (loss 2.5550): 32%|ββββ | 202/625 [05:04<09:45, 1.38s/it]
Training 1/1 epoch (loss 2.5550): 32%|ββββ | 203/625 [05:04<10:20, 1.47s/it]
Training 1/1 epoch (loss 2.6362): 32%|ββββ | 203/625 [05:05<10:20, 1.47s/it]
Training 1/1 epoch (loss 2.6362): 33%|ββββ | 204/625 [05:05<10:02, 1.43s/it]
Training 1/1 epoch (loss 2.6820): 33%|ββββ | 204/625 [05:06<10:02, 1.43s/it]
Training 1/1 epoch (loss 2.6820): 33%|ββββ | 205/625 [05:06<09:05, 1.30s/it]
Training 1/1 epoch (loss 2.7883): 33%|ββββ | 205/625 [05:08<09:05, 1.30s/it]
Training 1/1 epoch (loss 2.7883): 33%|ββββ | 206/625 [05:08<11:01, 1.58s/it]
Training 1/1 epoch (loss 2.3639): 33%|ββββ | 206/625 [05:09<11:01, 1.58s/it]
Training 1/1 epoch (loss 2.3639): 33%|ββββ | 207/625 [05:09<09:42, 1.39s/it]
Training 1/1 epoch (loss 2.6125): 33%|ββββ | 207/625 [05:11<09:42, 1.39s/it]
Training 1/1 epoch (loss 2.6125): 33%|ββββ | 208/625 [05:11<10:23, 1.49s/it]
Training 1/1 epoch (loss 2.3400): 33%|ββββ | 208/625 [05:13<10:23, 1.49s/it]
Training 1/1 epoch (loss 2.3400): 33%|ββββ | 209/625 [05:13<10:41, 1.54s/it]
Training 1/1 epoch (loss 2.6103): 33%|ββββ | 209/625 [05:13<10:41, 1.54s/it]
Training 1/1 epoch (loss 2.6103): 34%|ββββ | 210/625 [05:13<08:49, 1.28s/it]
Training 1/1 epoch (loss 2.8005): 34%|ββββ | 210/625 [05:15<08:49, 1.28s/it]
Training 1/1 epoch (loss 2.8005): 34%|ββββ | 211/625 [05:15<10:05, 1.46s/it]
Training 1/1 epoch (loss 2.6906): 34%|ββββ | 211/625 [05:17<10:05, 1.46s/it]
Training 1/1 epoch (loss 2.6906): 34%|ββββ | 212/625 [05:17<09:37, 1.40s/it]
Training 1/1 epoch (loss 2.8476): 34%|ββββ | 212/625 [05:17<09:37, 1.40s/it]
Training 1/1 epoch (loss 2.8476): 34%|ββββ | 213/625 [05:17<08:40, 1.26s/it]
Training 1/1 epoch (loss 2.7679): 34%|ββββ | 213/625 [05:19<08:40, 1.26s/it]
Training 1/1 epoch (loss 2.7679): 34%|ββββ | 214/625 [05:19<09:48, 1.43s/it]
Training 1/1 epoch (loss 2.3855): 34%|ββββ | 214/625 [05:21<09:48, 1.43s/it]
Training 1/1 epoch (loss 2.3855): 34%|ββββ | 215/625 [05:21<10:10, 1.49s/it]
Training 1/1 epoch (loss 2.3767): 34%|ββββ | 215/625 [05:22<10:10, 1.49s/it]
Training 1/1 epoch (loss 2.3767): 35%|ββββ | 216/625 [05:22<10:09, 1.49s/it]
Training 1/1 epoch (loss 2.6617): 35%|ββββ | 216/625 [05:24<10:09, 1.49s/it]
Training 1/1 epoch (loss 2.6617): 35%|ββββ | 217/625 [05:24<10:55, 1.61s/it]
Training 1/1 epoch (loss 2.3889): 35%|ββββ | 217/625 [05:25<10:55, 1.61s/it]
Training 1/1 epoch (loss 2.3889): 35%|ββββ | 218/625 [05:25<09:20, 1.38s/it]
Training 1/1 epoch (loss 2.8121): 35%|ββββ | 218/625 [05:27<09:20, 1.38s/it]
Training 1/1 epoch (loss 2.8121): 35%|ββββ | 219/625 [05:27<09:32, 1.41s/it]
Training 1/1 epoch (loss 2.8187): 35%|ββββ | 219/625 [05:28<09:32, 1.41s/it]
Training 1/1 epoch (loss 2.8187): 35%|ββββ | 220/625 [05:28<09:30, 1.41s/it]
Training 1/1 epoch (loss 2.7025): 35%|ββββ | 220/625 [05:29<09:30, 1.41s/it]
Training 1/1 epoch (loss 2.7025): 35%|ββββ | 221/625 [05:29<07:41, 1.14s/it]
Training 1/1 epoch (loss 2.6063): 35%|ββββ | 221/625 [05:31<07:41, 1.14s/it]
Training 1/1 epoch (loss 2.6063): 36%|ββββ | 222/625 [05:31<09:28, 1.41s/it]
Training 1/1 epoch (loss 2.8162): 36%|ββββ | 222/625 [05:32<09:28, 1.41s/it]
Training 1/1 epoch (loss 2.8162): 36%|ββββ | 223/625 [05:32<10:21, 1.55s/it]
Training 1/1 epoch (loss 2.6274): 36%|ββββ | 223/625 [05:33<10:21, 1.55s/it]
Training 1/1 epoch (loss 2.6274): 36%|ββββ | 224/625 [05:33<08:49, 1.32s/it]
Training 1/1 epoch (loss 2.6401): 36%|ββββ | 224/625 [05:35<08:49, 1.32s/it]
Training 1/1 epoch (loss 2.6401): 36%|ββββ | 225/625 [05:35<09:30, 1.43s/it]
Training 1/1 epoch (loss 2.8141): 36%|ββββ | 225/625 [05:36<09:30, 1.43s/it]
Training 1/1 epoch (loss 2.8141): 36%|ββββ | 226/625 [05:36<08:39, 1.30s/it]
Training 1/1 epoch (loss 2.5453): 36%|ββββ | 226/625 [05:37<08:39, 1.30s/it]
Training 1/1 epoch (loss 2.5453): 36%|ββββ | 227/625 [05:37<08:34, 1.29s/it]
Training 1/1 epoch (loss 2.4085): 36%|ββββ | 227/625 [05:39<08:34, 1.29s/it]
Training 1/1 epoch (loss 2.4085): 36%|ββββ | 228/625 [05:39<09:33, 1.44s/it]
Training 1/1 epoch (loss 2.7002): 36%|ββββ | 228/625 [05:40<09:33, 1.44s/it]
Training 1/1 epoch (loss 2.7002): 37%|ββββ | 229/625 [05:40<09:21, 1.42s/it]
Training 1/1 epoch (loss 2.6541): 37%|ββββ | 229/625 [05:43<09:21, 1.42s/it]
Training 1/1 epoch (loss 2.6541): 37%|ββββ | 230/625 [05:43<11:06, 1.69s/it]
Training 1/1 epoch (loss 2.6420): 37%|ββββ | 230/625 [05:45<11:06, 1.69s/it]
Training 1/1 epoch (loss 2.6420): 37%|ββββ | 231/625 [05:45<12:20, 1.88s/it]
Training 1/1 epoch (loss 2.7685): 37%|ββββ | 231/625 [05:46<12:20, 1.88s/it]
Training 1/1 epoch (loss 2.7685): 37%|ββββ | 232/625 [05:46<10:32, 1.61s/it]
Training 1/1 epoch (loss 2.5405): 37%|ββββ | 232/625 [05:48<10:32, 1.61s/it]
Training 1/1 epoch (loss 2.5405): 37%|ββββ | 233/625 [05:48<11:00, 1.68s/it]
Training 1/1 epoch (loss 2.6411): 37%|ββββ | 233/625 [05:49<11:00, 1.68s/it]
Training 1/1 epoch (loss 2.6411): 37%|ββββ | 234/625 [05:49<10:40, 1.64s/it]
Training 1/1 epoch (loss 2.6668): 37%|ββββ | 234/625 [05:50<10:40, 1.64s/it]
Training 1/1 epoch (loss 2.6668): 38%|ββββ | 235/625 [05:50<09:01, 1.39s/it]
Training 1/1 epoch (loss 2.7184): 38%|ββββ | 235/625 [05:52<09:01, 1.39s/it]
Training 1/1 epoch (loss 2.7184): 38%|ββββ | 236/625 [05:52<10:07, 1.56s/it]
Training 1/1 epoch (loss 2.6650): 38%|ββββ | 236/625 [05:54<10:07, 1.56s/it]
Training 1/1 epoch (loss 2.6650): 38%|ββββ | 237/625 [05:54<09:54, 1.53s/it]
Training 1/1 epoch (loss 2.4817): 38%|ββββ | 237/625 [05:55<09:54, 1.53s/it]
Training 1/1 epoch (loss 2.4817): 38%|ββββ | 238/625 [05:55<08:44, 1.35s/it]
Training 1/1 epoch (loss 2.5814): 38%|ββββ | 238/625 [05:57<08:44, 1.35s/it]
Training 1/1 epoch (loss 2.5814): 38%|ββββ | 239/625 [05:57<10:40, 1.66s/it]
Training 1/1 epoch (loss 2.4530): 38%|ββββ | 239/625 [05:58<10:40, 1.66s/it]
Training 1/1 epoch (loss 2.4530): 38%|ββββ | 240/625 [05:58<08:59, 1.40s/it]
Training 1/1 epoch (loss 2.5598): 38%|ββββ | 240/625 [05:59<08:59, 1.40s/it]
Training 1/1 epoch (loss 2.5598): 39%|ββββ | 241/625 [05:59<09:01, 1.41s/it]
Training 1/1 epoch (loss 2.8545): 39%|ββββ | 241/625 [06:01<09:01, 1.41s/it]
Training 1/1 epoch (loss 2.8545): 39%|ββββ | 242/625 [06:01<09:10, 1.44s/it]
Training 1/1 epoch (loss 2.6389): 39%|ββββ | 242/625 [06:01<09:10, 1.44s/it]
Training 1/1 epoch (loss 2.6389): 39%|ββββ | 243/625 [06:01<07:34, 1.19s/it]
Training 1/1 epoch (loss 2.6013): 39%|ββββ | 243/625 [06:04<07:34, 1.19s/it]
Training 1/1 epoch (loss 2.6013): 39%|ββββ | 244/625 [06:04<09:43, 1.53s/it]
Training 1/1 epoch (loss 2.7315): 39%|ββββ | 244/625 [06:05<09:43, 1.53s/it]
Training 1/1 epoch (loss 2.7315): 39%|ββββ | 245/625 [06:05<10:24, 1.64s/it]
Training 1/1 epoch (loss 2.5681): 39%|ββββ | 245/625 [06:06<10:24, 1.64s/it]
Training 1/1 epoch (loss 2.5681): 39%|ββββ | 246/625 [06:06<09:00, 1.43s/it]
Training 1/1 epoch (loss 2.5820): 39%|ββββ | 246/625 [06:08<09:00, 1.43s/it]
Training 1/1 epoch (loss 2.5820): 40%|ββββ | 247/625 [06:08<10:07, 1.61s/it]
Training 1/1 epoch (loss 2.4063): 40%|ββββ | 247/625 [06:09<10:07, 1.61s/it]
Training 1/1 epoch (loss 2.4063): 40%|ββββ | 248/625 [06:09<08:27, 1.35s/it]
Training 1/1 epoch (loss 2.5107): 40%|ββββ | 248/625 [06:11<08:27, 1.35s/it]
Training 1/1 epoch (loss 2.5107): 40%|ββββ | 249/625 [06:11<09:03, 1.44s/it]
Training 1/1 epoch (loss 2.9040): 40%|ββββ | 249/625 [06:13<09:03, 1.44s/it]
Training 1/1 epoch (loss 2.9040): 40%|ββββ | 250/625 [06:13<09:29, 1.52s/it]
Training 1/1 epoch (loss 2.5184): 40%|ββββ | 250/625 [06:13<09:29, 1.52s/it]
Training 1/1 epoch (loss 2.5184): 40%|ββββ | 251/625 [06:13<07:44, 1.24s/it]
Training 1/1 epoch (loss 2.7497): 40%|ββββ | 251/625 [06:15<07:44, 1.24s/it]
Training 1/1 epoch (loss 2.7497): 40%|ββββ | 252/625 [06:15<08:00, 1.29s/it]
Training 1/1 epoch (loss 2.6719): 40%|ββββ | 252/625 [06:17<08:00, 1.29s/it]
Training 1/1 epoch (loss 2.6719): 40%|ββββ | 253/625 [06:17<09:56, 1.60s/it]
Training 1/1 epoch (loss 2.4735): 40%|ββββ | 253/625 [06:17<09:56, 1.60s/it]
Training 1/1 epoch (loss 2.4735): 41%|ββββ | 254/625 [06:17<07:46, 1.26s/it]
Training 1/1 epoch (loss 2.6940): 41%|ββββ | 254/625 [06:18<07:46, 1.26s/it]
Training 1/1 epoch (loss 2.6940): 41%|ββββ | 255/625 [06:18<07:14, 1.17s/it]
Training 1/1 epoch (loss 2.5917): 41%|ββββ | 255/625 [06:21<07:14, 1.17s/it]
Training 1/1 epoch (loss 2.5917): 41%|ββββ | 256/625 [06:21<09:55, 1.61s/it]
Training 1/1 epoch (loss 2.4903): 41%|ββββ | 256/625 [06:22<09:55, 1.61s/it]
Training 1/1 epoch (loss 2.4903): 41%|ββββ | 257/625 [06:22<08:40, 1.42s/it]
Training 1/1 epoch (loss 2.5487): 41%|ββββ | 257/625 [06:24<08:40, 1.42s/it]
Training 1/1 epoch (loss 2.5487): 41%|βββββ | 258/625 [06:24<10:37, 1.74s/it]
Training 1/1 epoch (loss 2.8026): 41%|βββββ | 258/625 [06:26<10:37, 1.74s/it]
Training 1/1 epoch (loss 2.8026): 41%|βββββ | 259/625 [06:26<11:04, 1.81s/it]
Training 1/1 epoch (loss 2.6049): 41%|βββββ | 259/625 [06:27<11:04, 1.81s/it]
Training 1/1 epoch (loss 2.6049): 42%|βββββ | 260/625 [06:27<09:19, 1.53s/it]
Training 1/1 epoch (loss 2.5635): 42%|βββββ | 260/625 [06:30<09:19, 1.53s/it]
Training 1/1 epoch (loss 2.5635): 42%|βββββ | 261/625 [06:30<10:48, 1.78s/it]
Training 1/1 epoch (loss 2.5964): 42%|βββββ | 261/625 [06:31<10:48, 1.78s/it]
Training 1/1 epoch (loss 2.5964): 42%|βββββ | 262/625 [06:31<10:23, 1.72s/it]
Training 1/1 epoch (loss 2.4780): 42%|βββββ | 262/625 [06:32<10:23, 1.72s/it]
Training 1/1 epoch (loss 2.4780): 42%|βββββ | 263/625 [06:32<09:27, 1.57s/it]
Training 1/1 epoch (loss 2.3013): 42%|βββββ | 263/625 [06:34<09:27, 1.57s/it]
Training 1/1 epoch (loss 2.3013): 42%|βββββ | 264/625 [06:34<09:51, 1.64s/it]
Training 1/1 epoch (loss 2.4745): 42%|βββββ | 264/625 [06:35<09:51, 1.64s/it]
Training 1/1 epoch (loss 2.4745): 42%|βββββ | 265/625 [06:35<07:47, 1.30s/it]
Training 1/1 epoch (loss 2.4539): 42%|βββββ | 265/625 [06:37<07:47, 1.30s/it]
Training 1/1 epoch (loss 2.4539): 43%|βββββ | 266/625 [06:37<08:54, 1.49s/it]
Training 1/1 epoch (loss 2.7676): 43%|βββββ | 266/625 [06:38<08:54, 1.49s/it]
Training 1/1 epoch (loss 2.7676): 43%|βββββ | 267/625 [06:38<08:41, 1.46s/it]
Training 1/1 epoch (loss 2.5934): 43%|βββββ | 267/625 [06:39<08:41, 1.46s/it]
Training 1/1 epoch (loss 2.5934): 43%|βββββ | 268/625 [06:39<07:02, 1.18s/it]
Training 1/1 epoch (loss 2.6399): 43%|βββββ | 268/625 [06:40<07:02, 1.18s/it]
Training 1/1 epoch (loss 2.6399): 43%|βββββ | 269/625 [06:40<06:48, 1.15s/it]
Training 1/1 epoch (loss 2.4922): 43%|βββββ | 269/625 [06:42<06:48, 1.15s/it]
Training 1/1 epoch (loss 2.4922): 43%|βββββ | 270/625 [06:42<08:18, 1.40s/it]
Training 1/1 epoch (loss 2.7075): 43%|βββββ | 270/625 [06:42<08:18, 1.40s/it]
Training 1/1 epoch (loss 2.7075): 43%|βββββ | 271/625 [06:42<06:42, 1.14s/it]
Training 1/1 epoch (loss 2.6264): 43%|βββββ | 271/625 [06:45<06:42, 1.14s/it]
Training 1/1 epoch (loss 2.6264): 44%|βββββ | 272/625 [06:45<09:19, 1.58s/it]
Training 1/1 epoch (loss 2.6361): 44%|βββββ | 272/625 [06:46<09:19, 1.58s/it]
Training 1/1 epoch (loss 2.6361): 44%|βββββ | 273/625 [06:46<08:45, 1.49s/it]
Training 1/1 epoch (loss 2.4965): 44%|βββββ | 273/625 [06:47<08:45, 1.49s/it]
Training 1/1 epoch (loss 2.4965): 44%|βββββ | 274/625 [06:47<08:31, 1.46s/it]
Training 1/1 epoch (loss 2.4225): 44%|βββββ | 274/625 [06:49<08:31, 1.46s/it]
Training 1/1 epoch (loss 2.4225): 44%|βββββ | 275/625 [06:49<09:19, 1.60s/it]
Training 1/1 epoch (loss 2.4284): 44%|βββββ | 275/625 [06:50<09:19, 1.60s/it]
Training 1/1 epoch (loss 2.4284): 44%|βββββ | 276/625 [06:50<08:21, 1.44s/it]
Training 1/1 epoch (loss 2.5998): 44%|βββββ | 276/625 [06:53<08:21, 1.44s/it]
Training 1/1 epoch (loss 2.5998): 44%|βββββ | 277/625 [06:53<09:36, 1.66s/it]
Training 1/1 epoch (loss 2.6190): 44%|βββββ | 277/625 [06:55<09:36, 1.66s/it]
Training 1/1 epoch (loss 2.6190): 44%|βββββ | 278/625 [06:55<10:39, 1.84s/it]
Training 1/1 epoch (loss 2.6651): 44%|βββββ | 278/625 [06:55<10:39, 1.84s/it]
Training 1/1 epoch (loss 2.6651): 45%|βββββ | 279/625 [06:55<08:17, 1.44s/it]
Training 1/1 epoch (loss 2.5361): 45%|βββββ | 279/625 [06:58<08:17, 1.44s/it]
Training 1/1 epoch (loss 2.5361): 45%|βββββ | 280/625 [06:58<10:13, 1.78s/it]
Training 1/1 epoch (loss 2.7235): 45%|βββββ | 280/625 [07:00<10:13, 1.78s/it]
Training 1/1 epoch (loss 2.7235): 45%|βββββ | 281/625 [07:00<10:38, 1.86s/it]
Training 1/1 epoch (loss 2.6270): 45%|βββββ | 281/625 [07:01<10:38, 1.86s/it]
Training 1/1 epoch (loss 2.6270): 45%|βββββ | 282/625 [07:01<08:45, 1.53s/it]
Training 1/1 epoch (loss 2.5739): 45%|βββββ | 282/625 [07:02<08:45, 1.53s/it]
Training 1/1 epoch (loss 2.5739): 45%|βββββ | 283/625 [07:02<08:39, 1.52s/it]
Training 1/1 epoch (loss 2.8040): 45%|βββββ | 283/625 [07:03<08:39, 1.52s/it]
Training 1/1 epoch (loss 2.8040): 45%|βββββ | 284/625 [07:03<07:58, 1.40s/it]
Training 1/1 epoch (loss 2.5138): 45%|βββββ | 284/625 [07:04<07:58, 1.40s/it]
Training 1/1 epoch (loss 2.5138): 46%|βββββ | 285/625 [07:04<07:00, 1.24s/it]
Training 1/1 epoch (loss 2.5169): 46%|βββββ | 285/625 [07:06<07:00, 1.24s/it]
Training 1/1 epoch (loss 2.5169): 46%|βββββ | 286/625 [07:06<07:32, 1.34s/it]
Training 1/1 epoch (loss 2.5670): 46%|βββββ | 286/625 [07:07<07:32, 1.34s/it]
Training 1/1 epoch (loss 2.5670): 46%|βββββ | 287/625 [07:07<07:30, 1.33s/it]
Training 1/1 epoch (loss 2.4548): 46%|βββββ | 287/625 [07:08<07:30, 1.33s/it]
Training 1/1 epoch (loss 2.4548): 46%|βββββ | 288/625 [07:08<07:06, 1.27s/it]
Training 1/1 epoch (loss 2.6085): 46%|βββββ | 288/625 [07:10<07:06, 1.27s/it]
Training 1/1 epoch (loss 2.6085): 46%|βββββ | 289/625 [07:10<08:32, 1.53s/it]
Training 1/1 epoch (loss 2.7216): 46%|βββββ | 289/625 [07:11<08:32, 1.53s/it]
Training 1/1 epoch (loss 2.7216): 46%|βββββ | 290/625 [07:11<06:40, 1.19s/it]
Training 1/1 epoch (loss 2.6597): 46%|βββββ | 290/625 [07:12<06:40, 1.19s/it]
Training 1/1 epoch (loss 2.6597): 47%|βββββ | 291/625 [07:12<07:13, 1.30s/it]
Training 1/1 epoch (loss 2.9198): 47%|βββββ | 291/625 [07:14<07:13, 1.30s/it]
Training 1/1 epoch (loss 2.9198): 47%|βββββ | 292/625 [07:14<07:54, 1.42s/it]
Training 1/1 epoch (loss 2.6552): 47%|βββββ | 292/625 [07:15<07:54, 1.42s/it]
Training 1/1 epoch (loss 2.6552): 47%|βββββ | 293/625 [07:15<06:43, 1.22s/it]
Training 1/1 epoch (loss 2.4511): 47%|βββββ | 293/625 [07:17<06:43, 1.22s/it]
Training 1/1 epoch (loss 2.4511): 47%|βββββ | 294/625 [07:17<07:40, 1.39s/it]
Training 1/1 epoch (loss 2.6283): 47%|βββββ | 294/625 [07:18<07:40, 1.39s/it]
Training 1/1 epoch (loss 2.6283): 47%|βββββ | 295/625 [07:18<07:04, 1.29s/it]
Training 1/1 epoch (loss 2.4732): 47%|βββββ | 295/625 [07:19<07:04, 1.29s/it]
Training 1/1 epoch (loss 2.4732): 47%|βββββ | 296/625 [07:19<07:57, 1.45s/it]
Training 1/1 epoch (loss 2.7586): 47%|βββββ | 296/625 [07:21<07:57, 1.45s/it]
Training 1/1 epoch (loss 2.7586): 48%|βββββ | 297/625 [07:21<08:14, 1.51s/it]
Training 1/1 epoch (loss 2.3962): 48%|βββββ | 297/625 [07:22<08:14, 1.51s/it]
Training 1/1 epoch (loss 2.3962): 48%|βββββ | 298/625 [07:22<06:52, 1.26s/it]
Training 1/1 epoch (loss 2.2577): 48%|βββββ | 298/625 [07:24<06:52, 1.26s/it]
Training 1/1 epoch (loss 2.2577): 48%|βββββ | 299/625 [07:24<08:34, 1.58s/it]
Training 1/1 epoch (loss 2.4298): 48%|βββββ | 299/625 [07:26<08:34, 1.58s/it]
Training 1/1 epoch (loss 2.4298): 48%|βββββ | 300/625 [07:26<08:20, 1.54s/it]
Training 1/1 epoch (loss 2.5330): 48%|βββββ | 300/625 [07:26<08:20, 1.54s/it]
Training 1/1 epoch (loss 2.5330): 48%|βββββ | 301/625 [07:26<06:44, 1.25s/it]
Training 1/1 epoch (loss 2.6453): 48%|βββββ | 301/625 [07:29<06:44, 1.25s/it]
Training 1/1 epoch (loss 2.6453): 48%|βββββ | 302/625 [07:29<08:40, 1.61s/it]
Training 1/1 epoch (loss 2.4889): 48%|βββββ | 302/625 [07:31<08:40, 1.61s/it]
Training 1/1 epoch (loss 2.4889): 48%|βββββ | 303/625 [07:31<10:07, 1.89s/it]
Training 1/1 epoch (loss 2.6977): 48%|βββββ | 303/625 [07:32<10:07, 1.89s/it]
Training 1/1 epoch (loss 2.6977): 49%|βββββ | 304/625 [07:32<09:06, 1.70s/it]
Training 1/1 epoch (loss 2.5961): 49%|βββββ | 304/625 [07:34<09:06, 1.70s/it]
Training 1/1 epoch (loss 2.5961): 49%|βββββ | 305/625 [07:34<09:15, 1.74s/it]
Training 1/1 epoch (loss 2.4152): 49%|βββββ | 305/625 [07:35<09:15, 1.74s/it]
Training 1/1 epoch (loss 2.4152): 49%|βββββ | 306/625 [07:35<08:23, 1.58s/it]
Training 1/1 epoch (loss 2.7896): 49%|βββββ | 306/625 [07:37<08:23, 1.58s/it]
Training 1/1 epoch (loss 2.7896): 49%|βββββ | 307/625 [07:37<07:40, 1.45s/it]
Training 1/1 epoch (loss 2.6228): 49%|βββββ | 307/625 [07:39<07:40, 1.45s/it]
Training 1/1 epoch (loss 2.6228): 49%|βββββ | 308/625 [07:39<08:52, 1.68s/it]
Training 1/1 epoch (loss 2.5520): 49%|βββββ | 308/625 [07:40<08:52, 1.68s/it]
Training 1/1 epoch (loss 2.5520): 49%|βββββ | 309/625 [07:40<07:56, 1.51s/it]
Training 1/1 epoch (loss 2.4747): 49%|βββββ | 309/625 [07:42<07:56, 1.51s/it]
Training 1/1 epoch (loss 2.4747): 50%|βββββ | 310/625 [07:42<08:55, 1.70s/it]
Training 1/1 epoch (loss 2.3989): 50%|βββββ | 310/625 [07:44<08:55, 1.70s/it]
Training 1/1 epoch (loss 2.3989): 50%|βββββ | 311/625 [07:44<10:08, 1.94s/it]
Training 1/1 epoch (loss 2.6524): 50%|βββββ | 311/625 [07:45<10:08, 1.94s/it]
Training 1/1 epoch (loss 2.6524): 50%|βββββ | 312/625 [07:45<08:16, 1.59s/it]
Training 1/1 epoch (loss 2.5155): 50%|βββββ | 312/625 [07:47<08:16, 1.59s/it]
Training 1/1 epoch (loss 2.5155): 50%|βββββ | 313/625 [07:47<08:48, 1.69s/it]
Training 1/1 epoch (loss 2.5233): 50%|βββββ | 313/625 [07:49<08:48, 1.69s/it]
Training 1/1 epoch (loss 2.5233): 50%|βββββ | 314/625 [07:49<08:35, 1.66s/it]
Training 1/1 epoch (loss 2.7938): 50%|βββββ | 314/625 [07:50<08:35, 1.66s/it]
Training 1/1 epoch (loss 2.7938): 50%|βββββ | 315/625 [07:50<07:55, 1.53s/it]
Training 1/1 epoch (loss 2.7087): 50%|βββββ | 315/625 [07:52<07:55, 1.53s/it]
Training 1/1 epoch (loss 2.7087): 51%|βββββ | 316/625 [07:52<08:33, 1.66s/it]
Training 1/1 epoch (loss 2.7095): 51%|βββββ | 316/625 [07:53<08:33, 1.66s/it]
Training 1/1 epoch (loss 2.7095): 51%|βββββ | 317/625 [07:53<07:49, 1.52s/it]
Training 1/1 epoch (loss 2.5010): 51%|βββββ | 317/625 [07:55<07:49, 1.52s/it]
Training 1/1 epoch (loss 2.5010): 51%|βββββ | 318/625 [07:55<08:01, 1.57s/it]
Training 1/1 epoch (loss 2.6636): 51%|βββββ | 318/625 [07:57<08:01, 1.57s/it]
Training 1/1 epoch (loss 2.6636): 51%|βββββ | 319/625 [07:57<08:13, 1.61s/it]
Training 1/1 epoch (loss 2.5154): 51%|βββββ | 319/625 [07:58<08:13, 1.61s/it]
Training 1/1 epoch (loss 2.5154): 51%|βββββ | 320/625 [07:58<07:20, 1.45s/it]
Training 1/1 epoch (loss 2.6414): 51%|βββββ | 320/625 [08:00<07:20, 1.45s/it]
Training 1/1 epoch (loss 2.6414): 51%|ββββββ | 321/625 [08:00<08:04, 1.59s/it]
Training 1/1 epoch (loss 2.5597): 51%|ββββββ | 321/625 [08:02<08:04, 1.59s/it]
Training 1/1 epoch (loss 2.5597): 52%|ββββββ | 322/625 [08:02<08:43, 1.73s/it]
Training 1/1 epoch (loss 2.6551): 52%|ββββββ | 322/625 [08:02<08:43, 1.73s/it]
Training 1/1 epoch (loss 2.6551): 52%|ββββββ | 323/625 [08:02<07:17, 1.45s/it]
Training 1/1 epoch (loss 2.6597): 52%|ββββββ | 323/625 [08:05<07:17, 1.45s/it]
Training 1/1 epoch (loss 2.6597): 52%|ββββββ | 324/625 [08:05<08:47, 1.75s/it]
Training 1/1 epoch (loss 2.7932): 52%|ββββββ | 324/625 [08:06<08:47, 1.75s/it]
Training 1/1 epoch (loss 2.7932): 52%|ββββββ | 325/625 [08:06<07:41, 1.54s/it]
Training 1/1 epoch (loss 2.5349): 52%|ββββββ | 325/625 [08:07<07:41, 1.54s/it]
Training 1/1 epoch (loss 2.5349): 52%|ββββββ | 326/625 [08:07<07:40, 1.54s/it]
Training 1/1 epoch (loss 2.6322): 52%|ββββββ | 326/625 [08:10<07:40, 1.54s/it]
Training 1/1 epoch (loss 2.6322): 52%|ββββββ | 327/625 [08:10<08:49, 1.78s/it]
Training 1/1 epoch (loss 2.5171): 52%|ββββββ | 327/625 [08:11<08:49, 1.78s/it]
Training 1/1 epoch (loss 2.5171): 52%|ββββββ | 328/625 [08:11<07:35, 1.53s/it]
Training 1/1 epoch (loss 2.5003): 52%|ββββββ | 328/625 [08:12<07:35, 1.53s/it]
Training 1/1 epoch (loss 2.5003): 53%|ββββββ | 329/625 [08:12<07:41, 1.56s/it]
Training 1/1 epoch (loss 2.5401): 53%|ββββββ | 329/625 [08:14<07:41, 1.56s/it]
Training 1/1 epoch (loss 2.5401): 53%|ββββββ | 330/625 [08:14<08:01, 1.63s/it]
Training 1/1 epoch (loss 2.6279): 53%|ββββββ | 330/625 [08:15<08:01, 1.63s/it]
Training 1/1 epoch (loss 2.6279): 53%|ββββββ | 331/625 [08:15<07:03, 1.44s/it]
Training 1/1 epoch (loss 2.7986): 53%|ββββββ | 331/625 [08:16<07:03, 1.44s/it]
Training 1/1 epoch (loss 2.7986): 53%|ββββββ | 332/625 [08:16<06:17, 1.29s/it]
Training 1/1 epoch (loss 2.4821): 53%|ββββββ | 332/625 [08:18<06:17, 1.29s/it]
Training 1/1 epoch (loss 2.4821): 53%|ββββββ | 333/625 [08:18<07:08, 1.47s/it]
Training 1/1 epoch (loss 2.5834): 53%|ββββββ | 333/625 [08:19<07:08, 1.47s/it]
Training 1/1 epoch (loss 2.5834): 53%|ββββββ | 334/625 [08:19<06:50, 1.41s/it]
Training 1/1 epoch (loss 2.5129): 53%|ββββββ | 334/625 [08:22<06:50, 1.41s/it]
Training 1/1 epoch (loss 2.5129): 54%|ββββββ | 335/625 [08:22<08:08, 1.69s/it]
Training 1/1 epoch (loss 2.6788): 54%|ββββββ | 335/625 [08:23<08:08, 1.69s/it]
Training 1/1 epoch (loss 2.6788): 54%|ββββββ | 336/625 [08:23<07:40, 1.59s/it]
Training 1/1 epoch (loss 2.7073): 54%|ββββββ | 336/625 [08:24<07:40, 1.59s/it]
Training 1/1 epoch (loss 2.7073): 54%|ββββββ | 337/625 [08:24<06:46, 1.41s/it]
Training 1/1 epoch (loss 2.6534): 54%|ββββββ | 337/625 [08:26<06:46, 1.41s/it]
Training 1/1 epoch (loss 2.6534): 54%|ββββββ | 338/625 [08:26<07:45, 1.62s/it]
Training 1/1 epoch (loss 2.5003): 54%|ββββββ | 338/625 [08:27<07:45, 1.62s/it]
Training 1/1 epoch (loss 2.5003): 54%|ββββββ | 339/625 [08:27<06:51, 1.44s/it]
Training 1/1 epoch (loss 2.6853): 54%|ββββββ | 339/625 [08:29<06:51, 1.44s/it]
Training 1/1 epoch (loss 2.6853): 54%|ββββββ | 340/625 [08:29<07:41, 1.62s/it]
Training 1/1 epoch (loss 2.4783): 54%|ββββββ | 340/625 [08:30<07:41, 1.62s/it]
Training 1/1 epoch (loss 2.4783): 55%|ββββββ | 341/625 [08:30<07:11, 1.52s/it]
Training 1/1 epoch (loss 2.5973): 55%|ββββββ | 341/625 [08:31<07:11, 1.52s/it]
Training 1/1 epoch (loss 2.5973): 55%|ββββββ | 342/625 [08:31<05:56, 1.26s/it]
Training 1/1 epoch (loss 2.4898): 55%|ββββββ | 342/625 [08:32<05:56, 1.26s/it]
Training 1/1 epoch (loss 2.4898): 55%|ββββββ | 343/625 [08:32<06:01, 1.28s/it]
Training 1/1 epoch (loss 2.4284): 55%|ββββββ | 343/625 [08:34<06:01, 1.28s/it]
Training 1/1 epoch (loss 2.4284): 55%|ββββββ | 344/625 [08:34<06:49, 1.46s/it]
Training 1/1 epoch (loss 2.5660): 55%|ββββββ | 344/625 [08:35<06:49, 1.46s/it]
Training 1/1 epoch (loss 2.5660): 55%|ββββββ | 345/625 [08:35<05:23, 1.16s/it]
Training 1/1 epoch (loss 2.4308): 55%|ββββββ | 345/625 [08:37<05:23, 1.16s/it]
Training 1/1 epoch (loss 2.4308): 55%|ββββββ | 346/625 [08:37<07:08, 1.54s/it]
Training 1/1 epoch (loss 2.6122): 55%|ββββββ | 346/625 [08:39<07:08, 1.54s/it]
Training 1/1 epoch (loss 2.6122): 56%|ββββββ | 347/625 [08:39<07:05, 1.53s/it]
Training 1/1 epoch (loss 2.8261): 56%|ββββββ | 347/625 [08:39<07:05, 1.53s/it]
Training 1/1 epoch (loss 2.8261): 56%|ββββββ | 348/625 [08:39<05:51, 1.27s/it]
Training 1/1 epoch (loss 2.5617): 56%|ββββββ | 348/625 [08:41<05:51, 1.27s/it]
Training 1/1 epoch (loss 2.5617): 56%|ββββββ | 349/625 [08:41<06:48, 1.48s/it]
Training 1/1 epoch (loss 2.6373): 56%|ββββββ | 349/625 [08:43<06:48, 1.48s/it]
Training 1/1 epoch (loss 2.6373): 56%|ββββββ | 350/625 [08:43<06:39, 1.45s/it]
Training 1/1 epoch (loss 2.4482): 56%|ββββββ | 350/625 [08:43<06:39, 1.45s/it]
Training 1/1 epoch (loss 2.4482): 56%|ββββββ | 351/625 [08:43<05:17, 1.16s/it]
Training 1/1 epoch (loss 2.8180): 56%|ββββββ | 351/625 [08:45<05:17, 1.16s/it]
Training 1/1 epoch (loss 2.8180): 56%|ββββββ | 352/625 [08:45<05:58, 1.31s/it]
Training 1/1 epoch (loss 2.9658): 56%|ββββββ | 352/625 [08:47<05:58, 1.31s/it]
Training 1/1 epoch (loss 2.9658): 56%|ββββββ | 353/625 [08:47<06:44, 1.49s/it]
Training 1/1 epoch (loss 2.6292): 56%|ββββββ | 353/625 [08:48<06:44, 1.49s/it]
Training 1/1 epoch (loss 2.6292): 57%|ββββββ | 354/625 [08:48<06:04, 1.35s/it]
Training 1/1 epoch (loss 2.6546): 57%|ββββββ | 354/625 [08:49<06:04, 1.35s/it]
Training 1/1 epoch (loss 2.6546): 57%|ββββββ | 355/625 [08:49<06:18, 1.40s/it]
Training 1/1 epoch (loss 2.4895): 57%|ββββββ | 355/625 [08:51<06:18, 1.40s/it]
Training 1/1 epoch (loss 2.4895): 57%|ββββββ | 356/625 [08:51<06:09, 1.37s/it]
Training 1/1 epoch (loss 2.7064): 57%|ββββββ | 356/625 [08:53<06:09, 1.37s/it]
Training 1/1 epoch (loss 2.7064): 57%|ββββββ | 357/625 [08:53<07:02, 1.57s/it]
Training 1/1 epoch (loss 2.5439): 57%|ββββββ | 357/625 [08:54<07:02, 1.57s/it]
Training 1/1 epoch (loss 2.5439): 57%|ββββββ | 358/625 [08:54<07:14, 1.63s/it]
Training 1/1 epoch (loss 2.7931): 57%|ββββββ | 358/625 [08:55<07:14, 1.63s/it]
Training 1/1 epoch (loss 2.7931): 57%|ββββββ | 359/625 [08:55<05:48, 1.31s/it]
Training 1/1 epoch (loss 2.3964): 57%|ββββββ | 359/625 [08:57<05:48, 1.31s/it]
Training 1/1 epoch (loss 2.3964): 58%|ββββββ | 360/625 [08:57<06:30, 1.47s/it]
Training 1/1 epoch (loss 2.6376): 58%|ββββββ | 360/625 [08:59<06:30, 1.47s/it]
Training 1/1 epoch (loss 2.6376): 58%|ββββββ | 361/625 [08:59<07:42, 1.75s/it]
Training 1/1 epoch (loss 2.4979): 58%|ββββββ | 361/625 [09:00<07:42, 1.75s/it]
Training 1/1 epoch (loss 2.4979): 58%|ββββββ | 362/625 [09:00<07:00, 1.60s/it]
Training 1/1 epoch (loss 2.6187): 58%|ββββββ | 362/625 [09:02<07:00, 1.60s/it]
Training 1/1 epoch (loss 2.6187): 58%|ββββββ | 363/625 [09:02<07:33, 1.73s/it]
Training 1/1 epoch (loss 2.6096): 58%|ββββββ | 363/625 [09:04<07:33, 1.73s/it]
Training 1/1 epoch (loss 2.6096): 58%|ββββββ | 364/625 [09:04<06:50, 1.57s/it]
Training 1/1 epoch (loss 2.3512): 58%|ββββββ | 364/625 [09:05<06:50, 1.57s/it]
Training 1/1 epoch (loss 2.3512): 58%|ββββββ | 365/625 [09:05<06:01, 1.39s/it]
Training 1/1 epoch (loss 2.5626): 58%|ββββββ | 365/625 [09:07<06:01, 1.39s/it]
Training 1/1 epoch (loss 2.5626): 59%|ββββββ | 366/625 [09:07<07:12, 1.67s/it]
Training 1/1 epoch (loss 2.5151): 59%|ββββββ | 366/625 [09:08<07:12, 1.67s/it]
Training 1/1 epoch (loss 2.5151): 59%|ββββββ | 367/625 [09:08<06:16, 1.46s/it]
Training 1/1 epoch (loss 2.5015): 59%|ββββββ | 367/625 [09:10<06:16, 1.46s/it]
Training 1/1 epoch (loss 2.5015): 59%|ββββββ | 368/625 [09:10<06:53, 1.61s/it]
Training 1/1 epoch (loss 2.4379): 59%|ββββββ | 368/625 [09:11<06:53, 1.61s/it]
Training 1/1 epoch (loss 2.4379): 59%|ββββββ | 369/625 [09:11<06:44, 1.58s/it]
Training 1/1 epoch (loss 2.5754): 59%|ββββββ | 369/625 [09:12<06:44, 1.58s/it]
Training 1/1 epoch (loss 2.5754): 59%|ββββββ | 370/625 [09:12<05:33, 1.31s/it]
Training 1/1 epoch (loss 2.6579): 59%|ββββββ | 370/625 [09:14<05:33, 1.31s/it]
Training 1/1 epoch (loss 2.6579): 59%|ββββββ | 371/625 [09:14<06:42, 1.58s/it]
Training 1/1 epoch (loss 2.7597): 59%|ββββββ | 371/625 [09:16<06:42, 1.58s/it]
Training 1/1 epoch (loss 2.7597): 60%|ββββββ | 372/625 [09:16<07:00, 1.66s/it]
Training 1/1 epoch (loss 2.4668): 60%|ββββββ | 372/625 [09:17<07:00, 1.66s/it]
Training 1/1 epoch (loss 2.4668): 60%|ββββββ | 373/625 [09:17<05:35, 1.33s/it]
Training 1/1 epoch (loss 2.7324): 60%|ββββββ | 373/625 [09:18<05:35, 1.33s/it]
Training 1/1 epoch (loss 2.7324): 60%|ββββββ | 374/625 [09:18<06:03, 1.45s/it]
Training 1/1 epoch (loss 2.6061): 60%|ββββββ | 374/625 [09:20<06:03, 1.45s/it]
Training 1/1 epoch (loss 2.6061): 60%|ββββββ | 375/625 [09:20<06:10, 1.48s/it]
Training 1/1 epoch (loss 2.5348): 60%|ββββββ | 375/625 [09:21<06:10, 1.48s/it]
Training 1/1 epoch (loss 2.5348): 60%|ββββββ | 376/625 [09:21<05:17, 1.27s/it]
Training 1/1 epoch (loss 2.4443): 60%|ββββββ | 376/625 [09:23<05:17, 1.27s/it]
Training 1/1 epoch (loss 2.4443): 60%|ββββββ | 377/625 [09:23<05:57, 1.44s/it]
Training 1/1 epoch (loss 2.4855): 60%|ββββββ | 377/625 [09:24<05:57, 1.44s/it]
Training 1/1 epoch (loss 2.4855): 60%|ββββββ | 378/625 [09:24<05:30, 1.34s/it]
Training 1/1 epoch (loss 2.7491): 60%|ββββββ | 378/625 [09:26<05:30, 1.34s/it]
Training 1/1 epoch (loss 2.7491): 61%|ββββββ | 379/625 [09:26<06:18, 1.54s/it]
Training 1/1 epoch (loss 2.4486): 61%|ββββββ | 379/625 [09:28<06:18, 1.54s/it]
Training 1/1 epoch (loss 2.4486): 61%|ββββββ | 380/625 [09:28<06:44, 1.65s/it]
Training 1/1 epoch (loss 2.7408): 61%|ββββββ | 380/625 [09:28<06:44, 1.65s/it]
Training 1/1 epoch (loss 2.7408): 61%|ββββββ | 381/625 [09:28<05:26, 1.34s/it]
Training 1/1 epoch (loss 2.5169): 61%|ββββββ | 381/625 [09:31<05:26, 1.34s/it]
Training 1/1 epoch (loss 2.5169): 61%|ββββββ | 382/625 [09:31<06:46, 1.67s/it]
Training 1/1 epoch (loss 2.6812): 61%|ββββββ | 382/625 [09:32<06:46, 1.67s/it]
Training 1/1 epoch (loss 2.6812): 61%|βββββββ | 383/625 [09:32<06:51, 1.70s/it]
Training 1/1 epoch (loss 2.5197): 61%|βββββββ | 383/625 [09:33<06:51, 1.70s/it]
Training 1/1 epoch (loss 2.5197): 61%|βββββββ | 384/625 [09:33<05:53, 1.47s/it]
Training 1/1 epoch (loss 2.4998): 61%|βββββββ | 384/625 [09:35<05:53, 1.47s/it]
Training 1/1 epoch (loss 2.4998): 62%|βββββββ | 385/625 [09:35<06:26, 1.61s/it]
Training 1/1 epoch (loss 2.6645): 62%|βββββββ | 385/625 [09:37<06:26, 1.61s/it]
Training 1/1 epoch (loss 2.6645): 62%|βββββββ | 386/625 [09:37<06:06, 1.53s/it]
Training 1/1 epoch (loss 2.6631): 62%|βββββββ | 386/625 [09:38<06:06, 1.53s/it]
Training 1/1 epoch (loss 2.6631): 62%|βββββββ | 387/625 [09:38<05:56, 1.50s/it]
Training 1/1 epoch (loss 2.6429): 62%|βββββββ | 387/625 [09:41<05:56, 1.50s/it]
Training 1/1 epoch (loss 2.6429): 62%|βββββββ | 388/625 [09:41<07:05, 1.80s/it]
Training 1/1 epoch (loss 2.4201): 62%|βββββββ | 388/625 [09:41<07:05, 1.80s/it]
Training 1/1 epoch (loss 2.4201): 62%|βββββββ | 389/625 [09:41<05:33, 1.41s/it]
Training 1/1 epoch (loss 2.6715): 62%|βββββββ | 389/625 [09:44<05:33, 1.41s/it]
Training 1/1 epoch (loss 2.6715): 62%|βββββββ | 390/625 [09:44<06:48, 1.74s/it]
Training 1/1 epoch (loss 2.6031): 62%|βββββββ | 390/625 [09:45<06:48, 1.74s/it]
Training 1/1 epoch (loss 2.6031): 63%|βββββββ | 391/625 [09:45<06:23, 1.64s/it]
Training 1/1 epoch (loss 2.7450): 63%|βββββββ | 391/625 [09:46<06:23, 1.64s/it]
Training 1/1 epoch (loss 2.7450): 63%|βββββββ | 392/625 [09:46<05:11, 1.34s/it]
Training 1/1 epoch (loss 2.5557): 63%|βββββββ | 392/625 [09:47<05:11, 1.34s/it]
Training 1/1 epoch (loss 2.5557): 63%|βββββββ | 393/625 [09:47<05:00, 1.30s/it]
Training 1/1 epoch (loss 2.5687): 63%|βββββββ | 393/625 [09:49<05:00, 1.30s/it]
Training 1/1 epoch (loss 2.5687): 63%|βββββββ | 394/625 [09:49<05:55, 1.54s/it]
Training 1/1 epoch (loss 2.6263): 63%|βββββββ | 394/625 [09:49<05:55, 1.54s/it]
Training 1/1 epoch (loss 2.6263): 63%|βββββββ | 395/625 [09:49<04:45, 1.24s/it]
Training 1/1 epoch (loss 2.4415): 63%|βββββββ | 395/625 [09:51<04:45, 1.24s/it]
Training 1/1 epoch (loss 2.4415): 63%|βββββββ | 396/625 [09:51<05:27, 1.43s/it]
Training 1/1 epoch (loss 2.5769): 63%|βββββββ | 396/625 [09:53<05:27, 1.43s/it]
Training 1/1 epoch (loss 2.5769): 64%|βββββββ | 397/625 [09:53<06:09, 1.62s/it]
Training 1/1 epoch (loss 2.5505): 64%|βββββββ | 397/625 [09:55<06:09, 1.62s/it]
Training 1/1 epoch (loss 2.5505): 64%|βββββββ | 398/625 [09:55<05:49, 1.54s/it]
Training 1/1 epoch (loss 2.5764): 64%|βββββββ | 398/625 [09:57<05:49, 1.54s/it]
Training 1/1 epoch (loss 2.5764): 64%|βββββββ | 399/625 [09:57<06:24, 1.70s/it]
Training 1/1 epoch (loss 2.4617): 64%|βββββββ | 399/625 [09:58<06:24, 1.70s/it]
Training 1/1 epoch (loss 2.4617): 64%|βββββββ | 400/625 [09:58<06:02, 1.61s/it]
Training 1/1 epoch (loss 2.5703): 64%|βββββββ | 400/625 [10:00<06:02, 1.61s/it]
Training 1/1 epoch (loss 2.5703): 64%|βββββββ | 401/625 [10:00<05:50, 1.56s/it]
Training 1/1 epoch (loss 2.6415): 64%|βββββββ | 401/625 [10:02<05:50, 1.56s/it]
Training 1/1 epoch (loss 2.6415): 64%|βββββββ | 402/625 [10:02<06:36, 1.78s/it]
Training 1/1 epoch (loss 2.8100): 64%|βββββββ | 402/625 [10:03<06:36, 1.78s/it]
Training 1/1 epoch (loss 2.8100): 64%|βββββββ | 403/625 [10:03<05:33, 1.50s/it]
Training 1/1 epoch (loss 2.6004): 64%|βββββββ | 403/625 [10:05<05:33, 1.50s/it]
Training 1/1 epoch (loss 2.6004): 65%|βββββββ | 404/625 [10:05<06:35, 1.79s/it]
Training 1/1 epoch (loss 2.5503): 65%|βββββββ | 404/625 [10:08<06:35, 1.79s/it]
Training 1/1 epoch (loss 2.5503): 65%|βββββββ | 405/625 [10:08<07:19, 2.00s/it]
Training 1/1 epoch (loss 2.5576): 65%|βββββββ | 405/625 [10:08<07:19, 2.00s/it]
Training 1/1 epoch (loss 2.5576): 65%|βββββββ | 406/625 [10:08<05:46, 1.58s/it]
Training 1/1 epoch (loss 2.6769): 65%|βββββββ | 406/625 [10:10<05:46, 1.58s/it]
Training 1/1 epoch (loss 2.6769): 65%|βββββββ | 407/625 [10:10<05:56, 1.64s/it]
Training 1/1 epoch (loss 2.5987): 65%|βββββββ | 407/625 [10:11<05:56, 1.64s/it]
Training 1/1 epoch (loss 2.5987): 65%|βββββββ | 408/625 [10:11<05:37, 1.56s/it]
Training 1/1 epoch (loss 2.7526): 65%|βββββββ | 408/625 [10:13<05:37, 1.56s/it]
Training 1/1 epoch (loss 2.7526): 65%|βββββββ | 409/625 [10:13<05:14, 1.46s/it]
Training 1/1 epoch (loss 2.8479): 65%|βββββββ | 409/625 [10:14<05:14, 1.46s/it]
Training 1/1 epoch (loss 2.8479): 66%|βββββββ | 410/625 [10:14<05:31, 1.54s/it]
Training 1/1 epoch (loss 2.2967): 66%|βββββββ | 410/625 [10:16<05:31, 1.54s/it]
Training 1/1 epoch (loss 2.2967): 66%|βββββββ | 411/625 [10:16<04:58, 1.40s/it]
Training 1/1 epoch (loss 2.6545): 66%|βββββββ | 411/625 [10:17<04:58, 1.40s/it]
Training 1/1 epoch (loss 2.6545): 66%|βββββββ | 412/625 [10:17<04:34, 1.29s/it]
Training 1/1 epoch (loss 2.6186): 66%|βββββββ | 412/625 [10:18<04:34, 1.29s/it]
Training 1/1 epoch (loss 2.6186): 66%|βββββββ | 413/625 [10:18<05:11, 1.47s/it]
Training 1/1 epoch (loss 2.7719): 66%|βββββββ | 413/625 [10:20<05:11, 1.47s/it]
Training 1/1 epoch (loss 2.7719): 66%|βββββββ | 414/625 [10:20<04:56, 1.40s/it]
Training 1/1 epoch (loss 2.5696): 66%|βββββββ | 414/625 [10:21<04:56, 1.40s/it]
Training 1/1 epoch (loss 2.5696): 66%|βββββββ | 415/625 [10:21<04:54, 1.40s/it]
Training 1/1 epoch (loss 2.4290): 66%|βββββββ | 415/625 [10:23<04:54, 1.40s/it]
Training 1/1 epoch (loss 2.4290): 67%|βββββββ | 416/625 [10:23<05:03, 1.45s/it]
Training 1/1 epoch (loss 2.7545): 67%|βββββββ | 416/625 [10:23<05:03, 1.45s/it]
Training 1/1 epoch (loss 2.7545): 67%|βββββββ | 417/625 [10:23<04:08, 1.20s/it]
Training 1/1 epoch (loss 2.6036): 67%|βββββββ | 417/625 [10:24<04:08, 1.20s/it]
Training 1/1 epoch (loss 2.6036): 67%|βββββββ | 418/625 [10:24<04:02, 1.17s/it]
Training 1/1 epoch (loss 2.7015): 67%|βββββββ | 418/625 [10:26<04:02, 1.17s/it]
Training 1/1 epoch (loss 2.7015): 67%|βββββββ | 419/625 [10:26<04:52, 1.42s/it]
Training 1/1 epoch (loss 2.6309): 67%|βββββββ | 419/625 [10:27<04:52, 1.42s/it]
Training 1/1 epoch (loss 2.6309): 67%|βββββββ | 420/625 [10:27<04:18, 1.26s/it]
Training 1/1 epoch (loss 2.4313): 67%|βββββββ | 420/625 [10:29<04:18, 1.26s/it]
Training 1/1 epoch (loss 2.4313): 67%|βββββββ | 421/625 [10:29<04:16, 1.26s/it]
Training 1/1 epoch (loss 2.6151): 67%|βββββββ | 421/625 [10:30<04:16, 1.26s/it]
Training 1/1 epoch (loss 2.6151): 68%|βββββββ | 422/625 [10:30<04:33, 1.35s/it]
Training 1/1 epoch (loss 2.6115): 68%|βββββββ | 422/625 [10:31<04:33, 1.35s/it]
Training 1/1 epoch (loss 2.6115): 68%|βββββββ | 423/625 [10:31<03:52, 1.15s/it]
Training 1/1 epoch (loss 2.7631): 68%|βββββββ | 423/625 [10:33<03:52, 1.15s/it]
Training 1/1 epoch (loss 2.7631): 68%|βββββββ | 424/625 [10:33<04:39, 1.39s/it]
Training 1/1 epoch (loss 2.6443): 68%|βββββββ | 424/625 [10:34<04:39, 1.39s/it]
Training 1/1 epoch (loss 2.6443): 68%|βββββββ | 425/625 [10:34<04:40, 1.40s/it]
Training 1/1 epoch (loss 2.5846): 68%|βββββββ | 425/625 [10:35<04:40, 1.40s/it]
Training 1/1 epoch (loss 2.5846): 68%|βββββββ | 426/625 [10:35<04:33, 1.38s/it]
Training 1/1 epoch (loss 2.5970): 68%|βββββββ | 426/625 [10:37<04:33, 1.38s/it]
Training 1/1 epoch (loss 2.5970): 68%|βββββββ | 427/625 [10:37<04:56, 1.50s/it]
Training 1/1 epoch (loss 2.5106): 68%|βββββββ | 427/625 [10:38<04:56, 1.50s/it]
Training 1/1 epoch (loss 2.5106): 68%|βββββββ | 428/625 [10:38<03:56, 1.20s/it]
Training 1/1 epoch (loss 2.6283): 68%|βββββββ | 428/625 [10:39<03:56, 1.20s/it]
Training 1/1 epoch (loss 2.6283): 69%|βββββββ | 429/625 [10:39<04:24, 1.35s/it]
Training 1/1 epoch (loss 2.8530): 69%|βββββββ | 429/625 [10:41<04:24, 1.35s/it]
Training 1/1 epoch (loss 2.8530): 69%|βββββββ | 430/625 [10:41<05:00, 1.54s/it]
Training 1/1 epoch (loss 2.6391): 69%|βββββββ | 430/625 [10:42<05:00, 1.54s/it]
Training 1/1 epoch (loss 2.6391): 69%|βββββββ | 431/625 [10:42<03:57, 1.22s/it]
Training 1/1 epoch (loss 2.3175): 69%|βββββββ | 431/625 [10:43<03:57, 1.22s/it]
Training 1/1 epoch (loss 2.3175): 69%|βββββββ | 432/625 [10:43<04:14, 1.32s/it]
Training 1/1 epoch (loss 2.2834): 69%|βββββββ | 432/625 [10:45<04:14, 1.32s/it]
Training 1/1 epoch (loss 2.2834): 69%|βββββββ | 433/625 [10:45<04:33, 1.42s/it]
Training 1/1 epoch (loss 2.5862): 69%|βββββββ | 433/625 [10:46<04:33, 1.42s/it]
Training 1/1 epoch (loss 2.5862): 69%|βββββββ | 434/625 [10:46<04:07, 1.30s/it]
Training 1/1 epoch (loss 2.7427): 69%|βββββββ | 434/625 [10:48<04:07, 1.30s/it]
Training 1/1 epoch (loss 2.7427): 70%|βββββββ | 435/625 [10:48<04:19, 1.37s/it]
Training 1/1 epoch (loss 2.6380): 70%|βββββββ | 435/625 [10:49<04:19, 1.37s/it]
Training 1/1 epoch (loss 2.6380): 70%|βββββββ | 436/625 [10:49<04:42, 1.49s/it]
Training 1/1 epoch (loss 2.5327): 70%|βββββββ | 436/625 [10:50<04:42, 1.49s/it]
Training 1/1 epoch (loss 2.5327): 70%|βββββββ | 437/625 [10:50<03:50, 1.23s/it]
Training 1/1 epoch (loss 2.8342): 70%|βββββββ | 437/625 [10:51<03:50, 1.23s/it]
Training 1/1 epoch (loss 2.8342): 70%|βββββββ | 438/625 [10:51<03:58, 1.28s/it]
Training 1/1 epoch (loss 2.6368): 70%|βββββββ | 438/625 [10:54<03:58, 1.28s/it]
Training 1/1 epoch (loss 2.6368): 70%|βββββββ | 439/625 [10:54<04:56, 1.60s/it]
Training 1/1 epoch (loss 2.7403): 70%|βββββββ | 439/625 [10:55<04:56, 1.60s/it]
Training 1/1 epoch (loss 2.7403): 70%|βββββββ | 440/625 [10:55<04:18, 1.40s/it]
Training 1/1 epoch (loss 2.6064): 70%|βββββββ | 440/625 [10:57<04:18, 1.40s/it]
Training 1/1 epoch (loss 2.6064): 71%|βββββββ | 441/625 [10:57<05:05, 1.66s/it]
Training 1/1 epoch (loss 2.6580): 71%|βββββββ | 441/625 [10:58<05:05, 1.66s/it]
Training 1/1 epoch (loss 2.6580): 71%|βββββββ | 442/625 [10:58<04:38, 1.52s/it]
Training 1/1 epoch (loss 2.5971): 71%|βββββββ | 442/625 [10:59<04:38, 1.52s/it]
Training 1/1 epoch (loss 2.5971): 71%|βββββββ | 443/625 [10:59<04:13, 1.39s/it]
Training 1/1 epoch (loss 2.5558): 71%|βββββββ | 443/625 [11:01<04:13, 1.39s/it]
Training 1/1 epoch (loss 2.5558): 71%|βββββββ | 444/625 [11:01<04:16, 1.42s/it]
Training 1/1 epoch (loss 2.5792): 71%|βββββββ | 444/625 [11:02<04:16, 1.42s/it]
Training 1/1 epoch (loss 2.5792): 71%|βββββββ | 445/625 [11:02<04:00, 1.34s/it]
Training 1/1 epoch (loss 2.7261): 71%|βββββββ | 445/625 [11:04<04:00, 1.34s/it]
Training 1/1 epoch (loss 2.7261): 71%|ββββββββ | 446/625 [11:04<05:01, 1.68s/it]
Training 1/1 epoch (loss 2.7103): 71%|ββββββββ | 446/625 [11:06<05:01, 1.68s/it]
Training 1/1 epoch (loss 2.7103): 72%|ββββββββ | 447/625 [11:06<04:52, 1.64s/it]
Training 1/1 epoch (loss 2.6052): 72%|ββββββββ | 447/625 [11:07<04:52, 1.64s/it]
Training 1/1 epoch (loss 2.6052): 72%|ββββββββ | 448/625 [11:07<04:02, 1.37s/it]
Training 1/1 epoch (loss 2.7320): 72%|ββββββββ | 448/625 [11:09<04:02, 1.37s/it]
Training 1/1 epoch (loss 2.7320): 72%|ββββββββ | 449/625 [11:09<04:29, 1.53s/it]
Training 1/1 epoch (loss 2.5389): 72%|ββββββββ | 449/625 [11:10<04:29, 1.53s/it]
Training 1/1 epoch (loss 2.5389): 72%|ββββββββ | 450/625 [11:10<04:34, 1.57s/it]
Training 1/1 epoch (loss 2.4020): 72%|ββββββββ | 450/625 [11:12<04:34, 1.57s/it]
Training 1/1 epoch (loss 2.4020): 72%|ββββββββ | 451/625 [11:12<04:44, 1.63s/it]
Training 1/1 epoch (loss 2.5383): 72%|ββββββββ | 451/625 [11:14<04:44, 1.63s/it]
Training 1/1 epoch (loss 2.5383): 72%|ββββββββ | 452/625 [11:14<04:39, 1.61s/it]
Training 1/1 epoch (loss 2.3971): 72%|ββββββββ | 452/625 [11:14<04:39, 1.61s/it]
Training 1/1 epoch (loss 2.3971): 72%|ββββββββ | 453/625 [11:14<03:47, 1.32s/it]
Training 1/1 epoch (loss 2.4180): 72%|ββββββββ | 453/625 [11:16<03:47, 1.32s/it]
Training 1/1 epoch (loss 2.4180): 73%|ββββββββ | 454/625 [11:16<04:25, 1.55s/it]
Training 1/1 epoch (loss 2.6847): 73%|ββββββββ | 454/625 [11:18<04:25, 1.55s/it]
Training 1/1 epoch (loss 2.6847): 73%|ββββββββ | 455/625 [11:18<04:21, 1.54s/it]
Training 1/1 epoch (loss 2.6379): 73%|ββββββββ | 455/625 [11:19<04:21, 1.54s/it]
Training 1/1 epoch (loss 2.6379): 73%|ββββββββ | 456/625 [11:19<03:56, 1.40s/it]
Training 1/1 epoch (loss 2.7473): 73%|ββββββββ | 456/625 [11:20<03:56, 1.40s/it]
Training 1/1 epoch (loss 2.7473): 73%|ββββββββ | 457/625 [11:20<03:49, 1.36s/it]
Training 1/1 epoch (loss 2.6685): 73%|ββββββββ | 457/625 [11:23<03:49, 1.36s/it]
Training 1/1 epoch (loss 2.6685): 73%|ββββββββ | 458/625 [11:23<04:35, 1.65s/it]
Training 1/1 epoch (loss 2.5748): 73%|ββββββββ | 458/625 [11:23<04:35, 1.65s/it]
Training 1/1 epoch (loss 2.5748): 73%|ββββββββ | 459/625 [11:23<03:49, 1.38s/it]
Training 1/1 epoch (loss 2.3663): 73%|ββββββββ | 459/625 [11:24<03:49, 1.38s/it]
Training 1/1 epoch (loss 2.3663): 74%|ββββββββ | 460/625 [11:24<03:23, 1.23s/it]
Training 1/1 epoch (loss 2.7717): 74%|ββββββββ | 460/625 [11:26<03:23, 1.23s/it]
Training 1/1 epoch (loss 2.7717): 74%|ββββββββ | 461/625 [11:26<03:59, 1.46s/it]
Training 1/1 epoch (loss 2.7178): 74%|ββββββββ | 461/625 [11:27<03:59, 1.46s/it]
Training 1/1 epoch (loss 2.7178): 74%|ββββββββ | 462/625 [11:27<03:14, 1.20s/it]
Training 1/1 epoch (loss 2.5671): 74%|ββββββββ | 462/625 [11:29<03:14, 1.20s/it]
Training 1/1 epoch (loss 2.5671): 74%|ββββββββ | 463/625 [11:29<04:18, 1.60s/it]
Training 1/1 epoch (loss 2.7997): 74%|ββββββββ | 463/625 [11:31<04:18, 1.60s/it]
Training 1/1 epoch (loss 2.7997): 74%|ββββββββ | 464/625 [11:31<04:26, 1.66s/it]
Training 1/1 epoch (loss 2.6376): 74%|ββββββββ | 464/625 [11:32<04:26, 1.66s/it]
Training 1/1 epoch (loss 2.6376): 74%|ββββββββ | 465/625 [11:32<03:42, 1.39s/it]
Training 1/1 epoch (loss 2.7795): 74%|ββββββββ | 465/625 [11:34<03:42, 1.39s/it]
Training 1/1 epoch (loss 2.7795): 75%|ββββββββ | 466/625 [11:34<04:08, 1.56s/it]
Training 1/1 epoch (loss 2.5309): 75%|ββββββββ | 466/625 [11:35<04:08, 1.56s/it]
Training 1/1 epoch (loss 2.5309): 75%|ββββββββ | 467/625 [11:35<03:46, 1.43s/it]
Training 1/1 epoch (loss 2.6092): 75%|ββββββββ | 467/625 [11:36<03:46, 1.43s/it]
Training 1/1 epoch (loss 2.6092): 75%|ββββββββ | 468/625 [11:36<03:30, 1.34s/it]
Training 1/1 epoch (loss 2.8611): 75%|ββββββββ | 468/625 [11:38<03:30, 1.34s/it]
Training 1/1 epoch (loss 2.8611): 75%|ββββββββ | 469/625 [11:38<04:11, 1.61s/it]
Training 1/1 epoch (loss 2.6315): 75%|ββββββββ | 469/625 [11:40<04:11, 1.61s/it]
Training 1/1 epoch (loss 2.6315): 75%|ββββββββ | 470/625 [11:40<03:59, 1.54s/it]
Training 1/1 epoch (loss 2.5524): 75%|ββββββββ | 470/625 [11:41<03:59, 1.54s/it]
Training 1/1 epoch (loss 2.5524): 75%|ββββββββ | 471/625 [11:41<03:53, 1.52s/it]
Training 1/1 epoch (loss 2.7999): 75%|ββββββββ | 471/625 [11:43<03:53, 1.52s/it]
Training 1/1 epoch (loss 2.7999): 76%|ββββββββ | 472/625 [11:43<04:18, 1.69s/it]
Training 1/1 epoch (loss 2.3648): 76%|ββββββββ | 472/625 [11:44<04:18, 1.69s/it]
Training 1/1 epoch (loss 2.3648): 76%|ββββββββ | 473/625 [11:44<03:25, 1.35s/it]
Training 1/1 epoch (loss 2.7794): 76%|ββββββββ | 473/625 [11:46<03:25, 1.35s/it]
Training 1/1 epoch (loss 2.7794): 76%|ββββββββ | 474/625 [11:46<04:07, 1.64s/it]
Training 1/1 epoch (loss 2.8001): 76%|ββββββββ | 474/625 [11:48<04:07, 1.64s/it]
Training 1/1 epoch (loss 2.8001): 76%|ββββββββ | 475/625 [11:48<04:23, 1.76s/it]
Training 1/1 epoch (loss 2.8485): 76%|ββββββββ | 475/625 [11:50<04:23, 1.76s/it]
Training 1/1 epoch (loss 2.8485): 76%|ββββββββ | 476/625 [11:50<04:04, 1.64s/it]
Training 1/1 epoch (loss 2.5657): 76%|ββββββββ | 476/625 [11:51<04:04, 1.64s/it]
Training 1/1 epoch (loss 2.5657): 76%|ββββββββ | 477/625 [11:51<04:06, 1.67s/it]
Training 1/1 epoch (loss 2.5572): 76%|ββββββββ | 477/625 [11:53<04:06, 1.67s/it]
Training 1/1 epoch (loss 2.5572): 76%|ββββββββ | 478/625 [11:53<03:49, 1.56s/it]
Training 1/1 epoch (loss 2.5822): 76%|ββββββββ | 478/625 [11:54<03:49, 1.56s/it]
Training 1/1 epoch (loss 2.5822): 77%|ββββββββ | 479/625 [11:54<03:58, 1.63s/it]
Training 1/1 epoch (loss 2.6627): 77%|ββββββββ | 479/625 [11:56<03:58, 1.63s/it]
Training 1/1 epoch (loss 2.6627): 77%|ββββββββ | 480/625 [11:56<04:09, 1.72s/it]
Training 1/1 epoch (loss 2.5973): 77%|ββββββββ | 480/625 [11:57<04:09, 1.72s/it]
Training 1/1 epoch (loss 2.5973): 77%|ββββββββ | 481/625 [11:57<03:22, 1.40s/it]
Training 1/1 epoch (loss 2.3979): 77%|ββββββββ | 481/625 [11:59<03:22, 1.40s/it]
Training 1/1 epoch (loss 2.3979): 77%|ββββββββ | 482/625 [11:59<03:50, 1.61s/it]
Training 1/1 epoch (loss 2.5550): 77%|ββββββββ | 482/625 [12:01<03:50, 1.61s/it]
Training 1/1 epoch (loss 2.5550): 77%|ββββββββ | 483/625 [12:01<03:48, 1.61s/it]
Training 1/1 epoch (loss 2.5552): 77%|ββββββββ | 483/625 [12:01<03:48, 1.61s/it]
Training 1/1 epoch (loss 2.5552): 77%|ββββββββ | 484/625 [12:01<03:08, 1.34s/it]
Training 1/1 epoch (loss 2.5636): 77%|ββββββββ | 484/625 [12:03<03:08, 1.34s/it]
Training 1/1 epoch (loss 2.5636): 78%|ββββββββ | 485/625 [12:03<03:15, 1.39s/it]
Training 1/1 epoch (loss 2.5894): 78%|ββββββββ | 485/625 [12:04<03:15, 1.39s/it]
Training 1/1 epoch (loss 2.5894): 78%|ββββββββ | 486/625 [12:04<03:19, 1.44s/it]
Training 1/1 epoch (loss 2.7178): 78%|ββββββββ | 486/625 [12:06<03:19, 1.44s/it]
Training 1/1 epoch (loss 2.7178): 78%|ββββββββ | 487/625 [12:06<03:09, 1.37s/it]
Training 1/1 epoch (loss 2.5259): 78%|ββββββββ | 487/625 [12:08<03:09, 1.37s/it]
Training 1/1 epoch (loss 2.5259): 78%|ββββββββ | 488/625 [12:08<03:34, 1.57s/it]
Training 1/1 epoch (loss 2.5658): 78%|ββββββββ | 488/625 [12:09<03:34, 1.57s/it]
Training 1/1 epoch (loss 2.5658): 78%|ββββββββ | 489/625 [12:09<03:05, 1.36s/it]
Training 1/1 epoch (loss 2.6501): 78%|ββββββββ | 489/625 [12:10<03:05, 1.36s/it]
Training 1/1 epoch (loss 2.6501): 78%|ββββββββ | 490/625 [12:10<03:10, 1.41s/it]
Training 1/1 epoch (loss 2.6262): 78%|ββββββββ | 490/625 [12:12<03:10, 1.41s/it]
Training 1/1 epoch (loss 2.6262): 79%|ββββββββ | 491/625 [12:12<03:28, 1.56s/it]
Training 1/1 epoch (loss 2.4666): 79%|ββββββββ | 491/625 [12:12<03:28, 1.56s/it]
Training 1/1 epoch (loss 2.4666): 79%|ββββββββ | 492/625 [12:12<02:46, 1.25s/it]
Training 1/1 epoch (loss 2.5360): 79%|ββββββββ | 492/625 [12:14<02:46, 1.25s/it]
Training 1/1 epoch (loss 2.5360): 79%|ββββββββ | 493/625 [12:14<03:04, 1.39s/it]
Training 1/1 epoch (loss 2.3829): 79%|ββββββββ | 493/625 [12:16<03:04, 1.39s/it]
Training 1/1 epoch (loss 2.3829): 79%|ββββββββ | 494/625 [12:16<03:29, 1.60s/it]
Training 1/1 epoch (loss 2.3693): 79%|ββββββββ | 494/625 [12:17<03:29, 1.60s/it]
Training 1/1 epoch (loss 2.3693): 79%|ββββββββ | 495/625 [12:17<02:51, 1.32s/it]
Training 1/1 epoch (loss 2.5875): 79%|ββββββββ | 495/625 [12:19<02:51, 1.32s/it]
Training 1/1 epoch (loss 2.5875): 79%|ββββββββ | 496/625 [12:19<03:17, 1.53s/it]
Training 1/1 epoch (loss 2.7002): 79%|ββββββββ | 496/625 [12:20<03:17, 1.53s/it]
Training 1/1 epoch (loss 2.7002): 80%|ββββββββ | 497/625 [12:20<03:07, 1.47s/it]
Training 1/1 epoch (loss 2.5149): 80%|ββββββββ | 497/625 [12:22<03:07, 1.47s/it]
Training 1/1 epoch (loss 2.5149): 80%|ββββββββ | 498/625 [12:22<03:00, 1.43s/it]
Training 1/1 epoch (loss 2.8006): 80%|ββββββββ | 498/625 [12:23<03:00, 1.43s/it]
Training 1/1 epoch (loss 2.8006): 80%|ββββββββ | 499/625 [12:23<03:15, 1.55s/it]
Training 1/1 epoch (loss 2.4894): 80%|ββββββββ | 499/625 [12:24<03:15, 1.55s/it]
Training 1/1 epoch (loss 2.4894): 80%|ββββββββ | 500/625 [12:24<02:44, 1.31s/it]
Training 1/1 epoch (loss 2.5536): 80%|ββββββββ | 500/625 [12:26<02:44, 1.31s/it]
Training 1/1 epoch (loss 2.5536): 80%|ββββββββ | 501/625 [12:26<03:08, 1.52s/it]
Training 1/1 epoch (loss 2.6028): 80%|ββββββββ | 501/625 [12:28<03:08, 1.52s/it]
Training 1/1 epoch (loss 2.6028): 80%|ββββββββ | 502/625 [12:28<02:58, 1.45s/it]
Training 1/1 epoch (loss 2.5656): 80%|ββββββββ | 502/625 [12:28<02:58, 1.45s/it]
Training 1/1 epoch (loss 2.5656): 80%|ββββββββ | 503/625 [12:28<02:25, 1.19s/it]
Training 1/1 epoch (loss 2.7300): 80%|ββββββββ | 503/625 [12:31<02:25, 1.19s/it]
Training 1/1 epoch (loss 2.7300): 81%|ββββββββ | 504/625 [12:31<03:09, 1.57s/it]
Training 1/1 epoch (loss 2.6421): 81%|ββββββββ | 504/625 [12:31<03:09, 1.57s/it]
Training 1/1 epoch (loss 2.6421): 81%|ββββββββ | 505/625 [12:31<02:39, 1.33s/it]
Training 1/1 epoch (loss 2.7151): 81%|ββββββββ | 505/625 [12:33<02:39, 1.33s/it]
Training 1/1 epoch (loss 2.7151): 81%|ββββββββ | 506/625 [12:33<03:03, 1.54s/it]
Training 1/1 epoch (loss 2.4261): 81%|ββββββββ | 506/625 [12:35<03:03, 1.54s/it]
Training 1/1 epoch (loss 2.4261): 81%|ββββββββ | 507/625 [12:35<03:10, 1.61s/it]
Training 1/1 epoch (loss 2.4511): 81%|ββββββββ | 507/625 [12:36<03:10, 1.61s/it]
Training 1/1 epoch (loss 2.4511): 81%|βββββββββ | 508/625 [12:36<02:37, 1.35s/it]
Training 1/1 epoch (loss 2.5525): 81%|βββββββββ | 508/625 [12:37<02:37, 1.35s/it]
Training 1/1 epoch (loss 2.5525): 81%|βββββββββ | 509/625 [12:37<02:42, 1.40s/it]
Training 1/1 epoch (loss 2.5704): 81%|βββββββββ | 509/625 [12:39<02:42, 1.40s/it]
Training 1/1 epoch (loss 2.5704): 82%|βββββββββ | 510/625 [12:39<02:33, 1.34s/it]
Training 1/1 epoch (loss 2.7444): 82%|βββββββββ | 510/625 [12:39<02:33, 1.34s/it]
Training 1/1 epoch (loss 2.7444): 82%|βββββββββ | 511/625 [12:39<02:14, 1.18s/it]
Training 1/1 epoch (loss 2.3659): 82%|βββββββββ | 511/625 [12:42<02:14, 1.18s/it]
Training 1/1 epoch (loss 2.3659): 82%|βββββββββ | 512/625 [12:42<02:49, 1.50s/it]
Training 1/1 epoch (loss 2.4238): 82%|βββββββββ | 512/625 [12:43<02:49, 1.50s/it]
Training 1/1 epoch (loss 2.4238): 82%|βββββββββ | 513/625 [12:43<02:37, 1.40s/it]
Training 1/1 epoch (loss 2.5829): 82%|βββββββββ | 513/625 [12:44<02:37, 1.40s/it]
Training 1/1 epoch (loss 2.5829): 82%|βββββββββ | 514/625 [12:44<02:20, 1.27s/it]
Training 1/1 epoch (loss 2.5557): 82%|βββββββββ | 514/625 [12:46<02:20, 1.27s/it]
Training 1/1 epoch (loss 2.5557): 82%|βββββββββ | 515/625 [12:46<02:59, 1.63s/it]
Training 1/1 epoch (loss 2.6952): 82%|βββββββββ | 515/625 [12:47<02:59, 1.63s/it]
Training 1/1 epoch (loss 2.6952): 83%|βββββββββ | 516/625 [12:47<02:36, 1.43s/it]
Training 1/1 epoch (loss 2.5333): 83%|βββββββββ | 516/625 [12:49<02:36, 1.43s/it]
Training 1/1 epoch (loss 2.5333): 83%|βββββββββ | 517/625 [12:49<02:37, 1.46s/it]
Training 1/1 epoch (loss 2.7485): 83%|βββββββββ | 517/625 [12:50<02:37, 1.46s/it]
Training 1/1 epoch (loss 2.7485): 83%|βββββββββ | 518/625 [12:50<02:43, 1.53s/it]
Training 1/1 epoch (loss 2.7427): 83%|βββββββββ | 518/625 [12:51<02:43, 1.53s/it]
Training 1/1 epoch (loss 2.7427): 83%|βββββββββ | 519/625 [12:51<02:11, 1.24s/it]
Training 1/1 epoch (loss 2.7713): 83%|βββββββββ | 519/625 [12:52<02:11, 1.24s/it]
Training 1/1 epoch (loss 2.7713): 83%|βββββββββ | 520/625 [12:52<02:17, 1.31s/it]
Training 1/1 epoch (loss 2.7463): 83%|βββββββββ | 520/625 [12:54<02:17, 1.31s/it]
Training 1/1 epoch (loss 2.7463): 83%|βββββββββ | 521/625 [12:54<02:33, 1.47s/it]
Training 1/1 epoch (loss 2.5813): 83%|βββββββββ | 521/625 [12:55<02:33, 1.47s/it]
Training 1/1 epoch (loss 2.5813): 84%|βββββββββ | 522/625 [12:55<02:09, 1.25s/it]
Training 1/1 epoch (loss 2.6409): 84%|βββββββββ | 522/625 [12:57<02:09, 1.25s/it]
Training 1/1 epoch (loss 2.6409): 84%|βββββββββ | 523/625 [12:57<02:43, 1.61s/it]
Training 1/1 epoch (loss 2.5240): 84%|βββββββββ | 523/625 [12:59<02:43, 1.61s/it]
Training 1/1 epoch (loss 2.5240): 84%|βββββββββ | 524/625 [12:59<02:52, 1.71s/it]
Training 1/1 epoch (loss 2.6811): 84%|βββββββββ | 524/625 [13:00<02:52, 1.71s/it]
Training 1/1 epoch (loss 2.6811): 84%|βββββββββ | 525/625 [13:00<02:14, 1.34s/it]
Training 1/1 epoch (loss 2.8201): 84%|βββββββββ | 525/625 [13:02<02:14, 1.34s/it]
Training 1/1 epoch (loss 2.8201): 84%|βββββββββ | 526/625 [13:02<02:37, 1.60s/it]
Training 1/1 epoch (loss 2.6825): 84%|βββββββββ | 526/625 [13:04<02:37, 1.60s/it]
Training 1/1 epoch (loss 2.6825): 84%|βββββββββ | 527/625 [13:04<02:46, 1.70s/it]
Training 1/1 epoch (loss 2.5792): 84%|βββββββββ | 527/625 [13:05<02:46, 1.70s/it]
Training 1/1 epoch (loss 2.5792): 84%|βββββββββ | 528/625 [13:05<02:11, 1.35s/it]
Training 1/1 epoch (loss 2.5793): 84%|βββββββββ | 528/625 [13:07<02:11, 1.35s/it]
Training 1/1 epoch (loss 2.5793): 85%|βββββββββ | 529/625 [13:07<02:32, 1.59s/it]
Training 1/1 epoch (loss 2.4885): 85%|βββββββββ | 529/625 [13:08<02:32, 1.59s/it]
Training 1/1 epoch (loss 2.4885): 85%|βββββββββ | 530/625 [13:08<02:35, 1.64s/it]
Training 1/1 epoch (loss 2.7660): 85%|βββββββββ | 530/625 [13:10<02:35, 1.64s/it]
Training 1/1 epoch (loss 2.7660): 85%|βββββββββ | 531/625 [13:10<02:25, 1.55s/it]
Training 1/1 epoch (loss 2.5172): 85%|βββββββββ | 531/625 [13:11<02:25, 1.55s/it]
Training 1/1 epoch (loss 2.5172): 85%|βββββββββ | 532/625 [13:11<02:24, 1.55s/it]
Training 1/1 epoch (loss 2.6863): 85%|βββββββββ | 532/625 [13:12<02:24, 1.55s/it]
Training 1/1 epoch (loss 2.6863): 85%|βββββββββ | 533/625 [13:12<02:03, 1.34s/it]
Training 1/1 epoch (loss 2.7519): 85%|βββββββββ | 533/625 [13:13<02:03, 1.34s/it]
Training 1/1 epoch (loss 2.7519): 85%|βββββββββ | 534/625 [13:13<01:53, 1.25s/it]
Training 1/1 epoch (loss 2.5159): 85%|βββββββββ | 534/625 [13:16<01:53, 1.25s/it]
Training 1/1 epoch (loss 2.5159): 86%|βββββββββ | 535/625 [13:16<02:21, 1.57s/it]
Training 1/1 epoch (loss 2.6469): 86%|βββββββββ | 535/625 [13:16<02:21, 1.57s/it]
Training 1/1 epoch (loss 2.6469): 86%|βββββββββ | 536/625 [13:16<01:56, 1.30s/it]
Training 1/1 epoch (loss 2.7431): 86%|βββββββββ | 536/625 [13:18<01:56, 1.30s/it]
Training 1/1 epoch (loss 2.7431): 86%|βββββββββ | 537/625 [13:18<02:13, 1.51s/it]
Training 1/1 epoch (loss 2.5341): 86%|βββββββββ | 537/625 [13:20<02:13, 1.51s/it]
Training 1/1 epoch (loss 2.5341): 86%|βββββββββ | 538/625 [13:20<02:08, 1.48s/it]
Training 1/1 epoch (loss 2.5660): 86%|βββββββββ | 538/625 [13:21<02:08, 1.48s/it]
Training 1/1 epoch (loss 2.5660): 86%|βββββββββ | 539/625 [13:21<01:51, 1.30s/it]
Training 1/1 epoch (loss 2.4870): 86%|βββββββββ | 539/625 [13:22<01:51, 1.30s/it]
Training 1/1 epoch (loss 2.4870): 86%|βββββββββ | 540/625 [13:22<01:55, 1.35s/it]
Training 1/1 epoch (loss 2.4627): 86%|βββββββββ | 540/625 [13:23<01:55, 1.35s/it]
Training 1/1 epoch (loss 2.4627): 87%|βββββββββ | 541/625 [13:23<01:37, 1.16s/it]
Training 1/1 epoch (loss 2.5916): 87%|βββββββββ | 541/625 [13:25<01:37, 1.16s/it]
Training 1/1 epoch (loss 2.5916): 87%|βββββββββ | 542/625 [13:25<01:58, 1.43s/it]
Training 1/1 epoch (loss 2.4944): 87%|βββββββββ | 542/625 [13:27<01:58, 1.43s/it]
Training 1/1 epoch (loss 2.4944): 87%|βββββββββ | 543/625 [13:27<02:05, 1.53s/it]
Training 1/1 epoch (loss 2.6944): 87%|βββββββββ | 543/625 [13:27<02:05, 1.53s/it]
Training 1/1 epoch (loss 2.6944): 87%|βββββββββ | 544/625 [13:27<01:43, 1.27s/it]
Training 1/1 epoch (loss 2.8396): 87%|βββββββββ | 544/625 [13:30<01:43, 1.27s/it]
Training 1/1 epoch (loss 2.8396): 87%|βββββββββ | 545/625 [13:30<02:09, 1.62s/it]
Training 1/1 epoch (loss 2.5891): 87%|βββββββββ | 545/625 [13:31<02:09, 1.62s/it]
Training 1/1 epoch (loss 2.5891): 87%|βββββββββ | 546/625 [13:31<02:02, 1.55s/it]
Training 1/1 epoch (loss 2.5052): 87%|βββββββββ | 546/625 [13:32<02:02, 1.55s/it]
Training 1/1 epoch (loss 2.5052): 88%|βββββββββ | 547/625 [13:32<01:43, 1.32s/it]
Training 1/1 epoch (loss 2.3657): 88%|βββββββββ | 547/625 [13:34<01:43, 1.32s/it]
Training 1/1 epoch (loss 2.3657): 88%|βββββββββ | 548/625 [13:34<01:54, 1.49s/it]
Training 1/1 epoch (loss 2.5837): 88%|βββββββββ | 548/625 [13:35<01:54, 1.49s/it]
Training 1/1 epoch (loss 2.5837): 88%|βββββββββ | 549/625 [13:35<01:50, 1.46s/it]
Training 1/1 epoch (loss 2.4390): 88%|βββββββββ | 549/625 [13:36<01:50, 1.46s/it]
Training 1/1 epoch (loss 2.4390): 88%|βββββββββ | 550/625 [13:36<01:34, 1.26s/it]
Training 1/1 epoch (loss 2.7633): 88%|βββββββββ | 550/625 [13:38<01:34, 1.26s/it]
Training 1/1 epoch (loss 2.7633): 88%|βββββββββ | 551/625 [13:38<02:00, 1.63s/it]
Training 1/1 epoch (loss 2.5023): 88%|βββββββββ | 551/625 [13:39<02:00, 1.63s/it]
Training 1/1 epoch (loss 2.5023): 88%|βββββββββ | 552/625 [13:39<01:40, 1.38s/it]
Training 1/1 epoch (loss 2.5037): 88%|βββββββββ | 552/625 [13:41<01:40, 1.38s/it]
Training 1/1 epoch (loss 2.5037): 88%|βββββββββ | 553/625 [13:41<01:41, 1.42s/it]
Training 1/1 epoch (loss 2.5889): 88%|βββββββββ | 553/625 [13:43<01:41, 1.42s/it]
Training 1/1 epoch (loss 2.5889): 89%|βββββββββ | 554/625 [13:43<01:50, 1.56s/it]
Training 1/1 epoch (loss 2.6072): 89%|βββββββββ | 554/625 [13:44<01:50, 1.56s/it]
Training 1/1 epoch (loss 2.6072): 89%|βββββββββ | 555/625 [13:44<01:43, 1.47s/it]
Training 1/1 epoch (loss 2.4378): 89%|βββββββββ | 555/625 [13:46<01:43, 1.47s/it]
Training 1/1 epoch (loss 2.4378): 89%|βββββββββ | 556/625 [13:46<01:49, 1.59s/it]
Training 1/1 epoch (loss 2.5671): 89%|βββββββββ | 556/625 [13:47<01:49, 1.59s/it]
Training 1/1 epoch (loss 2.5671): 89%|βββββββββ | 557/625 [13:47<01:38, 1.45s/it]
Training 1/1 epoch (loss 2.6584): 89%|βββββββββ | 557/625 [13:48<01:38, 1.45s/it]
Training 1/1 epoch (loss 2.6584): 89%|βββββββββ | 558/625 [13:48<01:38, 1.47s/it]
Training 1/1 epoch (loss 2.4287): 89%|βββββββββ | 558/625 [13:50<01:38, 1.47s/it]
Training 1/1 epoch (loss 2.4287): 89%|βββββββββ | 559/625 [13:50<01:46, 1.61s/it]
Training 1/1 epoch (loss 2.6360): 89%|βββββββββ | 559/625 [13:51<01:46, 1.61s/it]
Training 1/1 epoch (loss 2.6360): 90%|βββββββββ | 560/625 [13:51<01:31, 1.41s/it]
Training 1/1 epoch (loss 2.6696): 90%|βββββββββ | 560/625 [13:53<01:31, 1.41s/it]
Training 1/1 epoch (loss 2.6696): 90%|βββββββββ | 561/625 [13:53<01:44, 1.63s/it]
Training 1/1 epoch (loss 2.7206): 90%|βββββββββ | 561/625 [13:55<01:44, 1.63s/it]
Training 1/1 epoch (loss 2.7206): 90%|βββββββββ | 562/625 [13:55<01:32, 1.47s/it]
Training 1/1 epoch (loss 2.5959): 90%|βββββββββ | 562/625 [13:55<01:32, 1.47s/it]
Training 1/1 epoch (loss 2.5959): 90%|βββββββββ | 563/625 [13:55<01:15, 1.21s/it]
Training 1/1 epoch (loss 2.3376): 90%|βββββββββ | 563/625 [13:56<01:15, 1.21s/it]
Training 1/1 epoch (loss 2.3376): 90%|βββββββββ | 564/625 [13:56<01:16, 1.25s/it]
Training 1/1 epoch (loss 2.5267): 90%|βββββββββ | 564/625 [13:58<01:16, 1.25s/it]
Training 1/1 epoch (loss 2.5267): 90%|βββββββββ | 565/625 [13:58<01:11, 1.20s/it]
Training 1/1 epoch (loss 2.5655): 90%|βββββββββ | 565/625 [13:58<01:11, 1.20s/it]
Training 1/1 epoch (loss 2.5655): 91%|βββββββββ | 566/625 [13:58<01:04, 1.09s/it]
Training 1/1 epoch (loss 2.4509): 91%|βββββββββ | 566/625 [14:00<01:04, 1.09s/it]
Training 1/1 epoch (loss 2.4509): 91%|βββββββββ | 567/625 [14:00<01:16, 1.31s/it]
Training 1/1 epoch (loss 2.5412): 91%|βββββββββ | 567/625 [14:01<01:16, 1.31s/it]
Training 1/1 epoch (loss 2.5412): 91%|βββββββββ | 568/625 [14:01<01:14, 1.30s/it]
Training 1/1 epoch (loss 2.7402): 91%|βββββββββ | 568/625 [14:02<01:14, 1.30s/it]
Training 1/1 epoch (loss 2.7402): 91%|βββββββββ | 569/625 [14:02<01:03, 1.14s/it]
Training 1/1 epoch (loss 2.7738): 91%|βββββββββ | 569/625 [14:05<01:03, 1.14s/it]
Training 1/1 epoch (loss 2.7738): 91%|βββββββββ | 570/625 [14:05<01:25, 1.56s/it]
Training 1/1 epoch (loss 2.7483): 91%|βββββββββ | 570/625 [14:06<01:25, 1.56s/it]
Training 1/1 epoch (loss 2.7483): 91%|ββββββββββ| 571/625 [14:06<01:13, 1.36s/it]
Training 1/1 epoch (loss 2.7684): 91%|ββββββββββ| 571/625 [14:07<01:13, 1.36s/it]
Training 1/1 epoch (loss 2.7684): 92%|ββββββββββ| 572/625 [14:07<01:08, 1.29s/it]
Training 1/1 epoch (loss 2.5513): 92%|ββββββββββ| 572/625 [14:08<01:08, 1.29s/it]
Training 1/1 epoch (loss 2.5513): 92%|ββββββββββ| 573/625 [14:08<01:10, 1.36s/it]
Training 1/1 epoch (loss 2.3920): 92%|ββββββββββ| 573/625 [14:09<01:10, 1.36s/it]
Training 1/1 epoch (loss 2.3920): 92%|ββββββββββ| 574/625 [14:09<01:01, 1.20s/it]
Training 1/1 epoch (loss 2.4038): 92%|ββββββββββ| 574/625 [14:11<01:01, 1.20s/it]
Training 1/1 epoch (loss 2.4038): 92%|ββββββββββ| 575/625 [14:11<01:17, 1.54s/it]
Training 1/1 epoch (loss 2.3916): 92%|ββββββββββ| 575/625 [14:13<01:17, 1.54s/it]
Training 1/1 epoch (loss 2.3916): 92%|ββββββββββ| 576/625 [14:13<01:14, 1.53s/it]
Training 1/1 epoch (loss 2.5998): 92%|ββββββββββ| 576/625 [14:14<01:14, 1.53s/it]
Training 1/1 epoch (loss 2.5998): 92%|ββββββββββ| 577/625 [14:14<01:07, 1.41s/it]
Training 1/1 epoch (loss 2.5704): 92%|ββββββββββ| 577/625 [14:15<01:07, 1.41s/it]
Training 1/1 epoch (loss 2.5704): 92%|ββββββββββ| 578/625 [14:15<01:04, 1.36s/it]
Training 1/1 epoch (loss 2.8403): 92%|ββββββββββ| 578/625 [14:16<01:04, 1.36s/it]
Training 1/1 epoch (loss 2.8403): 93%|ββββββββββ| 579/625 [14:16<00:56, 1.23s/it]
Training 1/1 epoch (loss 2.5626): 93%|ββββββββββ| 579/625 [14:18<00:56, 1.23s/it]
Training 1/1 epoch (loss 2.5626): 93%|ββββββββββ| 580/625 [14:18<00:58, 1.30s/it]
Training 1/1 epoch (loss 2.5523): 93%|ββββββββββ| 580/625 [14:20<00:58, 1.30s/it]
Training 1/1 epoch (loss 2.5523): 93%|ββββββββββ| 581/625 [14:20<01:03, 1.45s/it]
Training 1/1 epoch (loss 2.6252): 93%|ββββββββββ| 581/625 [14:21<01:03, 1.45s/it]
Training 1/1 epoch (loss 2.6252): 93%|ββββββββββ| 582/625 [14:21<01:01, 1.42s/it]
Training 1/1 epoch (loss 2.7541): 93%|ββββββββββ| 582/625 [14:22<01:01, 1.42s/it]
Training 1/1 epoch (loss 2.7541): 93%|ββββββββββ| 583/625 [14:22<00:57, 1.36s/it]
Training 1/1 epoch (loss 2.3823): 93%|ββββββββββ| 583/625 [14:24<00:57, 1.36s/it]
Training 1/1 epoch (loss 2.3823): 93%|ββββββββββ| 584/625 [14:24<01:01, 1.51s/it]
Training 1/1 epoch (loss 2.5739): 93%|ββββββββββ| 584/625 [14:25<01:01, 1.51s/it]
Training 1/1 epoch (loss 2.5739): 94%|ββββββββββ| 585/625 [14:25<00:49, 1.24s/it]
Training 1/1 epoch (loss 2.4541): 94%|ββββββββββ| 585/625 [14:26<00:49, 1.24s/it]
Training 1/1 epoch (loss 2.4541): 94%|ββββββββββ| 586/625 [14:26<00:49, 1.27s/it]
Training 1/1 epoch (loss 2.7613): 94%|ββββββββββ| 586/625 [14:28<00:49, 1.27s/it]
Training 1/1 epoch (loss 2.7613): 94%|ββββββββββ| 587/625 [14:28<00:54, 1.43s/it]
Training 1/1 epoch (loss 2.6122): 94%|ββββββββββ| 587/625 [14:28<00:54, 1.43s/it]
Training 1/1 epoch (loss 2.6122): 94%|ββββββββββ| 588/625 [14:28<00:43, 1.19s/it]
Training 1/1 epoch (loss 2.5836): 94%|ββββββββββ| 588/625 [14:30<00:43, 1.19s/it]
Training 1/1 epoch (loss 2.5836): 94%|ββββββββββ| 589/625 [14:30<00:47, 1.33s/it]
Training 1/1 epoch (loss 2.5075): 94%|ββββββββββ| 589/625 [14:32<00:47, 1.33s/it]
Training 1/1 epoch (loss 2.5075): 94%|ββββββββββ| 590/625 [14:32<00:56, 1.61s/it]
Training 1/1 epoch (loss 2.8792): 94%|ββββββββββ| 590/625 [14:33<00:56, 1.61s/it]
Training 1/1 epoch (loss 2.8792): 95%|ββββββββββ| 591/625 [14:33<00:46, 1.37s/it]
Training 1/1 epoch (loss 2.4610): 95%|ββββββββββ| 591/625 [14:35<00:46, 1.37s/it]
Training 1/1 epoch (loss 2.4610): 95%|ββββββββββ| 592/625 [14:35<00:54, 1.65s/it]
Training 1/1 epoch (loss 2.5690): 95%|ββββββββββ| 592/625 [14:37<00:54, 1.65s/it]
Training 1/1 epoch (loss 2.5690): 95%|ββββββββββ| 593/625 [14:37<00:49, 1.54s/it]
Training 1/1 epoch (loss 2.6187): 95%|ββββββββββ| 593/625 [14:39<00:49, 1.54s/it]
Training 1/1 epoch (loss 2.6187): 95%|ββββββββββ| 594/625 [14:39<00:54, 1.77s/it]
Training 1/1 epoch (loss 2.4698): 95%|ββββββββββ| 594/625 [14:41<00:54, 1.77s/it]
Training 1/1 epoch (loss 2.4698): 95%|ββββββββββ| 595/625 [14:41<00:55, 1.86s/it]
Training 1/1 epoch (loss 2.4868): 95%|ββββββββββ| 595/625 [14:41<00:55, 1.86s/it]
Training 1/1 epoch (loss 2.4868): 95%|ββββββββββ| 596/625 [14:41<00:41, 1.43s/it]
Training 1/1 epoch (loss 2.4728): 95%|ββββββββββ| 596/625 [14:44<00:41, 1.43s/it]
Training 1/1 epoch (loss 2.4728): 96%|ββββββββββ| 597/625 [14:44<00:46, 1.65s/it]
Training 1/1 epoch (loss 2.6100): 96%|ββββββββββ| 597/625 [14:45<00:46, 1.65s/it]
Training 1/1 epoch (loss 2.6100): 96%|ββββββββββ| 598/625 [14:45<00:44, 1.64s/it]
Training 1/1 epoch (loss 2.6433): 96%|ββββββββββ| 598/625 [14:46<00:44, 1.64s/it]
Training 1/1 epoch (loss 2.6433): 96%|ββββββββββ| 599/625 [14:46<00:34, 1.34s/it]
Training 1/1 epoch (loss 2.6095): 96%|ββββββββββ| 599/625 [14:48<00:34, 1.34s/it]
Training 1/1 epoch (loss 2.6095): 96%|ββββββββββ| 600/625 [14:48<00:38, 1.53s/it]
Training 1/1 epoch (loss 2.4736): 96%|ββββββββββ| 600/625 [14:49<00:38, 1.53s/it]
Training 1/1 epoch (loss 2.4736): 96%|ββββββββββ| 601/625 [14:49<00:32, 1.35s/it]
Training 1/1 epoch (loss 2.5780): 96%|ββββββββββ| 601/625 [14:50<00:32, 1.35s/it]
Training 1/1 epoch (loss 2.5780): 96%|ββββββββββ| 602/625 [14:50<00:29, 1.28s/it]
Training 1/1 epoch (loss 2.5002): 96%|ββββββββββ| 602/625 [14:51<00:29, 1.28s/it]
Training 1/1 epoch (loss 2.5002): 96%|ββββββββββ| 603/625 [14:51<00:27, 1.26s/it]
Training 1/1 epoch (loss 2.5750): 96%|ββββββββββ| 603/625 [14:52<00:27, 1.26s/it]
Training 1/1 epoch (loss 2.5750): 97%|ββββββββββ| 604/625 [14:52<00:23, 1.12s/it]
Training 1/1 epoch (loss 2.6743): 97%|ββββββββββ| 604/625 [14:54<00:23, 1.12s/it]
Training 1/1 epoch (loss 2.6743): 97%|ββββββββββ| 605/625 [14:54<00:27, 1.38s/it]
Training 1/1 epoch (loss 2.5772): 97%|ββββββββββ| 605/625 [14:55<00:27, 1.38s/it]
Training 1/1 epoch (loss 2.5772): 97%|ββββββββββ| 606/625 [14:55<00:26, 1.40s/it]
Training 1/1 epoch (loss 2.5493): 97%|ββββββββββ| 606/625 [14:56<00:26, 1.40s/it]
Training 1/1 epoch (loss 2.5493): 97%|ββββββββββ| 607/625 [14:56<00:20, 1.13s/it]
Training 1/1 epoch (loss 2.7055): 97%|ββββββββββ| 607/625 [14:57<00:20, 1.13s/it]
Training 1/1 epoch (loss 2.7055): 97%|ββββββββββ| 608/625 [14:57<00:20, 1.23s/it]
Training 1/1 epoch (loss 2.6612): 97%|ββββββββββ| 608/625 [14:59<00:20, 1.23s/it]
Training 1/1 epoch (loss 2.6612): 97%|ββββββββββ| 609/625 [14:59<00:22, 1.38s/it]
Training 1/1 epoch (loss 2.7099): 97%|ββββββββββ| 609/625 [15:00<00:22, 1.38s/it]
Training 1/1 epoch (loss 2.7099): 98%|ββββββββββ| 610/625 [15:00<00:20, 1.36s/it]
Training 1/1 epoch (loss 2.4881): 98%|ββββββββββ| 610/625 [15:03<00:20, 1.36s/it]
Training 1/1 epoch (loss 2.4881): 98%|ββββββββββ| 611/625 [15:03<00:23, 1.67s/it]
Training 1/1 epoch (loss 2.8311): 98%|ββββββββββ| 611/625 [15:04<00:23, 1.67s/it]
Training 1/1 epoch (loss 2.8311): 98%|ββββββββββ| 612/625 [15:04<00:21, 1.69s/it]
Training 1/1 epoch (loss 2.4760): 98%|ββββββββββ| 612/625 [15:06<00:21, 1.69s/it]
Training 1/1 epoch (loss 2.4760): 98%|ββββββββββ| 613/625 [15:06<00:19, 1.63s/it]
Training 1/1 epoch (loss 2.6579): 98%|ββββββββββ| 613/625 [15:08<00:19, 1.63s/it]
Training 1/1 epoch (loss 2.6579): 98%|ββββββββββ| 614/625 [15:08<00:18, 1.70s/it]
Training 1/1 epoch (loss 2.5033): 98%|ββββββββββ| 614/625 [15:09<00:18, 1.70s/it]
Training 1/1 epoch (loss 2.5033): 98%|ββββββββββ| 615/625 [15:09<00:15, 1.50s/it]
Training 1/1 epoch (loss 2.5139): 98%|ββββββββββ| 615/625 [15:10<00:15, 1.50s/it]
Training 1/1 epoch (loss 2.5139): 99%|ββββββββββ| 616/625 [15:10<00:13, 1.51s/it]
Training 1/1 epoch (loss 2.7079): 99%|ββββββββββ| 616/625 [15:12<00:13, 1.51s/it]
Training 1/1 epoch (loss 2.7079): 99%|ββββββββββ| 617/625 [15:12<00:11, 1.41s/it]
Training 1/1 epoch (loss 2.7059): 99%|ββββββββββ| 617/625 [15:12<00:11, 1.41s/it]
Training 1/1 epoch (loss 2.7059): 99%|ββββββββββ| 618/625 [15:12<00:07, 1.14s/it]
Training 1/1 epoch (loss 2.5707): 99%|ββββββββββ| 618/625 [15:14<00:07, 1.14s/it]
Training 1/1 epoch (loss 2.5707): 99%|ββββββββββ| 619/625 [15:14<00:08, 1.45s/it]
Training 1/1 epoch (loss 2.4534): 99%|ββββββββββ| 619/625 [15:17<00:08, 1.45s/it]
Training 1/1 epoch (loss 2.4534): 99%|ββββββββββ| 620/625 [15:17<00:08, 1.75s/it]
Training 1/1 epoch (loss 2.6991): 99%|ββββββββββ| 620/625 [15:17<00:08, 1.75s/it]
Training 1/1 epoch (loss 2.6991): 99%|ββββββββββ| 621/625 [15:17<00:05, 1.35s/it]
Training 1/1 epoch (loss 2.5219): 99%|ββββββββββ| 621/625 [15:19<00:05, 1.35s/it]
Training 1/1 epoch (loss 2.5219): 100%|ββββββββββ| 622/625 [15:19<00:04, 1.44s/it]
Training 1/1 epoch (loss 2.6362): 100%|ββββββββββ| 622/625 [15:21<00:04, 1.44s/it]
Training 1/1 epoch (loss 2.6362): 100%|ββββββββββ| 623/625 [15:21<00:03, 1.54s/it]
Training 1/1 epoch (loss 2.4406): 100%|ββββββββββ| 623/625 [15:21<00:03, 1.54s/it]
Training 1/1 epoch (loss 2.4406): 100%|ββββββββββ| 624/625 [15:21<00:01, 1.35s/it]
Training 1/1 epoch (loss 2.5896): 100%|ββββββββββ| 624/625 [15:23<00:01, 1.35s/it]
Training 1/1 epoch (loss 2.5896): 100%|ββββββββββ| 625/625 [15:23<00:00, 1.36s/it]
Training 1/1 epoch (loss 2.5896): 100%|ββββββββββ| 625/625 [15:23<00:00, 1.48s/it] |