xingjianleng commited on
Commit
f1755ad
·
verified ·
1 Parent(s): 0999a90

Upload folder using huggingface_hub

Browse files
stage2/lightningdit-xl-dinov3-vit-l16-bf16/checkpoints/0025000.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0d45e6ef2ce4b02cb5f533aea2a088bb6e13eee21f0926ad75f5ab8fe45d4b88
3
+ size 19243018610
stage2/lightningdit-xl-dinov3-vit-l16-bf16/checkpoints/0050000.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f5f1fab1b950fd41ebccdec0a4e90d6772ac74ba14e4dc9e4ed383d9b124d18e
3
+ size 19243018610
stage2/lightningdit-xl-dinov3-vit-l16-bf16/checkpoints/0075000.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1c584cd41108abac376d346bafc209b25323f85106e66db7cbbb4cfa11208e52
3
+ size 19243018674
stage2/lightningdit-xl-dinov3-vit-l16-bf16/log.txt ADDED
@@ -0,0 +1,1015 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [2025-10-28 00:14:21] Experiment directory created at results/stage2/hfdata/lightningdit-xl-dinov3-vit-l16-bf16
2
+ [2025-10-28 00:14:25] using base=100 for rope new
3
+ [2025-10-28 00:14:25] using min_period=None for rope new
4
+ [2025-10-28 00:14:25] using max_period=None for rope new
5
+ [2025-10-28 00:14:25] using normalize_coords=separate for rope new
6
+ [2025-10-28 00:14:25] using shift_coords=None for rope new
7
+ [2025-10-28 00:14:25] using rescale_coords=2 for rope new
8
+ [2025-10-28 00:14:25] using jitter_coords=None for rope new
9
+ [2025-10-28 00:14:25] using dtype=fp32 for rope new
10
+ [2025-10-28 00:14:25] using mlp layer as FFN
11
+ [2025-10-28 00:14:45] Model Parameters: 1202.82M
12
+ [2025-10-28 00:14:50] Dataset contains 1,281,167 images (/scratch/xingjian.leng/data/train)
13
+ [2025-10-28 00:14:50] Gradient accumulation: steps=1, micro batch=128, per-GPU batch=128, global batch=1024
14
+ [2025-10-28 00:14:50] Precision mode: bf16
15
+ [2025-10-28 00:14:50] Training configured for 80 epochs, 1251 steps per epoch.
16
+ [2025-10-28 00:14:50] Optimizer: ADAMW with lr=0.0002, betas=(0.9, 0.95), weight_decay=0.0, eps=1e-08
17
+ Scheduler: linear with warmup_steps=0, decay_end_steps=0, final_lr=0.0002
18
+ [2025-10-28 00:14:50] Training for 80 epochs...
19
+ [2025-10-28 00:14:50] Beginning epoch 0...
20
+ [2025-10-28 00:46:23] Experiment directory created at results/stage2/hfdata/lightningdit-xl-dinov3-vit-l16-bf16
21
+ [2025-10-28 00:46:27] using base=100 for rope new
22
+ [2025-10-28 00:46:27] using min_period=None for rope new
23
+ [2025-10-28 00:46:27] using max_period=None for rope new
24
+ [2025-10-28 00:46:27] using normalize_coords=separate for rope new
25
+ [2025-10-28 00:46:27] using shift_coords=None for rope new
26
+ [2025-10-28 00:46:27] using rescale_coords=2 for rope new
27
+ [2025-10-28 00:46:27] using jitter_coords=None for rope new
28
+ [2025-10-28 00:46:27] using dtype=fp32 for rope new
29
+ [2025-10-28 00:46:27] using mlp layer as FFN
30
+ [2025-10-28 00:46:45] Model Parameters: 1202.82M
31
+ [2025-10-28 00:46:51] Dataset contains 1,281,167 images (/scratch/xingjian.leng/data/dinov3-vit-l16_hfdataset_precentercrop_True_train_bfloat16)
32
+ [2025-10-28 00:46:51] Gradient accumulation: steps=1, micro batch=128, per-GPU batch=128, global batch=1024
33
+ [2025-10-28 00:46:51] Precision mode: bf16
34
+ [2025-10-28 00:46:51] Training configured for 80 epochs, 1251 steps per epoch.
35
+ [2025-10-28 00:46:51] Optimizer: ADAMW with lr=0.0002, betas=(0.9, 0.95), weight_decay=0.0, eps=1e-08
36
+ Scheduler: linear with warmup_steps=0, decay_end_steps=0, final_lr=0.0002
37
+ [2025-10-28 00:46:52] Training for 80 epochs...
38
+ [2025-10-28 00:46:52] Beginning epoch 0...
39
+ [2025-10-28 00:47:00] Generating EMA samples...
40
+ [2025-10-28 00:47:31] Generating EMA samples done.
41
+ [2025-10-28 00:48:58] (step=0000100) Train Loss: 1.6739, Train Steps/Sec: 0.79
42
+ [2025-10-28 00:50:25] (step=0000200) Train Loss: 1.2040, Train Steps/Sec: 1.14
43
+ [2025-10-28 00:51:53] (step=0000300) Train Loss: 0.9682, Train Steps/Sec: 1.15
44
+ [2025-10-28 00:53:20] (step=0000400) Train Loss: 0.8701, Train Steps/Sec: 1.15
45
+ [2025-10-28 00:54:47] (step=0000500) Train Loss: 0.8143, Train Steps/Sec: 1.15
46
+ [2025-10-28 00:56:14] (step=0000600) Train Loss: 0.7724, Train Steps/Sec: 1.15
47
+ [2025-10-28 00:57:42] (step=0000700) Train Loss: 0.7378, Train Steps/Sec: 1.15
48
+ [2025-10-28 00:59:09] (step=0000800) Train Loss: 0.7123, Train Steps/Sec: 1.14
49
+ [2025-10-28 01:00:37] (step=0000900) Train Loss: 0.6917, Train Steps/Sec: 1.14
50
+ [2025-10-28 01:02:04] (step=0001000) Train Loss: 0.6766, Train Steps/Sec: 1.15
51
+ [2025-10-28 01:03:31] (step=0001100) Train Loss: 0.6625, Train Steps/Sec: 1.15
52
+ [2025-10-28 01:04:58] (step=0001200) Train Loss: 0.6513, Train Steps/Sec: 1.15
53
+ [2025-10-28 01:05:44] Beginning epoch 1...
54
+ [2025-10-28 01:06:28] (step=0001300) Train Loss: 0.6398, Train Steps/Sec: 1.11
55
+ [2025-10-28 01:07:56] (step=0001400) Train Loss: 0.6330, Train Steps/Sec: 1.15
56
+ [2025-10-28 01:09:23] (step=0001500) Train Loss: 0.6244, Train Steps/Sec: 1.15
57
+ [2025-10-28 01:10:50] (step=0001600) Train Loss: 0.6172, Train Steps/Sec: 1.14
58
+ [2025-10-28 01:12:18] (step=0001700) Train Loss: 0.6123, Train Steps/Sec: 1.15
59
+ [2025-10-28 01:13:45] (step=0001800) Train Loss: 0.6069, Train Steps/Sec: 1.14
60
+ [2025-10-28 01:15:13] (step=0001900) Train Loss: 0.6013, Train Steps/Sec: 1.15
61
+ [2025-10-28 01:16:40] (step=0002000) Train Loss: 0.5952, Train Steps/Sec: 1.15
62
+ [2025-10-28 01:18:07] (step=0002100) Train Loss: 0.5921, Train Steps/Sec: 1.15
63
+ [2025-10-28 01:19:35] (step=0002200) Train Loss: 0.5886, Train Steps/Sec: 1.14
64
+ [2025-10-28 01:21:02] (step=0002300) Train Loss: 0.5839, Train Steps/Sec: 1.15
65
+ [2025-10-28 01:22:29] (step=0002400) Train Loss: 0.5815, Train Steps/Sec: 1.14
66
+ [2025-10-28 01:23:56] (step=0002500) Train Loss: 0.5785, Train Steps/Sec: 1.15
67
+ [2025-10-28 01:23:59] Beginning epoch 2...
68
+ [2025-10-28 01:25:27] (step=0002600) Train Loss: 0.5743, Train Steps/Sec: 1.11
69
+ [2025-10-28 01:26:54] (step=0002700) Train Loss: 0.5716, Train Steps/Sec: 1.14
70
+ [2025-10-28 01:28:22] (step=0002800) Train Loss: 0.5671, Train Steps/Sec: 1.15
71
+ [2025-10-28 01:29:49] (step=0002900) Train Loss: 0.5661, Train Steps/Sec: 1.15
72
+ [2025-10-28 01:31:16] (step=0003000) Train Loss: 0.5645, Train Steps/Sec: 1.15
73
+ [2025-10-28 01:32:43] (step=0003100) Train Loss: 0.5618, Train Steps/Sec: 1.15
74
+ [2025-10-28 01:34:11] (step=0003200) Train Loss: 0.5588, Train Steps/Sec: 1.14
75
+ [2025-10-28 01:35:38] (step=0003300) Train Loss: 0.5576, Train Steps/Sec: 1.15
76
+ [2025-10-28 01:37:05] (step=0003400) Train Loss: 0.5541, Train Steps/Sec: 1.15
77
+ [2025-10-28 01:38:33] (step=0003500) Train Loss: 0.5533, Train Steps/Sec: 1.14
78
+ [2025-10-28 01:40:00] (step=0003600) Train Loss: 0.5508, Train Steps/Sec: 1.15
79
+ [2025-10-28 01:41:28] (step=0003700) Train Loss: 0.5496, Train Steps/Sec: 1.15
80
+ [2025-10-28 01:42:14] Beginning epoch 3...
81
+ [2025-10-28 01:42:58] (step=0003800) Train Loss: 0.5476, Train Steps/Sec: 1.11
82
+ [2025-10-28 01:44:25] (step=0003900) Train Loss: 0.5444, Train Steps/Sec: 1.14
83
+ [2025-10-28 01:45:52] (step=0004000) Train Loss: 0.5438, Train Steps/Sec: 1.15
84
+ [2025-10-28 01:47:20] (step=0004100) Train Loss: 0.5430, Train Steps/Sec: 1.15
85
+ [2025-10-28 01:48:47] (step=0004200) Train Loss: 0.5415, Train Steps/Sec: 1.15
86
+ [2025-10-28 01:50:15] (step=0004300) Train Loss: 0.5397, Train Steps/Sec: 1.15
87
+ [2025-10-28 01:51:42] (step=0004400) Train Loss: 0.5378, Train Steps/Sec: 1.14
88
+ [2025-10-28 01:53:10] (step=0004500) Train Loss: 0.5366, Train Steps/Sec: 1.15
89
+ [2025-10-28 01:54:37] (step=0004600) Train Loss: 0.5356, Train Steps/Sec: 1.15
90
+ [2025-10-28 01:56:04] (step=0004700) Train Loss: 0.5349, Train Steps/Sec: 1.14
91
+ [2025-10-28 01:57:32] (step=0004800) Train Loss: 0.5331, Train Steps/Sec: 1.15
92
+ [2025-10-28 01:58:59] (step=0004900) Train Loss: 0.5329, Train Steps/Sec: 1.15
93
+ [2025-10-28 02:00:26] (step=0005000) Train Loss: 0.5316, Train Steps/Sec: 1.15
94
+ [2025-10-28 02:00:30] Beginning epoch 4...
95
+ [2025-10-28 02:01:56] (step=0005100) Train Loss: 0.5296, Train Steps/Sec: 1.11
96
+ [2025-10-28 02:03:24] (step=0005200) Train Loss: 0.5284, Train Steps/Sec: 1.14
97
+ [2025-10-28 02:04:51] (step=0005300) Train Loss: 0.5285, Train Steps/Sec: 1.15
98
+ [2025-10-28 02:06:19] (step=0005400) Train Loss: 0.5262, Train Steps/Sec: 1.15
99
+ [2025-10-28 02:07:46] (step=0005500) Train Loss: 0.5255, Train Steps/Sec: 1.14
100
+ [2025-10-28 02:09:13] (step=0005600) Train Loss: 0.5239, Train Steps/Sec: 1.15
101
+ [2025-10-28 02:10:41] (step=0005700) Train Loss: 0.5244, Train Steps/Sec: 1.15
102
+ [2025-10-28 02:12:08] (step=0005800) Train Loss: 0.5224, Train Steps/Sec: 1.15
103
+ [2025-10-28 02:13:35] (step=0005900) Train Loss: 0.5218, Train Steps/Sec: 1.15
104
+ [2025-10-28 02:15:03] (step=0006000) Train Loss: 0.5217, Train Steps/Sec: 1.15
105
+ [2025-10-28 02:16:30] (step=0006100) Train Loss: 0.5202, Train Steps/Sec: 1.14
106
+ [2025-10-28 02:17:58] (step=0006200) Train Loss: 0.5191, Train Steps/Sec: 1.15
107
+ [2025-10-28 02:18:46] Beginning epoch 5...
108
+ [2025-10-28 02:19:28] (step=0006300) Train Loss: 0.5196, Train Steps/Sec: 1.11
109
+ [2025-10-28 02:20:55] (step=0006400) Train Loss: 0.5186, Train Steps/Sec: 1.15
110
+ [2025-10-28 02:22:22] (step=0006500) Train Loss: 0.5177, Train Steps/Sec: 1.15
111
+ [2025-10-28 02:23:49] (step=0006600) Train Loss: 0.5157, Train Steps/Sec: 1.15
112
+ [2025-10-28 02:25:17] (step=0006700) Train Loss: 0.5152, Train Steps/Sec: 1.15
113
+ [2025-10-28 02:26:44] (step=0006800) Train Loss: 0.5145, Train Steps/Sec: 1.15
114
+ [2025-10-28 02:28:12] (step=0006900) Train Loss: 0.5153, Train Steps/Sec: 1.14
115
+ [2025-10-28 02:29:39] (step=0007000) Train Loss: 0.5137, Train Steps/Sec: 1.14
116
+ [2025-10-28 02:31:07] (step=0007100) Train Loss: 0.5126, Train Steps/Sec: 1.15
117
+ [2025-10-28 02:32:34] (step=0007200) Train Loss: 0.5128, Train Steps/Sec: 1.15
118
+ [2025-10-28 02:34:01] (step=0007300) Train Loss: 0.5124, Train Steps/Sec: 1.15
119
+ [2025-10-28 02:35:28] (step=0007400) Train Loss: 0.5107, Train Steps/Sec: 1.15
120
+ [2025-10-28 02:36:56] (step=0007500) Train Loss: 0.5102, Train Steps/Sec: 1.15
121
+ [2025-10-28 02:37:01] Beginning epoch 6...
122
+ [2025-10-28 02:38:26] (step=0007600) Train Loss: 0.5096, Train Steps/Sec: 1.11
123
+ [2025-10-28 02:39:53] (step=0007700) Train Loss: 0.5086, Train Steps/Sec: 1.15
124
+ [2025-10-28 02:41:21] (step=0007800) Train Loss: 0.5084, Train Steps/Sec: 1.14
125
+ [2025-10-28 02:42:48] (step=0007900) Train Loss: 0.5073, Train Steps/Sec: 1.15
126
+ [2025-10-28 02:44:16] (step=0008000) Train Loss: 0.5083, Train Steps/Sec: 1.15
127
+ [2025-10-28 02:45:43] (step=0008100) Train Loss: 0.5068, Train Steps/Sec: 1.15
128
+ [2025-10-28 02:47:10] (step=0008200) Train Loss: 0.5055, Train Steps/Sec: 1.15
129
+ [2025-10-28 02:48:37] (step=0008300) Train Loss: 0.5067, Train Steps/Sec: 1.15
130
+ [2025-10-28 02:50:05] (step=0008400) Train Loss: 0.5060, Train Steps/Sec: 1.15
131
+ [2025-10-28 02:51:32] (step=0008500) Train Loss: 0.5034, Train Steps/Sec: 1.15
132
+ [2025-10-28 02:52:59] (step=0008600) Train Loss: 0.5036, Train Steps/Sec: 1.14
133
+ [2025-10-28 02:54:27] (step=0008700) Train Loss: 0.5033, Train Steps/Sec: 1.15
134
+ [2025-10-28 02:55:17] Beginning epoch 7...
135
+ [2025-10-28 02:55:57] (step=0008800) Train Loss: 0.5028, Train Steps/Sec: 1.11
136
+ [2025-10-28 02:57:24] (step=0008900) Train Loss: 0.5017, Train Steps/Sec: 1.15
137
+ [2025-10-28 02:58:51] (step=0009000) Train Loss: 0.5014, Train Steps/Sec: 1.15
138
+ [2025-10-28 03:00:18] (step=0009100) Train Loss: 0.5008, Train Steps/Sec: 1.15
139
+ [2025-10-28 03:01:46] (step=0009200) Train Loss: 0.5007, Train Steps/Sec: 1.15
140
+ [2025-10-28 03:03:13] (step=0009300) Train Loss: 0.5002, Train Steps/Sec: 1.15
141
+ [2025-10-28 03:04:40] (step=0009400) Train Loss: 0.4995, Train Steps/Sec: 1.15
142
+ [2025-10-28 03:06:08] (step=0009500) Train Loss: 0.4994, Train Steps/Sec: 1.14
143
+ [2025-10-28 03:07:35] (step=0009600) Train Loss: 0.4998, Train Steps/Sec: 1.15
144
+ [2025-10-28 03:09:03] (step=0009700) Train Loss: 0.4998, Train Steps/Sec: 1.15
145
+ [2025-10-28 03:10:30] (step=0009800) Train Loss: 0.4987, Train Steps/Sec: 1.15
146
+ [2025-10-28 03:11:57] (step=0009900) Train Loss: 0.4976, Train Steps/Sec: 1.15
147
+ [2025-10-28 03:13:24] (step=0010000) Train Loss: 0.4983, Train Steps/Sec: 1.15
148
+ [2025-10-28 03:13:32] Beginning epoch 8...
149
+ [2025-10-28 03:14:55] (step=0010100) Train Loss: 0.4976, Train Steps/Sec: 1.10
150
+ [2025-10-28 03:16:23] (step=0010200) Train Loss: 0.4965, Train Steps/Sec: 1.15
151
+ [2025-10-28 03:17:50] (step=0010300) Train Loss: 0.4964, Train Steps/Sec: 1.14
152
+ [2025-10-28 03:19:18] (step=0010400) Train Loss: 0.4954, Train Steps/Sec: 1.15
153
+ [2025-10-28 03:20:45] (step=0010500) Train Loss: 0.4949, Train Steps/Sec: 1.15
154
+ [2025-10-28 03:22:12] (step=0010600) Train Loss: 0.4964, Train Steps/Sec: 1.15
155
+ [2025-10-28 03:23:40] (step=0010700) Train Loss: 0.4951, Train Steps/Sec: 1.15
156
+ [2025-10-28 03:25:07] (step=0010800) Train Loss: 0.4942, Train Steps/Sec: 1.15
157
+ [2025-10-28 03:26:34] (step=0010900) Train Loss: 0.4938, Train Steps/Sec: 1.14
158
+ [2025-10-28 03:28:02] (step=0011000) Train Loss: 0.4943, Train Steps/Sec: 1.15
159
+ [2025-10-28 03:29:29] (step=0011100) Train Loss: 0.4944, Train Steps/Sec: 1.15
160
+ [2025-10-28 03:30:57] (step=0011200) Train Loss: 0.4924, Train Steps/Sec: 1.14
161
+ [2025-10-28 03:31:49] Beginning epoch 9...
162
+ [2025-10-28 03:32:27] (step=0011300) Train Loss: 0.4930, Train Steps/Sec: 1.11
163
+ [2025-10-28 03:33:54] (step=0011400) Train Loss: 0.4911, Train Steps/Sec: 1.15
164
+ [2025-10-28 03:35:21] (step=0011500) Train Loss: 0.4918, Train Steps/Sec: 1.15
165
+ [2025-10-28 03:36:49] (step=0011600) Train Loss: 0.4924, Train Steps/Sec: 1.15
166
+ [2025-10-28 03:38:16] (step=0011700) Train Loss: 0.4914, Train Steps/Sec: 1.14
167
+ [2025-10-28 03:39:43] (step=0011800) Train Loss: 0.4912, Train Steps/Sec: 1.15
168
+ [2025-10-28 03:41:10] (step=0011900) Train Loss: 0.4902, Train Steps/Sec: 1.15
169
+ [2025-10-28 03:42:38] (step=0012000) Train Loss: 0.4912, Train Steps/Sec: 1.14
170
+ [2025-10-28 03:44:06] (step=0012100) Train Loss: 0.4906, Train Steps/Sec: 1.14
171
+ [2025-10-28 03:45:33] (step=0012200) Train Loss: 0.4909, Train Steps/Sec: 1.15
172
+ [2025-10-28 03:47:00] (step=0012300) Train Loss: 0.4892, Train Steps/Sec: 1.15
173
+ [2025-10-28 03:48:28] (step=0012400) Train Loss: 0.4882, Train Steps/Sec: 1.15
174
+ [2025-10-28 03:49:55] (step=0012500) Train Loss: 0.4889, Train Steps/Sec: 1.15
175
+ [2025-10-28 03:50:04] Beginning epoch 10...
176
+ [2025-10-28 03:51:25] (step=0012600) Train Loss: 0.4891, Train Steps/Sec: 1.11
177
+ [2025-10-28 03:52:52] (step=0012700) Train Loss: 0.4890, Train Steps/Sec: 1.15
178
+ [2025-10-28 03:54:20] (step=0012800) Train Loss: 0.4882, Train Steps/Sec: 1.15
179
+ [2025-10-28 03:55:48] (step=0012900) Train Loss: 0.4860, Train Steps/Sec: 1.14
180
+ [2025-10-28 03:57:15] (step=0013000) Train Loss: 0.4887, Train Steps/Sec: 1.15
181
+ [2025-10-28 03:58:42] (step=0013100) Train Loss: 0.4870, Train Steps/Sec: 1.14
182
+ [2025-10-28 04:00:10] (step=0013200) Train Loss: 0.4883, Train Steps/Sec: 1.15
183
+ [2025-10-28 04:01:37] (step=0013300) Train Loss: 0.4867, Train Steps/Sec: 1.15
184
+ [2025-10-28 04:03:04] (step=0013400) Train Loss: 0.4854, Train Steps/Sec: 1.15
185
+ [2025-10-28 04:04:31] (step=0013500) Train Loss: 0.4856, Train Steps/Sec: 1.15
186
+ [2025-10-28 04:05:59] (step=0013600) Train Loss: 0.4849, Train Steps/Sec: 1.15
187
+ [2025-10-28 04:07:26] (step=0013700) Train Loss: 0.4863, Train Steps/Sec: 1.14
188
+ [2025-10-28 04:08:20] Beginning epoch 11...
189
+ [2025-10-28 04:08:57] (step=0013800) Train Loss: 0.4843, Train Steps/Sec: 1.11
190
+ [2025-10-28 04:10:24] (step=0013900) Train Loss: 0.4853, Train Steps/Sec: 1.15
191
+ [2025-10-28 04:11:51] (step=0014000) Train Loss: 0.4838, Train Steps/Sec: 1.14
192
+ [2025-10-28 04:13:19] (step=0014100) Train Loss: 0.4841, Train Steps/Sec: 1.15
193
+ [2025-10-28 04:14:46] (step=0014200) Train Loss: 0.4843, Train Steps/Sec: 1.15
194
+ [2025-10-28 04:16:13] (step=0014300) Train Loss: 0.4849, Train Steps/Sec: 1.15
195
+ [2025-10-28 04:17:40] (step=0014400) Train Loss: 0.4840, Train Steps/Sec: 1.15
196
+ [2025-10-28 04:19:08] (step=0014500) Train Loss: 0.4838, Train Steps/Sec: 1.15
197
+ [2025-10-28 04:20:35] (step=0014600) Train Loss: 0.4829, Train Steps/Sec: 1.14
198
+ [2025-10-28 04:22:03] (step=0014700) Train Loss: 0.4830, Train Steps/Sec: 1.15
199
+ [2025-10-28 04:23:30] (step=0014800) Train Loss: 0.4824, Train Steps/Sec: 1.15
200
+ [2025-10-28 04:24:57] (step=0014900) Train Loss: 0.4832, Train Steps/Sec: 1.15
201
+ [2025-10-28 04:26:25] (step=0015000) Train Loss: 0.4822, Train Steps/Sec: 1.15
202
+ [2025-10-28 04:26:35] Beginning epoch 12...
203
+ [2025-10-28 04:27:55] (step=0015100) Train Loss: 0.4828, Train Steps/Sec: 1.11
204
+ [2025-10-28 04:29:22] (step=0015200) Train Loss: 0.4807, Train Steps/Sec: 1.15
205
+ [2025-10-28 04:30:49] (step=0015300) Train Loss: 0.4815, Train Steps/Sec: 1.15
206
+ [2025-10-28 04:32:17] (step=0015400) Train Loss: 0.4817, Train Steps/Sec: 1.14
207
+ [2025-10-28 04:33:45] (step=0015500) Train Loss: 0.4806, Train Steps/Sec: 1.14
208
+ [2025-10-28 04:35:12] (step=0015600) Train Loss: 0.4797, Train Steps/Sec: 1.15
209
+ [2025-10-28 04:36:39] (step=0015700) Train Loss: 0.4814, Train Steps/Sec: 1.15
210
+ [2025-10-28 04:38:07] (step=0015800) Train Loss: 0.4797, Train Steps/Sec: 1.15
211
+ [2025-10-28 04:39:34] (step=0015900) Train Loss: 0.4804, Train Steps/Sec: 1.15
212
+ [2025-10-28 04:41:01] (step=0016000) Train Loss: 0.4796, Train Steps/Sec: 1.15
213
+ [2025-10-28 04:42:28] (step=0016100) Train Loss: 0.4793, Train Steps/Sec: 1.15
214
+ [2025-10-28 04:43:56] (step=0016200) Train Loss: 0.4806, Train Steps/Sec: 1.15
215
+ [2025-10-28 04:44:51] Beginning epoch 13...
216
+ [2025-10-28 04:45:26] (step=0016300) Train Loss: 0.4790, Train Steps/Sec: 1.10
217
+ [2025-10-28 04:46:54] (step=0016400) Train Loss: 0.4793, Train Steps/Sec: 1.15
218
+ [2025-10-28 04:48:21] (step=0016500) Train Loss: 0.4793, Train Steps/Sec: 1.15
219
+ [2025-10-28 04:49:48] (step=0016600) Train Loss: 0.4783, Train Steps/Sec: 1.15
220
+ [2025-10-28 04:51:15] (step=0016700) Train Loss: 0.4781, Train Steps/Sec: 1.15
221
+ [2025-10-28 04:52:43] (step=0016800) Train Loss: 0.4781, Train Steps/Sec: 1.15
222
+ [2025-10-28 04:54:10] (step=0016900) Train Loss: 0.4778, Train Steps/Sec: 1.15
223
+ [2025-10-28 04:55:37] (step=0017000) Train Loss: 0.4772, Train Steps/Sec: 1.15
224
+ [2025-10-28 04:57:05] (step=0017100) Train Loss: 0.4790, Train Steps/Sec: 1.14
225
+ [2025-10-28 04:58:32] (step=0017200) Train Loss: 0.4782, Train Steps/Sec: 1.14
226
+ [2025-10-28 05:00:00] (step=0017300) Train Loss: 0.4755, Train Steps/Sec: 1.15
227
+ [2025-10-28 05:01:27] (step=0017400) Train Loss: 0.4762, Train Steps/Sec: 1.15
228
+ [2025-10-28 05:02:54] (step=0017500) Train Loss: 0.4775, Train Steps/Sec: 1.15
229
+ [2025-10-28 05:03:07] Beginning epoch 14...
230
+ [2025-10-28 05:04:24] (step=0017600) Train Loss: 0.4764, Train Steps/Sec: 1.11
231
+ [2025-10-28 05:05:51] (step=0017700) Train Loss: 0.4764, Train Steps/Sec: 1.15
232
+ [2025-10-28 05:07:19] (step=0017800) Train Loss: 0.4749, Train Steps/Sec: 1.15
233
+ [2025-10-28 05:08:46] (step=0017900) Train Loss: 0.4751, Train Steps/Sec: 1.14
234
+ [2025-10-28 05:10:14] (step=0018000) Train Loss: 0.4777, Train Steps/Sec: 1.14
235
+ [2025-10-28 05:11:41] (step=0018100) Train Loss: 0.4756, Train Steps/Sec: 1.15
236
+ [2025-10-28 05:13:08] (step=0018200) Train Loss: 0.4754, Train Steps/Sec: 1.15
237
+ [2025-10-28 05:14:36] (step=0018300) Train Loss: 0.4765, Train Steps/Sec: 1.15
238
+ [2025-10-28 05:16:03] (step=0018400) Train Loss: 0.4742, Train Steps/Sec: 1.15
239
+ [2025-10-28 05:17:30] (step=0018500) Train Loss: 0.4742, Train Steps/Sec: 1.15
240
+ [2025-10-28 05:18:57] (step=0018600) Train Loss: 0.4751, Train Steps/Sec: 1.15
241
+ [2025-10-28 05:20:25] (step=0018700) Train Loss: 0.4743, Train Steps/Sec: 1.15
242
+ [2025-10-28 05:21:22] Beginning epoch 15...
243
+ [2025-10-28 05:21:55] (step=0018800) Train Loss: 0.4733, Train Steps/Sec: 1.10
244
+ [2025-10-28 05:23:23] (step=0018900) Train Loss: 0.4735, Train Steps/Sec: 1.14
245
+ [2025-10-28 05:24:50] (step=0019000) Train Loss: 0.4733, Train Steps/Sec: 1.15
246
+ [2025-10-28 05:26:17] (step=0019100) Train Loss: 0.4725, Train Steps/Sec: 1.15
247
+ [2025-10-28 05:27:45] (step=0019200) Train Loss: 0.4740, Train Steps/Sec: 1.15
248
+ [2025-10-28 05:29:12] (step=0019300) Train Loss: 0.4733, Train Steps/Sec: 1.15
249
+ [2025-10-28 05:30:39] (step=0019400) Train Loss: 0.4730, Train Steps/Sec: 1.15
250
+ [2025-10-28 05:32:06] (step=0019500) Train Loss: 0.4735, Train Steps/Sec: 1.15
251
+ [2025-10-28 05:33:34] (step=0019600) Train Loss: 0.4729, Train Steps/Sec: 1.15
252
+ [2025-10-28 05:35:01] (step=0019700) Train Loss: 0.4730, Train Steps/Sec: 1.14
253
+ [2025-10-28 05:36:28] (step=0019800) Train Loss: 0.4727, Train Steps/Sec: 1.15
254
+ [2025-10-28 05:37:56] (step=0019900) Train Loss: 0.4719, Train Steps/Sec: 1.15
255
+ [2025-10-28 05:39:23] (step=0020000) Train Loss: 0.4713, Train Steps/Sec: 1.15
256
+ [2025-10-28 05:39:37] Beginning epoch 16...
257
+ [2025-10-28 05:40:53] (step=0020100) Train Loss: 0.4721, Train Steps/Sec: 1.11
258
+ [2025-10-28 05:42:20] (step=0020200) Train Loss: 0.4715, Train Steps/Sec: 1.15
259
+ [2025-10-28 05:43:48] (step=0020300) Train Loss: 0.4708, Train Steps/Sec: 1.15
260
+ [2025-10-28 05:45:15] (step=0020400) Train Loss: 0.4713, Train Steps/Sec: 1.15
261
+ [2025-10-28 05:46:43] (step=0020500) Train Loss: 0.4719, Train Steps/Sec: 1.14
262
+ [2025-10-28 05:48:10] (step=0020600) Train Loss: 0.4717, Train Steps/Sec: 1.14
263
+ [2025-10-28 05:49:38] (step=0020700) Train Loss: 0.4710, Train Steps/Sec: 1.15
264
+ [2025-10-28 05:51:05] (step=0020800) Train Loss: 0.4714, Train Steps/Sec: 1.15
265
+ [2025-10-28 05:52:32] (step=0020900) Train Loss: 0.4707, Train Steps/Sec: 1.15
266
+ [2025-10-28 05:54:00] (step=0021000) Train Loss: 0.4711, Train Steps/Sec: 1.14
267
+ [2025-10-28 05:55:27] (step=0021100) Train Loss: 0.4700, Train Steps/Sec: 1.15
268
+ [2025-10-28 05:56:54] (step=0021200) Train Loss: 0.4704, Train Steps/Sec: 1.15
269
+ [2025-10-28 05:57:53] Beginning epoch 17...
270
+ [2025-10-28 05:58:25] (step=0021300) Train Loss: 0.4703, Train Steps/Sec: 1.10
271
+ [2025-10-28 05:59:53] (step=0021400) Train Loss: 0.4700, Train Steps/Sec: 1.14
272
+ [2025-10-28 06:01:20] (step=0021500) Train Loss: 0.4701, Train Steps/Sec: 1.15
273
+ [2025-10-28 06:02:47] (step=0021600) Train Loss: 0.4700, Train Steps/Sec: 1.15
274
+ [2025-10-28 06:04:15] (step=0021700) Train Loss: 0.4716, Train Steps/Sec: 1.15
275
+ [2025-10-28 06:05:42] (step=0021800) Train Loss: 0.4697, Train Steps/Sec: 1.14
276
+ [2025-10-28 06:07:09] (step=0021900) Train Loss: 0.4691, Train Steps/Sec: 1.15
277
+ [2025-10-28 06:08:37] (step=0022000) Train Loss: 0.4687, Train Steps/Sec: 1.15
278
+ [2025-10-28 06:10:04] (step=0022100) Train Loss: 0.4696, Train Steps/Sec: 1.15
279
+ [2025-10-28 06:11:31] (step=0022200) Train Loss: 0.4687, Train Steps/Sec: 1.14
280
+ [2025-10-28 06:12:59] (step=0022300) Train Loss: 0.4698, Train Steps/Sec: 1.14
281
+ [2025-10-28 06:14:26] (step=0022400) Train Loss: 0.4690, Train Steps/Sec: 1.15
282
+ [2025-10-28 06:15:54] (step=0022500) Train Loss: 0.4688, Train Steps/Sec: 1.15
283
+ [2025-10-28 06:16:10] Beginning epoch 18...
284
+ [2025-10-28 06:17:24] (step=0022600) Train Loss: 0.4687, Train Steps/Sec: 1.11
285
+ [2025-10-28 06:18:51] (step=0022700) Train Loss: 0.4687, Train Steps/Sec: 1.15
286
+ [2025-10-28 06:20:18] (step=0022800) Train Loss: 0.4673, Train Steps/Sec: 1.15
287
+ [2025-10-28 06:21:46] (step=0022900) Train Loss: 0.4666, Train Steps/Sec: 1.15
288
+ [2025-10-28 06:23:13] (step=0023000) Train Loss: 0.4673, Train Steps/Sec: 1.15
289
+ [2025-10-28 06:24:41] (step=0023100) Train Loss: 0.4684, Train Steps/Sec: 1.14
290
+ [2025-10-28 06:26:08] (step=0023200) Train Loss: 0.4684, Train Steps/Sec: 1.15
291
+ [2025-10-28 06:27:36] (step=0023300) Train Loss: 0.4672, Train Steps/Sec: 1.14
292
+ [2025-10-28 06:29:03] (step=0023400) Train Loss: 0.4680, Train Steps/Sec: 1.15
293
+ [2025-10-28 06:30:30] (step=0023500) Train Loss: 0.4663, Train Steps/Sec: 1.15
294
+ [2025-10-28 06:31:57] (step=0023600) Train Loss: 0.4675, Train Steps/Sec: 1.15
295
+ [2025-10-28 06:33:25] (step=0023700) Train Loss: 0.4675, Train Steps/Sec: 1.15
296
+ [2025-10-28 06:34:25] Beginning epoch 19...
297
+ [2025-10-28 06:34:55] (step=0023800) Train Loss: 0.4663, Train Steps/Sec: 1.11
298
+ [2025-10-28 06:36:22] (step=0023900) Train Loss: 0.4665, Train Steps/Sec: 1.14
299
+ [2025-10-28 06:37:50] (step=0024000) Train Loss: 0.4662, Train Steps/Sec: 1.14
300
+ [2025-10-28 06:39:18] (step=0024100) Train Loss: 0.4649, Train Steps/Sec: 1.14
301
+ [2025-10-28 06:40:45] (step=0024200) Train Loss: 0.4673, Train Steps/Sec: 1.15
302
+ [2025-10-28 06:42:12] (step=0024300) Train Loss: 0.4660, Train Steps/Sec: 1.15
303
+ [2025-10-28 06:43:39] (step=0024400) Train Loss: 0.4657, Train Steps/Sec: 1.15
304
+ [2025-10-28 06:45:07] (step=0024500) Train Loss: 0.4655, Train Steps/Sec: 1.15
305
+ [2025-10-28 06:46:34] (step=0024600) Train Loss: 0.4666, Train Steps/Sec: 1.15
306
+ [2025-10-28 06:48:01] (step=0024700) Train Loss: 0.4659, Train Steps/Sec: 1.15
307
+ [2025-10-28 06:49:29] (step=0024800) Train Loss: 0.4649, Train Steps/Sec: 1.14
308
+ [2025-10-28 06:50:56] (step=0024900) Train Loss: 0.4648, Train Steps/Sec: 1.14
309
+ [2025-10-28 06:52:23] (step=0025000) Train Loss: 0.4642, Train Steps/Sec: 1.15
310
+ [2025-10-28 06:53:14] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-dinov3-vit-l16-bf16/checkpoints/0025000.pt
311
+ [2025-10-28 06:53:14] Generating EMA samples...
312
+ [2025-10-28 06:53:41] Generating EMA samples done.
313
+ [2025-10-28 06:53:59] Beginning epoch 20...
314
+ [2025-10-28 06:55:11] (step=0025100) Train Loss: 0.4665, Train Steps/Sec: 0.60
315
+ [2025-10-28 06:56:38] (step=0025200) Train Loss: 0.4639, Train Steps/Sec: 1.15
316
+ [2025-10-28 06:58:06] (step=0025300) Train Loss: 0.4649, Train Steps/Sec: 1.15
317
+ [2025-10-28 06:59:33] (step=0025400) Train Loss: 0.4652, Train Steps/Sec: 1.15
318
+ [2025-10-28 07:01:00] (step=0025500) Train Loss: 0.4647, Train Steps/Sec: 1.15
319
+ [2025-10-28 07:02:28] (step=0025600) Train Loss: 0.4633, Train Steps/Sec: 1.14
320
+ [2025-10-28 07:03:55] (step=0025700) Train Loss: 0.4629, Train Steps/Sec: 1.14
321
+ [2025-10-28 07:05:23] (step=0025800) Train Loss: 0.4642, Train Steps/Sec: 1.15
322
+ [2025-10-28 07:06:50] (step=0025900) Train Loss: 0.4643, Train Steps/Sec: 1.15
323
+ [2025-10-28 07:08:17] (step=0026000) Train Loss: 0.4647, Train Steps/Sec: 1.15
324
+ [2025-10-28 07:09:44] (step=0026100) Train Loss: 0.4631, Train Steps/Sec: 1.15
325
+ [2025-10-28 07:11:12] (step=0026200) Train Loss: 0.4644, Train Steps/Sec: 1.15
326
+ [2025-10-28 07:12:14] Beginning epoch 21...
327
+ [2025-10-28 07:12:42] (step=0026300) Train Loss: 0.4638, Train Steps/Sec: 1.11
328
+ [2025-10-28 07:14:09] (step=0026400) Train Loss: 0.4635, Train Steps/Sec: 1.14
329
+ [2025-10-28 07:15:37] (step=0026500) Train Loss: 0.4640, Train Steps/Sec: 1.14
330
+ [2025-10-28 07:17:04] (step=0026600) Train Loss: 0.4640, Train Steps/Sec: 1.14
331
+ [2025-10-28 07:18:31] (step=0026700) Train Loss: 0.4624, Train Steps/Sec: 1.15
332
+ [2025-10-28 07:19:59] (step=0026800) Train Loss: 0.4628, Train Steps/Sec: 1.15
333
+ [2025-10-28 07:21:26] (step=0026900) Train Loss: 0.4621, Train Steps/Sec: 1.15
334
+ [2025-10-28 07:22:53] (step=0027000) Train Loss: 0.4626, Train Steps/Sec: 1.15
335
+ [2025-10-28 07:24:20] (step=0027100) Train Loss: 0.4626, Train Steps/Sec: 1.15
336
+ [2025-10-28 07:25:47] (step=0027200) Train Loss: 0.4623, Train Steps/Sec: 1.15
337
+ [2025-10-28 07:27:15] (step=0027300) Train Loss: 0.4624, Train Steps/Sec: 1.14
338
+ [2025-10-28 07:28:42] (step=0027400) Train Loss: 0.4621, Train Steps/Sec: 1.14
339
+ [2025-10-28 07:30:10] (step=0027500) Train Loss: 0.4627, Train Steps/Sec: 1.15
340
+ [2025-10-28 07:30:29] Beginning epoch 22...
341
+ [2025-10-28 07:31:41] (step=0027600) Train Loss: 0.4608, Train Steps/Sec: 1.10
342
+ [2025-10-28 07:33:08] (step=0027700) Train Loss: 0.4617, Train Steps/Sec: 1.15
343
+ [2025-10-28 07:34:35] (step=0027800) Train Loss: 0.4625, Train Steps/Sec: 1.15
344
+ [2025-10-28 07:36:02] (step=0027900) Train Loss: 0.4618, Train Steps/Sec: 1.14
345
+ [2025-10-28 07:37:30] (step=0028000) Train Loss: 0.4610, Train Steps/Sec: 1.15
346
+ [2025-10-28 07:38:57] (step=0028100) Train Loss: 0.4615, Train Steps/Sec: 1.15
347
+ [2025-10-28 07:40:25] (step=0028200) Train Loss: 0.4612, Train Steps/Sec: 1.14
348
+ [2025-10-28 07:41:52] (step=0028300) Train Loss: 0.4609, Train Steps/Sec: 1.14
349
+ [2025-10-28 07:43:19] (step=0028400) Train Loss: 0.4631, Train Steps/Sec: 1.15
350
+ [2025-10-28 07:44:46] (step=0028500) Train Loss: 0.4627, Train Steps/Sec: 1.15
351
+ [2025-10-28 07:46:14] (step=0028600) Train Loss: 0.4614, Train Steps/Sec: 1.15
352
+ [2025-10-28 07:47:41] (step=0028700) Train Loss: 0.4616, Train Steps/Sec: 1.15
353
+ [2025-10-28 07:48:45] Beginning epoch 23...
354
+ [2025-10-28 07:49:11] (step=0028800) Train Loss: 0.4615, Train Steps/Sec: 1.11
355
+ [2025-10-28 07:50:38] (step=0028900) Train Loss: 0.4599, Train Steps/Sec: 1.15
356
+ [2025-10-28 07:52:06] (step=0029000) Train Loss: 0.4603, Train Steps/Sec: 1.14
357
+ [2025-10-28 07:53:34] (step=0029100) Train Loss: 0.4616, Train Steps/Sec: 1.14
358
+ [2025-10-28 07:55:01] (step=0029200) Train Loss: 0.4613, Train Steps/Sec: 1.15
359
+ [2025-10-28 07:56:28] (step=0029300) Train Loss: 0.4603, Train Steps/Sec: 1.15
360
+ [2025-10-28 07:57:56] (step=0029400) Train Loss: 0.4606, Train Steps/Sec: 1.15
361
+ [2025-10-28 07:59:23] (step=0029500) Train Loss: 0.4605, Train Steps/Sec: 1.15
362
+ [2025-10-28 08:00:50] (step=0029600) Train Loss: 0.4599, Train Steps/Sec: 1.15
363
+ [2025-10-28 08:02:17] (step=0029700) Train Loss: 0.4603, Train Steps/Sec: 1.15
364
+ [2025-10-28 08:03:45] (step=0029800) Train Loss: 0.4588, Train Steps/Sec: 1.15
365
+ [2025-10-28 08:05:12] (step=0029900) Train Loss: 0.4594, Train Steps/Sec: 1.14
366
+ [2025-10-28 08:06:39] (step=0030000) Train Loss: 0.4604, Train Steps/Sec: 1.14
367
+ [2025-10-28 08:07:01] Beginning epoch 24...
368
+ [2025-10-28 08:08:09] (step=0030100) Train Loss: 0.4605, Train Steps/Sec: 1.11
369
+ [2025-10-28 08:09:37] (step=0030200) Train Loss: 0.4586, Train Steps/Sec: 1.15
370
+ [2025-10-28 08:11:04] (step=0030300) Train Loss: 0.4600, Train Steps/Sec: 1.14
371
+ [2025-10-28 08:12:31] (step=0030400) Train Loss: 0.4591, Train Steps/Sec: 1.15
372
+ [2025-10-28 08:13:59] (step=0030500) Train Loss: 0.4585, Train Steps/Sec: 1.15
373
+ [2025-10-28 08:15:26] (step=0030600) Train Loss: 0.4602, Train Steps/Sec: 1.15
374
+ [2025-10-28 08:16:53] (step=0030700) Train Loss: 0.4587, Train Steps/Sec: 1.14
375
+ [2025-10-28 08:18:21] (step=0030800) Train Loss: 0.4584, Train Steps/Sec: 1.14
376
+ [2025-10-28 08:19:48] (step=0030900) Train Loss: 0.4597, Train Steps/Sec: 1.15
377
+ [2025-10-28 08:21:16] (step=0031000) Train Loss: 0.4584, Train Steps/Sec: 1.15
378
+ [2025-10-28 08:22:43] (step=0031100) Train Loss: 0.4589, Train Steps/Sec: 1.14
379
+ [2025-10-28 08:24:10] (step=0031200) Train Loss: 0.4578, Train Steps/Sec: 1.15
380
+ [2025-10-28 08:25:16] Beginning epoch 25...
381
+ [2025-10-28 08:25:40] (step=0031300) Train Loss: 0.4594, Train Steps/Sec: 1.11
382
+ [2025-10-28 08:27:08] (step=0031400) Train Loss: 0.4566, Train Steps/Sec: 1.15
383
+ [2025-10-28 08:28:35] (step=0031500) Train Loss: 0.4597, Train Steps/Sec: 1.15
384
+ [2025-10-28 08:30:02] (step=0031600) Train Loss: 0.4583, Train Steps/Sec: 1.14
385
+ [2025-10-28 08:31:30] (step=0031700) Train Loss: 0.4589, Train Steps/Sec: 1.14
386
+ [2025-10-28 08:32:57] (step=0031800) Train Loss: 0.4570, Train Steps/Sec: 1.15
387
+ [2025-10-28 08:34:25] (step=0031900) Train Loss: 0.4568, Train Steps/Sec: 1.15
388
+ [2025-10-28 08:35:52] (step=0032000) Train Loss: 0.4584, Train Steps/Sec: 1.15
389
+ [2025-10-28 08:37:19] (step=0032100) Train Loss: 0.4578, Train Steps/Sec: 1.15
390
+ [2025-10-28 08:38:46] (step=0032200) Train Loss: 0.4581, Train Steps/Sec: 1.15
391
+ [2025-10-28 08:40:14] (step=0032300) Train Loss: 0.4575, Train Steps/Sec: 1.15
392
+ [2025-10-28 08:41:41] (step=0032400) Train Loss: 0.4574, Train Steps/Sec: 1.15
393
+ [2025-10-28 08:43:08] (step=0032500) Train Loss: 0.4570, Train Steps/Sec: 1.14
394
+ [2025-10-28 08:43:32] Beginning epoch 26...
395
+ [2025-10-28 08:44:39] (step=0032600) Train Loss: 0.4578, Train Steps/Sec: 1.11
396
+ [2025-10-28 08:46:06] (step=0032700) Train Loss: 0.4563, Train Steps/Sec: 1.15
397
+ [2025-10-28 08:47:33] (step=0032800) Train Loss: 0.4571, Train Steps/Sec: 1.15
398
+ [2025-10-28 08:49:00] (step=0032900) Train Loss: 0.4552, Train Steps/Sec: 1.15
399
+ [2025-10-28 08:50:28] (step=0033000) Train Loss: 0.4577, Train Steps/Sec: 1.15
400
+ [2025-10-28 08:51:55] (step=0033100) Train Loss: 0.4565, Train Steps/Sec: 1.15
401
+ [2025-10-28 08:53:22] (step=0033200) Train Loss: 0.4561, Train Steps/Sec: 1.15
402
+ [2025-10-28 08:54:49] (step=0033300) Train Loss: 0.4582, Train Steps/Sec: 1.14
403
+ [2025-10-28 08:56:17] (step=0033400) Train Loss: 0.4566, Train Steps/Sec: 1.14
404
+ [2025-10-28 08:57:44] (step=0033500) Train Loss: 0.4565, Train Steps/Sec: 1.15
405
+ [2025-10-28 08:59:12] (step=0033600) Train Loss: 0.4560, Train Steps/Sec: 1.15
406
+ [2025-10-28 09:00:39] (step=0033700) Train Loss: 0.4557, Train Steps/Sec: 1.15
407
+ [2025-10-28 09:01:47] Beginning epoch 27...
408
+ [2025-10-28 09:02:09] (step=0033800) Train Loss: 0.4554, Train Steps/Sec: 1.11
409
+ [2025-10-28 09:03:36] (step=0033900) Train Loss: 0.4555, Train Steps/Sec: 1.15
410
+ [2025-10-28 09:05:03] (step=0034000) Train Loss: 0.4546, Train Steps/Sec: 1.15
411
+ [2025-10-28 09:06:31] (step=0034100) Train Loss: 0.4557, Train Steps/Sec: 1.15
412
+ [2025-10-28 09:07:59] (step=0034200) Train Loss: 0.4556, Train Steps/Sec: 1.14
413
+ [2025-10-28 09:09:26] (step=0034300) Train Loss: 0.4557, Train Steps/Sec: 1.14
414
+ [2025-10-28 09:10:53] (step=0034400) Train Loss: 0.4560, Train Steps/Sec: 1.15
415
+ [2025-10-28 09:12:20] (step=0034500) Train Loss: 0.4557, Train Steps/Sec: 1.15
416
+ [2025-10-28 09:13:48] (step=0034600) Train Loss: 0.4556, Train Steps/Sec: 1.15
417
+ [2025-10-28 09:15:15] (step=0034700) Train Loss: 0.4555, Train Steps/Sec: 1.15
418
+ [2025-10-28 09:16:42] (step=0034800) Train Loss: 0.4557, Train Steps/Sec: 1.15
419
+ [2025-10-28 09:18:10] (step=0034900) Train Loss: 0.4561, Train Steps/Sec: 1.15
420
+ [2025-10-28 09:19:37] (step=0035000) Train Loss: 0.4542, Train Steps/Sec: 1.14
421
+ [2025-10-28 09:20:02] Beginning epoch 28...
422
+ [2025-10-28 09:21:07] (step=0035100) Train Loss: 0.4546, Train Steps/Sec: 1.11
423
+ [2025-10-28 09:22:34] (step=0035200) Train Loss: 0.4545, Train Steps/Sec: 1.15
424
+ [2025-10-28 09:24:02] (step=0035300) Train Loss: 0.4544, Train Steps/Sec: 1.15
425
+ [2025-10-28 09:25:29] (step=0035400) Train Loss: 0.4544, Train Steps/Sec: 1.15
426
+ [2025-10-28 09:26:56] (step=0035500) Train Loss: 0.4538, Train Steps/Sec: 1.15
427
+ [2025-10-28 09:28:23] (step=0035600) Train Loss: 0.4546, Train Steps/Sec: 1.15
428
+ [2025-10-28 09:29:51] (step=0035700) Train Loss: 0.4545, Train Steps/Sec: 1.15
429
+ [2025-10-28 09:31:18] (step=0035800) Train Loss: 0.4541, Train Steps/Sec: 1.15
430
+ [2025-10-28 09:32:46] (step=0035900) Train Loss: 0.4539, Train Steps/Sec: 1.14
431
+ [2025-10-28 09:34:13] (step=0036000) Train Loss: 0.4543, Train Steps/Sec: 1.14
432
+ [2025-10-28 09:35:40] (step=0036100) Train Loss: 0.4548, Train Steps/Sec: 1.15
433
+ [2025-10-28 09:37:08] (step=0036200) Train Loss: 0.4551, Train Steps/Sec: 1.15
434
+ [2025-10-28 09:38:17] Beginning epoch 29...
435
+ [2025-10-28 09:38:38] (step=0036300) Train Loss: 0.4544, Train Steps/Sec: 1.11
436
+ [2025-10-28 09:40:05] (step=0036400) Train Loss: 0.4542, Train Steps/Sec: 1.15
437
+ [2025-10-28 09:41:32] (step=0036500) Train Loss: 0.4531, Train Steps/Sec: 1.15
438
+ [2025-10-28 09:42:59] (step=0036600) Train Loss: 0.4531, Train Steps/Sec: 1.15
439
+ [2025-10-28 09:44:27] (step=0036700) Train Loss: 0.4539, Train Steps/Sec: 1.15
440
+ [2025-10-28 09:45:55] (step=0036800) Train Loss: 0.4543, Train Steps/Sec: 1.14
441
+ [2025-10-28 09:47:22] (step=0036900) Train Loss: 0.4537, Train Steps/Sec: 1.15
442
+ [2025-10-28 09:48:49] (step=0037000) Train Loss: 0.4523, Train Steps/Sec: 1.15
443
+ [2025-10-28 09:50:16] (step=0037100) Train Loss: 0.4539, Train Steps/Sec: 1.15
444
+ [2025-10-28 09:51:43] (step=0037200) Train Loss: 0.4532, Train Steps/Sec: 1.15
445
+ [2025-10-28 09:53:11] (step=0037300) Train Loss: 0.4525, Train Steps/Sec: 1.15
446
+ [2025-10-28 09:54:38] (step=0037400) Train Loss: 0.4539, Train Steps/Sec: 1.15
447
+ [2025-10-28 09:56:05] (step=0037500) Train Loss: 0.4538, Train Steps/Sec: 1.15
448
+ [2025-10-28 09:56:32] Beginning epoch 30...
449
+ [2025-10-28 09:57:35] (step=0037600) Train Loss: 0.4537, Train Steps/Sec: 1.11
450
+ [2025-10-28 09:59:03] (step=0037700) Train Loss: 0.4536, Train Steps/Sec: 1.14
451
+ [2025-10-28 10:00:30] (step=0037800) Train Loss: 0.4530, Train Steps/Sec: 1.15
452
+ [2025-10-28 10:01:57] (step=0037900) Train Loss: 0.4530, Train Steps/Sec: 1.15
453
+ [2025-10-28 10:03:25] (step=0038000) Train Loss: 0.4527, Train Steps/Sec: 1.15
454
+ [2025-10-28 10:04:52] (step=0038100) Train Loss: 0.4514, Train Steps/Sec: 1.14
455
+ [2025-10-28 10:06:19] (step=0038200) Train Loss: 0.4533, Train Steps/Sec: 1.15
456
+ [2025-10-28 10:07:46] (step=0038300) Train Loss: 0.4530, Train Steps/Sec: 1.15
457
+ [2025-10-28 10:09:14] (step=0038400) Train Loss: 0.4525, Train Steps/Sec: 1.15
458
+ [2025-10-28 10:10:41] (step=0038500) Train Loss: 0.4517, Train Steps/Sec: 1.14
459
+ [2025-10-28 10:12:08] (step=0038600) Train Loss: 0.4520, Train Steps/Sec: 1.15
460
+ [2025-10-28 10:13:36] (step=0038700) Train Loss: 0.4524, Train Steps/Sec: 1.15
461
+ [2025-10-28 10:14:47] Beginning epoch 31...
462
+ [2025-10-28 10:15:06] (step=0038800) Train Loss: 0.4532, Train Steps/Sec: 1.11
463
+ [2025-10-28 10:16:33] (step=0038900) Train Loss: 0.4507, Train Steps/Sec: 1.15
464
+ [2025-10-28 10:18:00] (step=0039000) Train Loss: 0.4529, Train Steps/Sec: 1.15
465
+ [2025-10-28 10:19:27] (step=0039100) Train Loss: 0.4514, Train Steps/Sec: 1.15
466
+ [2025-10-28 10:20:55] (step=0039200) Train Loss: 0.4513, Train Steps/Sec: 1.15
467
+ [2025-10-28 10:22:22] (step=0039300) Train Loss: 0.4535, Train Steps/Sec: 1.14
468
+ [2025-10-28 10:23:50] (step=0039400) Train Loss: 0.4523, Train Steps/Sec: 1.14
469
+ [2025-10-28 10:25:17] (step=0039500) Train Loss: 0.4524, Train Steps/Sec: 1.15
470
+ [2025-10-28 10:26:44] (step=0039600) Train Loss: 0.4514, Train Steps/Sec: 1.15
471
+ [2025-10-28 10:28:11] (step=0039700) Train Loss: 0.4518, Train Steps/Sec: 1.15
472
+ [2025-10-28 10:29:39] (step=0039800) Train Loss: 0.4509, Train Steps/Sec: 1.15
473
+ [2025-10-28 10:31:06] (step=0039900) Train Loss: 0.4517, Train Steps/Sec: 1.15
474
+ [2025-10-28 10:32:33] (step=0040000) Train Loss: 0.4517, Train Steps/Sec: 1.15
475
+ [2025-10-28 10:33:01] Beginning epoch 32...
476
+ [2025-10-28 10:34:03] (step=0040100) Train Loss: 0.4508, Train Steps/Sec: 1.11
477
+ [2025-10-28 10:35:31] (step=0040200) Train Loss: 0.4515, Train Steps/Sec: 1.14
478
+ [2025-10-28 10:36:58] (step=0040300) Train Loss: 0.4521, Train Steps/Sec: 1.15
479
+ [2025-10-28 10:38:26] (step=0040400) Train Loss: 0.4507, Train Steps/Sec: 1.15
480
+ [2025-10-28 10:39:53] (step=0040500) Train Loss: 0.4504, Train Steps/Sec: 1.15
481
+ [2025-10-28 10:41:20] (step=0040600) Train Loss: 0.4506, Train Steps/Sec: 1.15
482
+ [2025-10-28 10:42:47] (step=0040700) Train Loss: 0.4508, Train Steps/Sec: 1.15
483
+ [2025-10-28 10:44:15] (step=0040800) Train Loss: 0.4510, Train Steps/Sec: 1.15
484
+ [2025-10-28 10:45:42] (step=0040900) Train Loss: 0.4507, Train Steps/Sec: 1.15
485
+ [2025-10-28 10:47:09] (step=0041000) Train Loss: 0.4491, Train Steps/Sec: 1.14
486
+ [2025-10-28 10:48:37] (step=0041100) Train Loss: 0.4501, Train Steps/Sec: 1.15
487
+ [2025-10-28 10:50:04] (step=0041200) Train Loss: 0.4504, Train Steps/Sec: 1.14
488
+ [2025-10-28 10:51:17] Beginning epoch 33...
489
+ [2025-10-28 10:51:34] (step=0041300) Train Loss: 0.4509, Train Steps/Sec: 1.11
490
+ [2025-10-28 10:53:01] (step=0041400) Train Loss: 0.4511, Train Steps/Sec: 1.15
491
+ [2025-10-28 10:54:29] (step=0041500) Train Loss: 0.4501, Train Steps/Sec: 1.15
492
+ [2025-10-28 10:55:56] (step=0041600) Train Loss: 0.4504, Train Steps/Sec: 1.15
493
+ [2025-10-28 10:57:23] (step=0041700) Train Loss: 0.4504, Train Steps/Sec: 1.15
494
+ [2025-10-28 10:58:50] (step=0041800) Train Loss: 0.4503, Train Steps/Sec: 1.15
495
+ [2025-10-28 11:00:18] (step=0041900) Train Loss: 0.4496, Train Steps/Sec: 1.14
496
+ [2025-10-28 11:01:46] (step=0042000) Train Loss: 0.4496, Train Steps/Sec: 1.15
497
+ [2025-10-28 11:03:13] (step=0042100) Train Loss: 0.4512, Train Steps/Sec: 1.15
498
+ [2025-10-28 11:04:40] (step=0042200) Train Loss: 0.4492, Train Steps/Sec: 1.15
499
+ [2025-10-28 11:06:07] (step=0042300) Train Loss: 0.4503, Train Steps/Sec: 1.15
500
+ [2025-10-28 11:07:35] (step=0042400) Train Loss: 0.4489, Train Steps/Sec: 1.15
501
+ [2025-10-28 11:09:02] (step=0042500) Train Loss: 0.4505, Train Steps/Sec: 1.15
502
+ [2025-10-28 11:09:32] Beginning epoch 34...
503
+ [2025-10-28 11:10:32] (step=0042600) Train Loss: 0.4490, Train Steps/Sec: 1.11
504
+ [2025-10-28 11:12:00] (step=0042700) Train Loss: 0.4489, Train Steps/Sec: 1.14
505
+ [2025-10-28 11:13:27] (step=0042800) Train Loss: 0.4486, Train Steps/Sec: 1.14
506
+ [2025-10-28 11:14:55] (step=0042900) Train Loss: 0.4490, Train Steps/Sec: 1.15
507
+ [2025-10-28 11:16:22] (step=0043000) Train Loss: 0.4492, Train Steps/Sec: 1.15
508
+ [2025-10-28 11:17:49] (step=0043100) Train Loss: 0.4498, Train Steps/Sec: 1.15
509
+ [2025-10-28 11:19:17] (step=0043200) Train Loss: 0.4496, Train Steps/Sec: 1.15
510
+ [2025-10-28 11:20:44] (step=0043300) Train Loss: 0.4495, Train Steps/Sec: 1.15
511
+ [2025-10-28 11:22:11] (step=0043400) Train Loss: 0.4497, Train Steps/Sec: 1.15
512
+ [2025-10-28 11:23:38] (step=0043500) Train Loss: 0.4485, Train Steps/Sec: 1.14
513
+ [2025-10-28 11:25:06] (step=0043600) Train Loss: 0.4481, Train Steps/Sec: 1.14
514
+ [2025-10-28 11:26:33] (step=0043700) Train Loss: 0.4487, Train Steps/Sec: 1.14
515
+ [2025-10-28 11:27:48] Beginning epoch 35...
516
+ [2025-10-28 11:28:03] (step=0043800) Train Loss: 0.4501, Train Steps/Sec: 1.11
517
+ [2025-10-28 11:29:31] (step=0043900) Train Loss: 0.4486, Train Steps/Sec: 1.15
518
+ [2025-10-28 11:30:58] (step=0044000) Train Loss: 0.4493, Train Steps/Sec: 1.15
519
+ [2025-10-28 11:32:25] (step=0044100) Train Loss: 0.4479, Train Steps/Sec: 1.15
520
+ [2025-10-28 11:33:52] (step=0044200) Train Loss: 0.4488, Train Steps/Sec: 1.15
521
+ [2025-10-28 11:35:20] (step=0044300) Train Loss: 0.4483, Train Steps/Sec: 1.14
522
+ [2025-10-28 11:36:47] (step=0044400) Train Loss: 0.4485, Train Steps/Sec: 1.14
523
+ [2025-10-28 11:38:15] (step=0044500) Train Loss: 0.4487, Train Steps/Sec: 1.14
524
+ [2025-10-28 11:39:42] (step=0044600) Train Loss: 0.4485, Train Steps/Sec: 1.15
525
+ [2025-10-28 11:41:09] (step=0044700) Train Loss: 0.4483, Train Steps/Sec: 1.15
526
+ [2025-10-28 11:42:37] (step=0044800) Train Loss: 0.4492, Train Steps/Sec: 1.15
527
+ [2025-10-28 11:44:04] (step=0044900) Train Loss: 0.4475, Train Steps/Sec: 1.15
528
+ [2025-10-28 11:45:31] (step=0045000) Train Loss: 0.4482, Train Steps/Sec: 1.15
529
+ [2025-10-28 11:46:03] Beginning epoch 36...
530
+ [2025-10-28 11:47:01] (step=0045100) Train Loss: 0.4484, Train Steps/Sec: 1.11
531
+ [2025-10-28 11:48:28] (step=0045200) Train Loss: 0.4467, Train Steps/Sec: 1.15
532
+ [2025-10-28 11:49:56] (step=0045300) Train Loss: 0.4470, Train Steps/Sec: 1.14
533
+ [2025-10-28 11:51:24] (step=0045400) Train Loss: 0.4484, Train Steps/Sec: 1.15
534
+ [2025-10-28 11:52:51] (step=0045500) Train Loss: 0.4475, Train Steps/Sec: 1.15
535
+ [2025-10-28 11:54:18] (step=0045600) Train Loss: 0.4481, Train Steps/Sec: 1.15
536
+ [2025-10-28 11:55:45] (step=0045700) Train Loss: 0.4468, Train Steps/Sec: 1.15
537
+ [2025-10-28 11:57:12] (step=0045800) Train Loss: 0.4474, Train Steps/Sec: 1.15
538
+ [2025-10-28 11:58:40] (step=0045900) Train Loss: 0.4465, Train Steps/Sec: 1.15
539
+ [2025-10-28 12:00:07] (step=0046000) Train Loss: 0.4473, Train Steps/Sec: 1.15
540
+ [2025-10-28 12:01:34] (step=0046100) Train Loss: 0.4474, Train Steps/Sec: 1.14
541
+ [2025-10-28 12:03:02] (step=0046200) Train Loss: 0.4464, Train Steps/Sec: 1.14
542
+ [2025-10-28 12:04:18] Beginning epoch 37...
543
+ [2025-10-28 12:04:32] (step=0046300) Train Loss: 0.4479, Train Steps/Sec: 1.11
544
+ [2025-10-28 12:05:59] (step=0046400) Train Loss: 0.4476, Train Steps/Sec: 1.15
545
+ [2025-10-28 12:07:27] (step=0046500) Train Loss: 0.4465, Train Steps/Sec: 1.15
546
+ [2025-10-28 12:08:54] (step=0046600) Train Loss: 0.4483, Train Steps/Sec: 1.15
547
+ [2025-10-28 12:10:21] (step=0046700) Train Loss: 0.4469, Train Steps/Sec: 1.14
548
+ [2025-10-28 12:11:49] (step=0046800) Train Loss: 0.4464, Train Steps/Sec: 1.15
549
+ [2025-10-28 12:13:16] (step=0046900) Train Loss: 0.4474, Train Steps/Sec: 1.15
550
+ [2025-10-28 12:14:44] (step=0047000) Train Loss: 0.4467, Train Steps/Sec: 1.14
551
+ [2025-10-28 12:16:11] (step=0047100) Train Loss: 0.4476, Train Steps/Sec: 1.14
552
+ [2025-10-28 12:17:38] (step=0047200) Train Loss: 0.4472, Train Steps/Sec: 1.15
553
+ [2025-10-28 12:19:05] (step=0047300) Train Loss: 0.4464, Train Steps/Sec: 1.15
554
+ [2025-10-28 12:20:33] (step=0047400) Train Loss: 0.4461, Train Steps/Sec: 1.15
555
+ [2025-10-28 12:22:00] (step=0047500) Train Loss: 0.4469, Train Steps/Sec: 1.15
556
+ [2025-10-28 12:22:33] Beginning epoch 38...
557
+ [2025-10-28 12:23:30] (step=0047600) Train Loss: 0.4472, Train Steps/Sec: 1.11
558
+ [2025-10-28 12:24:57] (step=0047700) Train Loss: 0.4478, Train Steps/Sec: 1.15
559
+ [2025-10-28 12:26:25] (step=0047800) Train Loss: 0.4477, Train Steps/Sec: 1.14
560
+ [2025-10-28 12:27:52] (step=0047900) Train Loss: 0.4465, Train Steps/Sec: 1.14
561
+ [2025-10-28 12:29:20] (step=0048000) Train Loss: 0.4460, Train Steps/Sec: 1.15
562
+ [2025-10-28 12:30:47] (step=0048100) Train Loss: 0.4474, Train Steps/Sec: 1.15
563
+ [2025-10-28 12:32:14] (step=0048200) Train Loss: 0.4472, Train Steps/Sec: 1.15
564
+ [2025-10-28 12:33:41] (step=0048300) Train Loss: 0.4474, Train Steps/Sec: 1.15
565
+ [2025-10-28 12:35:09] (step=0048400) Train Loss: 0.4468, Train Steps/Sec: 1.15
566
+ [2025-10-28 12:36:36] (step=0048500) Train Loss: 0.4463, Train Steps/Sec: 1.15
567
+ [2025-10-28 12:38:03] (step=0048600) Train Loss: 0.4454, Train Steps/Sec: 1.15
568
+ [2025-10-28 12:39:30] (step=0048700) Train Loss: 0.4455, Train Steps/Sec: 1.14
569
+ [2025-10-28 12:40:49] Beginning epoch 39...
570
+ [2025-10-28 12:41:00] (step=0048800) Train Loss: 0.4456, Train Steps/Sec: 1.11
571
+ [2025-10-28 12:42:28] (step=0048900) Train Loss: 0.4456, Train Steps/Sec: 1.15
572
+ [2025-10-28 12:43:55] (step=0049000) Train Loss: 0.4451, Train Steps/Sec: 1.15
573
+ [2025-10-28 12:45:22] (step=0049100) Train Loss: 0.4455, Train Steps/Sec: 1.15
574
+ [2025-10-28 12:46:49] (step=0049200) Train Loss: 0.4453, Train Steps/Sec: 1.15
575
+ [2025-10-28 12:48:17] (step=0049300) Train Loss: 0.4459, Train Steps/Sec: 1.15
576
+ [2025-10-28 12:49:44] (step=0049400) Train Loss: 0.4446, Train Steps/Sec: 1.15
577
+ [2025-10-28 12:51:11] (step=0049500) Train Loss: 0.4452, Train Steps/Sec: 1.14
578
+ [2025-10-28 12:52:39] (step=0049600) Train Loss: 0.4453, Train Steps/Sec: 1.14
579
+ [2025-10-28 12:54:06] (step=0049700) Train Loss: 0.4465, Train Steps/Sec: 1.15
580
+ [2025-10-28 12:55:34] (step=0049800) Train Loss: 0.4457, Train Steps/Sec: 1.14
581
+ [2025-10-28 12:57:01] (step=0049900) Train Loss: 0.4452, Train Steps/Sec: 1.15
582
+ [2025-10-28 12:58:28] (step=0050000) Train Loss: 0.4457, Train Steps/Sec: 1.15
583
+ [2025-10-28 12:59:24] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-dinov3-vit-l16-bf16/checkpoints/0050000.pt
584
+ [2025-10-28 12:59:24] Generating EMA samples...
585
+ [2025-10-28 12:59:52] Generating EMA samples done.
586
+ [2025-10-28 13:00:27] Beginning epoch 40...
587
+ [2025-10-28 13:01:21] (step=0050100) Train Loss: 0.4451, Train Steps/Sec: 0.58
588
+ [2025-10-28 13:02:49] (step=0050200) Train Loss: 0.4465, Train Steps/Sec: 1.15
589
+ [2025-10-28 13:04:16] (step=0050300) Train Loss: 0.4458, Train Steps/Sec: 1.15
590
+ [2025-10-28 13:05:44] (step=0050400) Train Loss: 0.4442, Train Steps/Sec: 1.14
591
+ [2025-10-28 13:07:11] (step=0050500) Train Loss: 0.4434, Train Steps/Sec: 1.14
592
+ [2025-10-28 13:08:39] (step=0050600) Train Loss: 0.4442, Train Steps/Sec: 1.15
593
+ [2025-10-28 13:10:06] (step=0050700) Train Loss: 0.4444, Train Steps/Sec: 1.15
594
+ [2025-10-28 13:11:33] (step=0050800) Train Loss: 0.4437, Train Steps/Sec: 1.15
595
+ [2025-10-28 13:13:00] (step=0050900) Train Loss: 0.4450, Train Steps/Sec: 1.15
596
+ [2025-10-28 13:14:28] (step=0051000) Train Loss: 0.4444, Train Steps/Sec: 1.15
597
+ [2025-10-28 13:15:55] (step=0051100) Train Loss: 0.4456, Train Steps/Sec: 1.15
598
+ [2025-10-28 13:17:22] (step=0051200) Train Loss: 0.4444, Train Steps/Sec: 1.15
599
+ [2025-10-28 13:18:42] Beginning epoch 41...
600
+ [2025-10-28 13:18:52] (step=0051300) Train Loss: 0.4461, Train Steps/Sec: 1.11
601
+ [2025-10-28 13:20:20] (step=0051400) Train Loss: 0.4429, Train Steps/Sec: 1.15
602
+ [2025-10-28 13:21:47] (step=0051500) Train Loss: 0.4432, Train Steps/Sec: 1.15
603
+ [2025-10-28 13:23:14] (step=0051600) Train Loss: 0.4448, Train Steps/Sec: 1.15
604
+ [2025-10-28 13:24:41] (step=0051700) Train Loss: 0.4444, Train Steps/Sec: 1.15
605
+ [2025-10-28 13:26:09] (step=0051800) Train Loss: 0.4440, Train Steps/Sec: 1.15
606
+ [2025-10-28 13:27:36] (step=0051900) Train Loss: 0.4435, Train Steps/Sec: 1.15
607
+ [2025-10-28 13:29:03] (step=0052000) Train Loss: 0.4444, Train Steps/Sec: 1.15
608
+ [2025-10-28 13:30:31] (step=0052100) Train Loss: 0.4450, Train Steps/Sec: 1.14
609
+ [2025-10-28 13:31:59] (step=0052200) Train Loss: 0.4442, Train Steps/Sec: 1.14
610
+ [2025-10-28 13:33:26] (step=0052300) Train Loss: 0.4425, Train Steps/Sec: 1.15
611
+ [2025-10-28 13:34:53] (step=0052400) Train Loss: 0.4435, Train Steps/Sec: 1.15
612
+ [2025-10-28 13:36:20] (step=0052500) Train Loss: 0.4439, Train Steps/Sec: 1.15
613
+ [2025-10-28 13:36:57] Beginning epoch 42...
614
+ [2025-10-28 13:37:50] (step=0052600) Train Loss: 0.4432, Train Steps/Sec: 1.11
615
+ [2025-10-28 13:39:17] (step=0052700) Train Loss: 0.4433, Train Steps/Sec: 1.15
616
+ [2025-10-28 13:40:45] (step=0052800) Train Loss: 0.4433, Train Steps/Sec: 1.14
617
+ [2025-10-28 13:42:12] (step=0052900) Train Loss: 0.4434, Train Steps/Sec: 1.14
618
+ [2025-10-28 13:43:40] (step=0053000) Train Loss: 0.4450, Train Steps/Sec: 1.14
619
+ [2025-10-28 13:45:07] (step=0053100) Train Loss: 0.4431, Train Steps/Sec: 1.15
620
+ [2025-10-28 13:46:35] (step=0053200) Train Loss: 0.4437, Train Steps/Sec: 1.15
621
+ [2025-10-28 13:48:02] (step=0053300) Train Loss: 0.4429, Train Steps/Sec: 1.15
622
+ [2025-10-28 13:49:29] (step=0053400) Train Loss: 0.4431, Train Steps/Sec: 1.15
623
+ [2025-10-28 13:50:56] (step=0053500) Train Loss: 0.4440, Train Steps/Sec: 1.15
624
+ [2025-10-28 13:52:24] (step=0053600) Train Loss: 0.4423, Train Steps/Sec: 1.15
625
+ [2025-10-28 13:53:51] (step=0053700) Train Loss: 0.4421, Train Steps/Sec: 1.15
626
+ [2025-10-28 13:55:13] Beginning epoch 43...
627
+ [2025-10-28 13:55:21] (step=0053800) Train Loss: 0.4446, Train Steps/Sec: 1.11
628
+ [2025-10-28 13:56:49] (step=0053900) Train Loss: 0.4431, Train Steps/Sec: 1.14
629
+ [2025-10-28 13:58:16] (step=0054000) Train Loss: 0.4430, Train Steps/Sec: 1.15
630
+ [2025-10-28 13:59:43] (step=0054100) Train Loss: 0.4440, Train Steps/Sec: 1.15
631
+ [2025-10-28 14:01:10] (step=0054200) Train Loss: 0.4422, Train Steps/Sec: 1.15
632
+ [2025-10-28 14:02:38] (step=0054300) Train Loss: 0.4434, Train Steps/Sec: 1.15
633
+ [2025-10-28 14:04:05] (step=0054400) Train Loss: 0.4423, Train Steps/Sec: 1.14
634
+ [2025-10-28 14:05:32] (step=0054500) Train Loss: 0.4445, Train Steps/Sec: 1.15
635
+ [2025-10-28 14:07:00] (step=0054600) Train Loss: 0.4425, Train Steps/Sec: 1.14
636
+ [2025-10-28 14:08:27] (step=0054700) Train Loss: 0.4431, Train Steps/Sec: 1.14
637
+ [2025-10-28 14:09:54] (step=0054800) Train Loss: 0.4438, Train Steps/Sec: 1.15
638
+ [2025-10-28 14:11:22] (step=0054900) Train Loss: 0.4433, Train Steps/Sec: 1.15
639
+ [2025-10-28 14:12:49] (step=0055000) Train Loss: 0.4428, Train Steps/Sec: 1.15
640
+ [2025-10-28 14:13:28] Beginning epoch 44...
641
+ [2025-10-28 14:14:19] (step=0055100) Train Loss: 0.4429, Train Steps/Sec: 1.11
642
+ [2025-10-28 14:15:46] (step=0055200) Train Loss: 0.4412, Train Steps/Sec: 1.14
643
+ [2025-10-28 14:17:14] (step=0055300) Train Loss: 0.4422, Train Steps/Sec: 1.15
644
+ [2025-10-28 14:18:41] (step=0055400) Train Loss: 0.4436, Train Steps/Sec: 1.15
645
+ [2025-10-28 14:20:09] (step=0055500) Train Loss: 0.4422, Train Steps/Sec: 1.14
646
+ [2025-10-28 14:21:36] (step=0055600) Train Loss: 0.4428, Train Steps/Sec: 1.14
647
+ [2025-10-28 14:23:03] (step=0055700) Train Loss: 0.4429, Train Steps/Sec: 1.15
648
+ [2025-10-28 14:24:31] (step=0055800) Train Loss: 0.4416, Train Steps/Sec: 1.15
649
+ [2025-10-28 14:25:58] (step=0055900) Train Loss: 0.4431, Train Steps/Sec: 1.15
650
+ [2025-10-28 14:27:25] (step=0056000) Train Loss: 0.4415, Train Steps/Sec: 1.14
651
+ [2025-10-28 14:28:52] (step=0056100) Train Loss: 0.4428, Train Steps/Sec: 1.15
652
+ [2025-10-28 14:30:20] (step=0056200) Train Loss: 0.4417, Train Steps/Sec: 1.15
653
+ [2025-10-28 14:31:43] Beginning epoch 45...
654
+ [2025-10-28 14:31:50] (step=0056300) Train Loss: 0.4430, Train Steps/Sec: 1.11
655
+ [2025-10-28 14:33:17] (step=0056400) Train Loss: 0.4413, Train Steps/Sec: 1.14
656
+ [2025-10-28 14:34:45] (step=0056500) Train Loss: 0.4421, Train Steps/Sec: 1.15
657
+ [2025-10-28 14:36:12] (step=0056600) Train Loss: 0.4426, Train Steps/Sec: 1.15
658
+ [2025-10-28 14:37:39] (step=0056700) Train Loss: 0.4418, Train Steps/Sec: 1.15
659
+ [2025-10-28 14:39:06] (step=0056800) Train Loss: 0.4428, Train Steps/Sec: 1.15
660
+ [2025-10-28 14:40:34] (step=0056900) Train Loss: 0.4419, Train Steps/Sec: 1.15
661
+ [2025-10-28 14:42:01] (step=0057000) Train Loss: 0.4413, Train Steps/Sec: 1.15
662
+ [2025-10-28 14:43:28] (step=0057100) Train Loss: 0.4404, Train Steps/Sec: 1.15
663
+ [2025-10-28 14:44:56] (step=0057200) Train Loss: 0.4420, Train Steps/Sec: 1.14
664
+ [2025-10-28 14:46:23] (step=0057300) Train Loss: 0.4424, Train Steps/Sec: 1.14
665
+ [2025-10-28 14:47:50] (step=0057400) Train Loss: 0.4419, Train Steps/Sec: 1.15
666
+ [2025-10-28 14:49:18] (step=0057500) Train Loss: 0.4426, Train Steps/Sec: 1.15
667
+ [2025-10-28 14:49:58] Beginning epoch 46...
668
+ [2025-10-28 14:50:48] (step=0057600) Train Loss: 0.4420, Train Steps/Sec: 1.11
669
+ [2025-10-28 14:52:15] (step=0057700) Train Loss: 0.4412, Train Steps/Sec: 1.15
670
+ [2025-10-28 14:53:42] (step=0057800) Train Loss: 0.4406, Train Steps/Sec: 1.15
671
+ [2025-10-28 14:55:09] (step=0057900) Train Loss: 0.4425, Train Steps/Sec: 1.15
672
+ [2025-10-28 14:56:37] (step=0058000) Train Loss: 0.4416, Train Steps/Sec: 1.14
673
+ [2025-10-28 14:58:04] (step=0058100) Train Loss: 0.4418, Train Steps/Sec: 1.14
674
+ [2025-10-28 14:59:32] (step=0058200) Train Loss: 0.4419, Train Steps/Sec: 1.15
675
+ [2025-10-28 15:00:59] (step=0058300) Train Loss: 0.4417, Train Steps/Sec: 1.15
676
+ [2025-10-28 15:02:26] (step=0058400) Train Loss: 0.4422, Train Steps/Sec: 1.15
677
+ [2025-10-28 15:03:53] (step=0058500) Train Loss: 0.4416, Train Steps/Sec: 1.15
678
+ [2025-10-28 15:05:21] (step=0058600) Train Loss: 0.4409, Train Steps/Sec: 1.15
679
+ [2025-10-28 15:06:48] (step=0058700) Train Loss: 0.4414, Train Steps/Sec: 1.15
680
+ [2025-10-28 15:08:13] Beginning epoch 47...
681
+ [2025-10-28 15:08:18] (step=0058800) Train Loss: 0.4410, Train Steps/Sec: 1.11
682
+ [2025-10-28 15:09:46] (step=0058900) Train Loss: 0.4413, Train Steps/Sec: 1.14
683
+ [2025-10-28 15:11:13] (step=0059000) Train Loss: 0.4417, Train Steps/Sec: 1.14
684
+ [2025-10-28 15:12:41] (step=0059100) Train Loss: 0.4396, Train Steps/Sec: 1.14
685
+ [2025-10-28 15:14:08] (step=0059200) Train Loss: 0.4404, Train Steps/Sec: 1.15
686
+ [2025-10-28 15:15:35] (step=0059300) Train Loss: 0.4404, Train Steps/Sec: 1.15
687
+ [2025-10-28 15:17:02] (step=0059400) Train Loss: 0.4419, Train Steps/Sec: 1.15
688
+ [2025-10-28 15:18:30] (step=0059500) Train Loss: 0.4397, Train Steps/Sec: 1.15
689
+ [2025-10-28 15:19:57] (step=0059600) Train Loss: 0.4402, Train Steps/Sec: 1.15
690
+ [2025-10-28 15:21:24] (step=0059700) Train Loss: 0.4410, Train Steps/Sec: 1.14
691
+ [2025-10-28 15:22:52] (step=0059800) Train Loss: 0.4404, Train Steps/Sec: 1.14
692
+ [2025-10-28 15:24:19] (step=0059900) Train Loss: 0.4415, Train Steps/Sec: 1.15
693
+ [2025-10-28 15:25:46] (step=0060000) Train Loss: 0.4400, Train Steps/Sec: 1.15
694
+ [2025-10-28 15:26:29] Beginning epoch 48...
695
+ [2025-10-28 15:27:17] (step=0060100) Train Loss: 0.4408, Train Steps/Sec: 1.11
696
+ [2025-10-28 15:28:44] (step=0060200) Train Loss: 0.4406, Train Steps/Sec: 1.15
697
+ [2025-10-28 15:30:11] (step=0060300) Train Loss: 0.4399, Train Steps/Sec: 1.15
698
+ [2025-10-28 15:31:38] (step=0060400) Train Loss: 0.4406, Train Steps/Sec: 1.15
699
+ [2025-10-28 15:33:05] (step=0060500) Train Loss: 0.4411, Train Steps/Sec: 1.15
700
+ [2025-10-28 15:34:33] (step=0060600) Train Loss: 0.4412, Train Steps/Sec: 1.14
701
+ [2025-10-28 15:36:01] (step=0060700) Train Loss: 0.4394, Train Steps/Sec: 1.14
702
+ [2025-10-28 15:37:28] (step=0060800) Train Loss: 0.4410, Train Steps/Sec: 1.15
703
+ [2025-10-28 15:38:55] (step=0060900) Train Loss: 0.4402, Train Steps/Sec: 1.15
704
+ [2025-10-28 15:40:22] (step=0061000) Train Loss: 0.4411, Train Steps/Sec: 1.15
705
+ [2025-10-28 15:41:50] (step=0061100) Train Loss: 0.4391, Train Steps/Sec: 1.15
706
+ [2025-10-28 15:43:17] (step=0061200) Train Loss: 0.4392, Train Steps/Sec: 1.15
707
+ [2025-10-28 15:44:44] Beginning epoch 49...
708
+ [2025-10-28 15:44:47] (step=0061300) Train Loss: 0.4402, Train Steps/Sec: 1.11
709
+ [2025-10-28 15:46:14] (step=0061400) Train Loss: 0.4400, Train Steps/Sec: 1.14
710
+ [2025-10-28 15:47:42] (step=0061500) Train Loss: 0.4405, Train Steps/Sec: 1.14
711
+ [2025-10-28 15:49:09] (step=0061600) Train Loss: 0.4397, Train Steps/Sec: 1.15
712
+ [2025-10-28 15:50:37] (step=0061700) Train Loss: 0.4405, Train Steps/Sec: 1.15
713
+ [2025-10-28 15:52:04] (step=0061800) Train Loss: 0.4407, Train Steps/Sec: 1.15
714
+ [2025-10-28 15:53:31] (step=0061900) Train Loss: 0.4390, Train Steps/Sec: 1.15
715
+ [2025-10-28 15:54:58] (step=0062000) Train Loss: 0.4392, Train Steps/Sec: 1.15
716
+ [2025-10-28 15:56:25] (step=0062100) Train Loss: 0.4397, Train Steps/Sec: 1.15
717
+ [2025-10-28 15:57:53] (step=0062200) Train Loss: 0.4406, Train Steps/Sec: 1.14
718
+ [2025-10-28 15:59:20] (step=0062300) Train Loss: 0.4393, Train Steps/Sec: 1.14
719
+ [2025-10-28 16:00:48] (step=0062400) Train Loss: 0.4392, Train Steps/Sec: 1.14
720
+ [2025-10-28 16:02:15] (step=0062500) Train Loss: 0.4403, Train Steps/Sec: 1.15
721
+ [2025-10-28 16:02:59] Beginning epoch 50...
722
+ [2025-10-28 16:03:45] (step=0062600) Train Loss: 0.4393, Train Steps/Sec: 1.11
723
+ [2025-10-28 16:05:12] (step=0062700) Train Loss: 0.4388, Train Steps/Sec: 1.15
724
+ [2025-10-28 16:06:40] (step=0062800) Train Loss: 0.4396, Train Steps/Sec: 1.15
725
+ [2025-10-28 16:08:07] (step=0062900) Train Loss: 0.4399, Train Steps/Sec: 1.15
726
+ [2025-10-28 16:09:34] (step=0063000) Train Loss: 0.4387, Train Steps/Sec: 1.15
727
+ [2025-10-28 16:11:01] (step=0063100) Train Loss: 0.4403, Train Steps/Sec: 1.15
728
+ [2025-10-28 16:12:29] (step=0063200) Train Loss: 0.4399, Train Steps/Sec: 1.14
729
+ [2025-10-28 16:13:56] (step=0063300) Train Loss: 0.4400, Train Steps/Sec: 1.14
730
+ [2025-10-28 16:15:24] (step=0063400) Train Loss: 0.4398, Train Steps/Sec: 1.15
731
+ [2025-10-28 16:16:51] (step=0063500) Train Loss: 0.4381, Train Steps/Sec: 1.15
732
+ [2025-10-28 16:18:18] (step=0063600) Train Loss: 0.4383, Train Steps/Sec: 1.15
733
+ [2025-10-28 16:19:45] (step=0063700) Train Loss: 0.4409, Train Steps/Sec: 1.15
734
+ [2025-10-28 16:21:13] (step=0063800) Train Loss: 0.4398, Train Steps/Sec: 1.15
735
+ [2025-10-28 16:21:14] Beginning epoch 51...
736
+ [2025-10-28 16:22:43] (step=0063900) Train Loss: 0.4379, Train Steps/Sec: 1.11
737
+ [2025-10-28 16:24:11] (step=0064000) Train Loss: 0.4398, Train Steps/Sec: 1.14
738
+ [2025-10-28 16:25:38] (step=0064100) Train Loss: 0.4402, Train Steps/Sec: 1.14
739
+ [2025-10-28 16:27:05] (step=0064200) Train Loss: 0.4403, Train Steps/Sec: 1.15
740
+ [2025-10-28 16:28:33] (step=0064300) Train Loss: 0.4388, Train Steps/Sec: 1.15
741
+ [2025-10-28 16:30:00] (step=0064400) Train Loss: 0.4381, Train Steps/Sec: 1.15
742
+ [2025-10-28 16:31:27] (step=0064500) Train Loss: 0.4379, Train Steps/Sec: 1.15
743
+ [2025-10-28 16:32:55] (step=0064600) Train Loss: 0.4374, Train Steps/Sec: 1.15
744
+ [2025-10-28 16:34:22] (step=0064700) Train Loss: 0.4390, Train Steps/Sec: 1.15
745
+ [2025-10-28 16:35:49] (step=0064800) Train Loss: 0.4384, Train Steps/Sec: 1.15
746
+ [2025-10-28 16:37:17] (step=0064900) Train Loss: 0.4385, Train Steps/Sec: 1.14
747
+ [2025-10-28 16:38:44] (step=0065000) Train Loss: 0.4387, Train Steps/Sec: 1.14
748
+ [2025-10-28 16:39:30] Beginning epoch 52...
749
+ [2025-10-28 16:40:14] (step=0065100) Train Loss: 0.4382, Train Steps/Sec: 1.11
750
+ [2025-10-28 16:41:42] (step=0065200) Train Loss: 0.4377, Train Steps/Sec: 1.15
751
+ [2025-10-28 16:43:09] (step=0065300) Train Loss: 0.4368, Train Steps/Sec: 1.14
752
+ [2025-10-28 16:44:36] (step=0065400) Train Loss: 0.4378, Train Steps/Sec: 1.15
753
+ [2025-10-28 16:46:04] (step=0065500) Train Loss: 0.4397, Train Steps/Sec: 1.15
754
+ [2025-10-28 16:47:31] (step=0065600) Train Loss: 0.4390, Train Steps/Sec: 1.15
755
+ [2025-10-28 16:48:58] (step=0065700) Train Loss: 0.4383, Train Steps/Sec: 1.14
756
+ [2025-10-28 16:50:26] (step=0065800) Train Loss: 0.4382, Train Steps/Sec: 1.14
757
+ [2025-10-28 16:51:53] (step=0065900) Train Loss: 0.4386, Train Steps/Sec: 1.15
758
+ [2025-10-28 16:53:20] (step=0066000) Train Loss: 0.4384, Train Steps/Sec: 1.15
759
+ [2025-10-28 16:54:48] (step=0066100) Train Loss: 0.4387, Train Steps/Sec: 1.14
760
+ [2025-10-28 16:56:15] (step=0066200) Train Loss: 0.4388, Train Steps/Sec: 1.15
761
+ [2025-10-28 16:57:42] (step=0066300) Train Loss: 0.4398, Train Steps/Sec: 1.15
762
+ [2025-10-28 16:57:45] Beginning epoch 53...
763
+ [2025-10-28 16:59:12] (step=0066400) Train Loss: 0.4382, Train Steps/Sec: 1.11
764
+ [2025-10-28 17:00:40] (step=0066500) Train Loss: 0.4379, Train Steps/Sec: 1.15
765
+ [2025-10-28 17:02:07] (step=0066600) Train Loss: 0.4376, Train Steps/Sec: 1.14
766
+ [2025-10-28 17:03:35] (step=0066700) Train Loss: 0.4368, Train Steps/Sec: 1.14
767
+ [2025-10-28 17:05:02] (step=0066800) Train Loss: 0.4388, Train Steps/Sec: 1.15
768
+ [2025-10-28 17:06:29] (step=0066900) Train Loss: 0.4377, Train Steps/Sec: 1.14
769
+ [2025-10-28 17:07:57] (step=0067000) Train Loss: 0.4377, Train Steps/Sec: 1.15
770
+ [2025-10-28 17:09:24] (step=0067100) Train Loss: 0.4380, Train Steps/Sec: 1.15
771
+ [2025-10-28 17:10:51] (step=0067200) Train Loss: 0.4376, Train Steps/Sec: 1.15
772
+ [2025-10-28 17:12:18] (step=0067300) Train Loss: 0.4377, Train Steps/Sec: 1.15
773
+ [2025-10-28 17:13:46] (step=0067400) Train Loss: 0.4380, Train Steps/Sec: 1.14
774
+ [2025-10-28 17:15:13] (step=0067500) Train Loss: 0.4378, Train Steps/Sec: 1.14
775
+ [2025-10-28 17:16:01] Beginning epoch 54...
776
+ [2025-10-28 17:16:43] (step=0067600) Train Loss: 0.4377, Train Steps/Sec: 1.11
777
+ [2025-10-28 17:18:11] (step=0067700) Train Loss: 0.4364, Train Steps/Sec: 1.15
778
+ [2025-10-28 17:19:38] (step=0067800) Train Loss: 0.4378, Train Steps/Sec: 1.15
779
+ [2025-10-28 17:21:05] (step=0067900) Train Loss: 0.4368, Train Steps/Sec: 1.15
780
+ [2025-10-28 17:22:32] (step=0068000) Train Loss: 0.4373, Train Steps/Sec: 1.15
781
+ [2025-10-28 17:24:00] (step=0068100) Train Loss: 0.4378, Train Steps/Sec: 1.15
782
+ [2025-10-28 17:25:27] (step=0068200) Train Loss: 0.4377, Train Steps/Sec: 1.15
783
+ [2025-10-28 17:26:54] (step=0068300) Train Loss: 0.4367, Train Steps/Sec: 1.14
784
+ [2025-10-28 17:28:22] (step=0068400) Train Loss: 0.4373, Train Steps/Sec: 1.14
785
+ [2025-10-28 17:29:49] (step=0068500) Train Loss: 0.4373, Train Steps/Sec: 1.15
786
+ [2025-10-28 17:31:16] (step=0068600) Train Loss: 0.4384, Train Steps/Sec: 1.15
787
+ [2025-10-28 17:32:43] (step=0068700) Train Loss: 0.4374, Train Steps/Sec: 1.15
788
+ [2025-10-28 17:34:11] (step=0068800) Train Loss: 0.4360, Train Steps/Sec: 1.15
789
+ [2025-10-28 17:34:16] Beginning epoch 55...
790
+ [2025-10-28 17:35:41] (step=0068900) Train Loss: 0.4380, Train Steps/Sec: 1.11
791
+ [2025-10-28 17:37:08] (step=0069000) Train Loss: 0.4373, Train Steps/Sec: 1.15
792
+ [2025-10-28 17:38:36] (step=0069100) Train Loss: 0.4371, Train Steps/Sec: 1.14
793
+ [2025-10-28 17:40:04] (step=0069200) Train Loss: 0.4363, Train Steps/Sec: 1.14
794
+ [2025-10-28 17:41:31] (step=0069300) Train Loss: 0.4377, Train Steps/Sec: 1.15
795
+ [2025-10-28 17:42:58] (step=0069400) Train Loss: 0.4361, Train Steps/Sec: 1.15
796
+ [2025-10-28 17:44:25] (step=0069500) Train Loss: 0.4380, Train Steps/Sec: 1.15
797
+ [2025-10-28 17:45:52] (step=0069600) Train Loss: 0.4377, Train Steps/Sec: 1.15
798
+ [2025-10-28 17:47:20] (step=0069700) Train Loss: 0.4361, Train Steps/Sec: 1.15
799
+ [2025-10-28 17:48:47] (step=0069800) Train Loss: 0.4359, Train Steps/Sec: 1.15
800
+ [2025-10-28 17:50:14] (step=0069900) Train Loss: 0.4363, Train Steps/Sec: 1.15
801
+ [2025-10-28 17:51:42] (step=0070000) Train Loss: 0.4368, Train Steps/Sec: 1.14
802
+ [2025-10-28 17:52:31] Beginning epoch 56...
803
+ [2025-10-28 17:53:12] (step=0070100) Train Loss: 0.4356, Train Steps/Sec: 1.11
804
+ [2025-10-28 17:54:39] (step=0070200) Train Loss: 0.4376, Train Steps/Sec: 1.15
805
+ [2025-10-28 17:56:06] (step=0070300) Train Loss: 0.4364, Train Steps/Sec: 1.15
806
+ [2025-10-28 17:57:33] (step=0070400) Train Loss: 0.4368, Train Steps/Sec: 1.15
807
+ [2025-10-28 17:59:01] (step=0070500) Train Loss: 0.4370, Train Steps/Sec: 1.15
808
+ [2025-10-28 18:00:28] (step=0070600) Train Loss: 0.4356, Train Steps/Sec: 1.15
809
+ [2025-10-28 18:01:55] (step=0070700) Train Loss: 0.4374, Train Steps/Sec: 1.15
810
+ [2025-10-28 18:03:23] (step=0070800) Train Loss: 0.4365, Train Steps/Sec: 1.14
811
+ [2025-10-28 18:04:50] (step=0070900) Train Loss: 0.4365, Train Steps/Sec: 1.14
812
+ [2025-10-28 18:06:18] (step=0071000) Train Loss: 0.4367, Train Steps/Sec: 1.15
813
+ [2025-10-28 18:07:45] (step=0071100) Train Loss: 0.4379, Train Steps/Sec: 1.15
814
+ [2025-10-28 18:09:12] (step=0071200) Train Loss: 0.4374, Train Steps/Sec: 1.15
815
+ [2025-10-28 18:10:39] (step=0071300) Train Loss: 0.4367, Train Steps/Sec: 1.15
816
+ [2025-10-28 18:10:46] Beginning epoch 57...
817
+ [2025-10-28 18:12:10] (step=0071400) Train Loss: 0.4368, Train Steps/Sec: 1.11
818
+ [2025-10-28 18:13:37] (step=0071500) Train Loss: 0.4356, Train Steps/Sec: 1.15
819
+ [2025-10-28 18:15:04] (step=0071600) Train Loss: 0.4366, Train Steps/Sec: 1.15
820
+ [2025-10-28 18:16:32] (step=0071700) Train Loss: 0.4365, Train Steps/Sec: 1.14
821
+ [2025-10-28 18:17:59] (step=0071800) Train Loss: 0.4364, Train Steps/Sec: 1.14
822
+ [2025-10-28 18:19:26] (step=0071900) Train Loss: 0.4353, Train Steps/Sec: 1.15
823
+ [2025-10-28 18:20:54] (step=0072000) Train Loss: 0.4364, Train Steps/Sec: 1.15
824
+ [2025-10-28 18:22:21] (step=0072100) Train Loss: 0.4365, Train Steps/Sec: 1.15
825
+ [2025-10-28 18:23:48] (step=0072200) Train Loss: 0.4363, Train Steps/Sec: 1.15
826
+ [2025-10-28 18:25:15] (step=0072300) Train Loss: 0.4357, Train Steps/Sec: 1.15
827
+ [2025-10-28 18:26:43] (step=0072400) Train Loss: 0.4360, Train Steps/Sec: 1.15
828
+ [2025-10-28 18:28:10] (step=0072500) Train Loss: 0.4363, Train Steps/Sec: 1.15
829
+ [2025-10-28 18:29:01] Beginning epoch 58...
830
+ [2025-10-28 18:29:41] (step=0072600) Train Loss: 0.4364, Train Steps/Sec: 1.10
831
+ [2025-10-28 18:31:08] (step=0072700) Train Loss: 0.4359, Train Steps/Sec: 1.15
832
+ [2025-10-28 18:32:35] (step=0072800) Train Loss: 0.4364, Train Steps/Sec: 1.15
833
+ [2025-10-28 18:34:02] (step=0072900) Train Loss: 0.4344, Train Steps/Sec: 1.15
834
+ [2025-10-28 18:35:30] (step=0073000) Train Loss: 0.4353, Train Steps/Sec: 1.15
835
+ [2025-10-28 18:36:57] (step=0073100) Train Loss: 0.4360, Train Steps/Sec: 1.14
836
+ [2025-10-28 18:38:24] (step=0073200) Train Loss: 0.4360, Train Steps/Sec: 1.15
837
+ [2025-10-28 18:39:51] (step=0073300) Train Loss: 0.4364, Train Steps/Sec: 1.15
838
+ [2025-10-28 18:41:19] (step=0073400) Train Loss: 0.4348, Train Steps/Sec: 1.14
839
+ [2025-10-28 18:42:46] (step=0073500) Train Loss: 0.4372, Train Steps/Sec: 1.14
840
+ [2025-10-28 18:44:14] (step=0073600) Train Loss: 0.4349, Train Steps/Sec: 1.15
841
+ [2025-10-28 18:45:41] (step=0073700) Train Loss: 0.4356, Train Steps/Sec: 1.15
842
+ [2025-10-28 18:47:08] (step=0073800) Train Loss: 0.4363, Train Steps/Sec: 1.15
843
+ [2025-10-28 18:47:16] Beginning epoch 59...
844
+ [2025-10-28 18:48:38] (step=0073900) Train Loss: 0.4356, Train Steps/Sec: 1.11
845
+ [2025-10-28 18:50:05] (step=0074000) Train Loss: 0.4349, Train Steps/Sec: 1.15
846
+ [2025-10-28 18:51:33] (step=0074100) Train Loss: 0.4356, Train Steps/Sec: 1.15
847
+ [2025-10-28 18:53:00] (step=0074200) Train Loss: 0.4341, Train Steps/Sec: 1.14
848
+ [2025-10-28 18:54:28] (step=0074300) Train Loss: 0.4363, Train Steps/Sec: 1.14
849
+ [2025-10-28 18:55:55] (step=0074400) Train Loss: 0.4343, Train Steps/Sec: 1.14
850
+ [2025-10-28 18:57:22] (step=0074500) Train Loss: 0.4350, Train Steps/Sec: 1.15
851
+ [2025-10-28 18:58:50] (step=0074600) Train Loss: 0.4368, Train Steps/Sec: 1.15
852
+ [2025-10-28 19:00:17] (step=0074700) Train Loss: 0.4349, Train Steps/Sec: 1.14
853
+ [2025-10-28 19:01:44] (step=0074800) Train Loss: 0.4348, Train Steps/Sec: 1.15
854
+ [2025-10-28 19:03:11] (step=0074900) Train Loss: 0.4350, Train Steps/Sec: 1.15
855
+ [2025-10-28 19:04:39] (step=0075000) Train Loss: 0.4346, Train Steps/Sec: 1.15
856
+ [2025-10-28 19:05:35] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-dinov3-vit-l16-bf16/checkpoints/0075000.pt
857
+ [2025-10-28 19:05:35] Generating EMA samples...
858
+ [2025-10-28 19:06:02] Generating EMA samples done.
859
+ [2025-10-28 19:06:55] Beginning epoch 60...
860
+ [2025-10-28 19:07:33] (step=0075100) Train Loss: 0.4342, Train Steps/Sec: 0.58
861
+ [2025-10-28 19:09:00] (step=0075200) Train Loss: 0.4350, Train Steps/Sec: 1.14
862
+ [2025-10-28 19:10:28] (step=0075300) Train Loss: 0.4346, Train Steps/Sec: 1.15
863
+ [2025-10-28 19:11:55] (step=0075400) Train Loss: 0.4366, Train Steps/Sec: 1.14
864
+ [2025-10-28 19:13:22] (step=0075500) Train Loss: 0.4361, Train Steps/Sec: 1.15
865
+ [2025-10-28 19:14:49] (step=0075600) Train Loss: 0.4344, Train Steps/Sec: 1.15
866
+ [2025-10-28 19:16:17] (step=0075700) Train Loss: 0.4358, Train Steps/Sec: 1.15
867
+ [2025-10-28 19:17:44] (step=0075800) Train Loss: 0.4333, Train Steps/Sec: 1.15
868
+ [2025-10-28 19:19:11] (step=0075900) Train Loss: 0.4366, Train Steps/Sec: 1.15
869
+ [2025-10-28 19:20:39] (step=0076000) Train Loss: 0.4352, Train Steps/Sec: 1.14
870
+ [2025-10-28 19:22:06] (step=0076100) Train Loss: 0.4346, Train Steps/Sec: 1.15
871
+ [2025-10-28 19:23:33] (step=0076200) Train Loss: 0.4350, Train Steps/Sec: 1.14
872
+ [2025-10-28 19:25:01] (step=0076300) Train Loss: 0.4353, Train Steps/Sec: 1.15
873
+ [2025-10-28 19:25:11] Beginning epoch 61...
874
+ [2025-10-28 19:26:31] (step=0076400) Train Loss: 0.4347, Train Steps/Sec: 1.11
875
+ [2025-10-28 19:27:58] (step=0076500) Train Loss: 0.4352, Train Steps/Sec: 1.15
876
+ [2025-10-28 19:29:25] (step=0076600) Train Loss: 0.4341, Train Steps/Sec: 1.15
877
+ [2025-10-28 19:30:53] (step=0076700) Train Loss: 0.4350, Train Steps/Sec: 1.15
878
+ [2025-10-28 19:32:20] (step=0076800) Train Loss: 0.4340, Train Steps/Sec: 1.14
879
+ [2025-10-28 19:33:48] (step=0076900) Train Loss: 0.4347, Train Steps/Sec: 1.14
880
+ [2025-10-28 19:35:15] (step=0077000) Train Loss: 0.4329, Train Steps/Sec: 1.15
881
+ [2025-10-28 19:36:43] (step=0077100) Train Loss: 0.4359, Train Steps/Sec: 1.15
882
+ [2025-10-28 19:38:10] (step=0077200) Train Loss: 0.4337, Train Steps/Sec: 1.15
883
+ [2025-10-28 19:39:37] (step=0077300) Train Loss: 0.4345, Train Steps/Sec: 1.15
884
+ [2025-10-28 19:41:05] (step=0077400) Train Loss: 0.4344, Train Steps/Sec: 1.15
885
+ [2025-10-28 19:42:32] (step=0077500) Train Loss: 0.4353, Train Steps/Sec: 1.15
886
+ [2025-10-28 19:43:26] Beginning epoch 62...
887
+ [2025-10-28 19:44:02] (step=0077600) Train Loss: 0.4345, Train Steps/Sec: 1.11
888
+ [2025-10-28 19:45:30] (step=0077700) Train Loss: 0.4336, Train Steps/Sec: 1.14
889
+ [2025-10-28 19:46:57] (step=0077800) Train Loss: 0.4335, Train Steps/Sec: 1.14
890
+ [2025-10-28 19:48:24] (step=0077900) Train Loss: 0.4344, Train Steps/Sec: 1.15
891
+ [2025-10-28 19:49:52] (step=0078000) Train Loss: 0.4338, Train Steps/Sec: 1.15
892
+ [2025-10-28 19:51:19] (step=0078100) Train Loss: 0.4341, Train Steps/Sec: 1.15
893
+ [2025-10-28 19:52:46] (step=0078200) Train Loss: 0.4346, Train Steps/Sec: 1.15
894
+ [2025-10-28 19:54:13] (step=0078300) Train Loss: 0.4347, Train Steps/Sec: 1.15
895
+ [2025-10-28 19:55:41] (step=0078400) Train Loss: 0.4337, Train Steps/Sec: 1.15
896
+ [2025-10-28 19:57:08] (step=0078500) Train Loss: 0.4336, Train Steps/Sec: 1.14
897
+ [2025-10-28 19:58:35] (step=0078600) Train Loss: 0.4325, Train Steps/Sec: 1.14
898
+ [2025-10-28 20:00:03] (step=0078700) Train Loss: 0.4334, Train Steps/Sec: 1.15
899
+ [2025-10-28 20:01:30] (step=0078800) Train Loss: 0.4334, Train Steps/Sec: 1.15
900
+ [2025-10-28 20:01:42] Beginning epoch 63...
901
+ [2025-10-28 20:03:00] (step=0078900) Train Loss: 0.4344, Train Steps/Sec: 1.11
902
+ [2025-10-28 20:04:27] (step=0079000) Train Loss: 0.4331, Train Steps/Sec: 1.15
903
+ [2025-10-28 20:05:54] (step=0079100) Train Loss: 0.4341, Train Steps/Sec: 1.15
904
+ [2025-10-28 20:07:22] (step=0079200) Train Loss: 0.4336, Train Steps/Sec: 1.15
905
+ [2025-10-28 20:08:49] (step=0079300) Train Loss: 0.4345, Train Steps/Sec: 1.14
906
+ [2025-10-28 20:10:17] (step=0079400) Train Loss: 0.4343, Train Steps/Sec: 1.14
907
+ [2025-10-28 20:11:44] (step=0079500) Train Loss: 0.4333, Train Steps/Sec: 1.14
908
+ [2025-10-28 20:13:12] (step=0079600) Train Loss: 0.4339, Train Steps/Sec: 1.15
909
+ [2025-10-28 20:14:39] (step=0079700) Train Loss: 0.4347, Train Steps/Sec: 1.15
910
+ [2025-10-28 20:16:06] (step=0079800) Train Loss: 0.4317, Train Steps/Sec: 1.15
911
+ [2025-10-28 20:17:33] (step=0079900) Train Loss: 0.4339, Train Steps/Sec: 1.15
912
+ [2025-10-28 20:19:01] (step=0080000) Train Loss: 0.4345, Train Steps/Sec: 1.15
913
+ [2025-10-28 20:19:57] Beginning epoch 64...
914
+ [2025-10-28 20:20:31] (step=0080100) Train Loss: 0.4333, Train Steps/Sec: 1.11
915
+ [2025-10-28 20:21:58] (step=0080200) Train Loss: 0.4337, Train Steps/Sec: 1.14
916
+ [2025-10-28 20:23:26] (step=0080300) Train Loss: 0.4335, Train Steps/Sec: 1.14
917
+ [2025-10-28 20:24:53] (step=0080400) Train Loss: 0.4337, Train Steps/Sec: 1.15
918
+ [2025-10-28 20:26:20] (step=0080500) Train Loss: 0.4333, Train Steps/Sec: 1.15
919
+ [2025-10-28 20:27:48] (step=0080600) Train Loss: 0.4331, Train Steps/Sec: 1.15
920
+ [2025-10-28 20:29:15] (step=0080700) Train Loss: 0.4333, Train Steps/Sec: 1.15
921
+ [2025-10-28 20:30:42] (step=0080800) Train Loss: 0.4340, Train Steps/Sec: 1.15
922
+ [2025-10-28 20:32:09] (step=0080900) Train Loss: 0.4334, Train Steps/Sec: 1.15
923
+ [2025-10-28 20:33:37] (step=0081000) Train Loss: 0.4329, Train Steps/Sec: 1.14
924
+ [2025-10-28 20:35:04] (step=0081100) Train Loss: 0.4338, Train Steps/Sec: 1.14
925
+ [2025-10-28 20:36:32] (step=0081200) Train Loss: 0.4318, Train Steps/Sec: 1.14
926
+ [2025-10-28 20:37:59] (step=0081300) Train Loss: 0.4337, Train Steps/Sec: 1.15
927
+ [2025-10-28 20:38:12] Beginning epoch 65...
928
+ [2025-10-28 20:39:29] (step=0081400) Train Loss: 0.4333, Train Steps/Sec: 1.11
929
+ [2025-10-28 20:40:56] (step=0081500) Train Loss: 0.4327, Train Steps/Sec: 1.15
930
+ [2025-10-28 20:42:24] (step=0081600) Train Loss: 0.4330, Train Steps/Sec: 1.15
931
+ [2025-10-28 20:43:51] (step=0081700) Train Loss: 0.4325, Train Steps/Sec: 1.15
932
+ [2025-10-28 20:45:18] (step=0081800) Train Loss: 0.4329, Train Steps/Sec: 1.15
933
+ [2025-10-28 20:46:46] (step=0081900) Train Loss: 0.4327, Train Steps/Sec: 1.14
934
+ [2025-10-28 20:48:14] (step=0082000) Train Loss: 0.4327, Train Steps/Sec: 1.14
935
+ [2025-10-28 20:49:41] (step=0082100) Train Loss: 0.4322, Train Steps/Sec: 1.14
936
+ [2025-10-28 20:51:08] (step=0082200) Train Loss: 0.4327, Train Steps/Sec: 1.15
937
+ [2025-10-28 20:52:35] (step=0082300) Train Loss: 0.4324, Train Steps/Sec: 1.15
938
+ [2025-10-28 20:54:03] (step=0082400) Train Loss: 0.4325, Train Steps/Sec: 1.15
939
+ [2025-10-28 20:55:30] (step=0082500) Train Loss: 0.4326, Train Steps/Sec: 1.15
940
+ [2025-10-28 20:56:28] Beginning epoch 66...
941
+ [2025-10-28 20:57:00] (step=0082600) Train Loss: 0.4323, Train Steps/Sec: 1.11
942
+ [2025-10-28 20:58:27] (step=0082700) Train Loss: 0.4318, Train Steps/Sec: 1.14
943
+ [2025-10-28 20:59:55] (step=0082800) Train Loss: 0.4317, Train Steps/Sec: 1.14
944
+ [2025-10-28 21:01:22] (step=0082900) Train Loss: 0.4324, Train Steps/Sec: 1.14
945
+ [2025-10-28 21:02:50] (step=0083000) Train Loss: 0.4322, Train Steps/Sec: 1.15
946
+ [2025-10-28 21:04:17] (step=0083100) Train Loss: 0.4325, Train Steps/Sec: 1.15
947
+ [2025-10-28 21:05:44] (step=0083200) Train Loss: 0.4316, Train Steps/Sec: 1.14
948
+ [2025-10-28 21:07:11] (step=0083300) Train Loss: 0.4332, Train Steps/Sec: 1.15
949
+ [2025-10-28 21:08:39] (step=0083400) Train Loss: 0.4323, Train Steps/Sec: 1.15
950
+ [2025-10-28 21:10:06] (step=0083500) Train Loss: 0.4320, Train Steps/Sec: 1.15
951
+ [2025-10-28 21:11:33] (step=0083600) Train Loss: 0.4332, Train Steps/Sec: 1.15
952
+ [2025-10-28 21:13:01] (step=0083700) Train Loss: 0.4310, Train Steps/Sec: 1.14
953
+ [2025-10-28 21:14:28] (step=0083800) Train Loss: 0.4328, Train Steps/Sec: 1.15
954
+ [2025-10-28 21:14:44] Beginning epoch 67...
955
+ [2025-10-28 21:15:59] (step=0083900) Train Loss: 0.4331, Train Steps/Sec: 1.11
956
+ [2025-10-28 21:17:26] (step=0084000) Train Loss: 0.4305, Train Steps/Sec: 1.14
957
+ [2025-10-28 21:18:53] (step=0084100) Train Loss: 0.4314, Train Steps/Sec: 1.15
958
+ [2025-10-28 21:20:20] (step=0084200) Train Loss: 0.4325, Train Steps/Sec: 1.15
959
+ [2025-10-28 21:21:48] (step=0084300) Train Loss: 0.4324, Train Steps/Sec: 1.15
960
+ [2025-10-28 21:23:15] (step=0084400) Train Loss: 0.4302, Train Steps/Sec: 1.15
961
+ [2025-10-28 21:24:43] (step=0084500) Train Loss: 0.4318, Train Steps/Sec: 1.14
962
+ [2025-10-28 21:26:10] (step=0084600) Train Loss: 0.4328, Train Steps/Sec: 1.14
963
+ [2025-10-28 21:27:37] (step=0084700) Train Loss: 0.4325, Train Steps/Sec: 1.15
964
+ [2025-10-28 21:29:05] (step=0084800) Train Loss: 0.4321, Train Steps/Sec: 1.15
965
+ [2025-10-28 21:30:32] (step=0084900) Train Loss: 0.4332, Train Steps/Sec: 1.15
966
+ [2025-10-28 21:31:59] (step=0085000) Train Loss: 0.4326, Train Steps/Sec: 1.15
967
+ [2025-10-28 21:32:59] Beginning epoch 68...
968
+ [2025-10-28 21:33:29] (step=0085100) Train Loss: 0.4309, Train Steps/Sec: 1.11
969
+ [2025-10-28 21:34:57] (step=0085200) Train Loss: 0.4326, Train Steps/Sec: 1.15
970
+ [2025-10-28 21:36:24] (step=0085300) Train Loss: 0.4316, Train Steps/Sec: 1.14
971
+ [2025-10-28 21:37:52] (step=0085400) Train Loss: 0.4312, Train Steps/Sec: 1.14
972
+ [2025-10-28 21:39:19] (step=0085500) Train Loss: 0.4317, Train Steps/Sec: 1.14
973
+ [2025-10-28 21:40:47] (step=0085600) Train Loss: 0.4334, Train Steps/Sec: 1.15
974
+ [2025-10-28 21:42:14] (step=0085700) Train Loss: 0.4328, Train Steps/Sec: 1.15
975
+ [2025-10-28 21:43:41] (step=0085800) Train Loss: 0.4329, Train Steps/Sec: 1.15
976
+ [2025-10-28 21:45:08] (step=0085900) Train Loss: 0.4320, Train Steps/Sec: 1.15
977
+ [2025-10-28 21:46:36] (step=0086000) Train Loss: 0.4313, Train Steps/Sec: 1.15
978
+ [2025-10-28 21:48:03] (step=0086100) Train Loss: 0.4315, Train Steps/Sec: 1.15
979
+ [2025-10-28 21:49:30] (step=0086200) Train Loss: 0.4319, Train Steps/Sec: 1.14
980
+ [2025-10-28 21:50:58] (step=0086300) Train Loss: 0.4318, Train Steps/Sec: 1.14
981
+ [2025-10-28 21:51:15] Beginning epoch 69...
982
+ [2025-10-28 21:52:28] (step=0086400) Train Loss: 0.4324, Train Steps/Sec: 1.11
983
+ [2025-10-28 21:53:55] (step=0086500) Train Loss: 0.4320, Train Steps/Sec: 1.15
984
+ [2025-10-28 21:55:22] (step=0086600) Train Loss: 0.4310, Train Steps/Sec: 1.15
985
+ [2025-10-28 21:56:50] (step=0086700) Train Loss: 0.4315, Train Steps/Sec: 1.15
986
+ [2025-10-28 21:58:17] (step=0086800) Train Loss: 0.4319, Train Steps/Sec: 1.15
987
+ [2025-10-28 21:59:44] (step=0086900) Train Loss: 0.4309, Train Steps/Sec: 1.15
988
+ [2025-10-28 22:01:12] (step=0087000) Train Loss: 0.4314, Train Steps/Sec: 1.14
989
+ [2025-10-28 22:02:39] (step=0087100) Train Loss: 0.4322, Train Steps/Sec: 1.14
990
+ [2025-10-28 22:04:07] (step=0087200) Train Loss: 0.4316, Train Steps/Sec: 1.15
991
+ [2025-10-28 22:05:34] (step=0087300) Train Loss: 0.4329, Train Steps/Sec: 1.15
992
+ [2025-10-28 22:07:01] (step=0087400) Train Loss: 0.4315, Train Steps/Sec: 1.15
993
+ [2025-10-28 22:08:28] (step=0087500) Train Loss: 0.4313, Train Steps/Sec: 1.15
994
+ [2025-10-28 22:09:30] Beginning epoch 70...
995
+ [2025-10-28 22:09:59] (step=0087600) Train Loss: 0.4316, Train Steps/Sec: 1.11
996
+ [2025-10-28 22:11:26] (step=0087700) Train Loss: 0.4310, Train Steps/Sec: 1.15
997
+ [2025-10-28 22:12:53] (step=0087800) Train Loss: 0.4309, Train Steps/Sec: 1.15
998
+ [2025-10-28 22:14:21] (step=0087900) Train Loss: 0.4313, Train Steps/Sec: 1.14
999
+ [2025-10-28 22:15:48] (step=0088000) Train Loss: 0.4314, Train Steps/Sec: 1.14
1000
+ [2025-10-28 22:17:16] (step=0088100) Train Loss: 0.4306, Train Steps/Sec: 1.15
1001
+ [2025-10-28 22:18:43] (step=0088200) Train Loss: 0.4321, Train Steps/Sec: 1.15
1002
+ [2025-10-28 22:20:10] (step=0088300) Train Loss: 0.4313, Train Steps/Sec: 1.15
1003
+ [2025-10-28 22:21:37] (step=0088400) Train Loss: 0.4324, Train Steps/Sec: 1.15
1004
+ [2025-10-28 22:23:05] (step=0088500) Train Loss: 0.4302, Train Steps/Sec: 1.15
1005
+ [2025-10-28 22:24:32] (step=0088600) Train Loss: 0.4308, Train Steps/Sec: 1.15
1006
+ [2025-10-28 22:25:59] (step=0088700) Train Loss: 0.4317, Train Steps/Sec: 1.15
1007
+ [2025-10-28 22:27:27] (step=0088800) Train Loss: 0.4317, Train Steps/Sec: 1.14
1008
+ [2025-10-28 22:27:45] Beginning epoch 71...
1009
+ [2025-10-28 22:28:57] (step=0088900) Train Loss: 0.4313, Train Steps/Sec: 1.11
1010
+ [2025-10-28 22:30:24] (step=0089000) Train Loss: 0.4303, Train Steps/Sec: 1.15
1011
+ [2025-10-28 22:31:51] (step=0089100) Train Loss: 0.4308, Train Steps/Sec: 1.15
1012
+ [2025-10-28 22:33:19] (step=0089200) Train Loss: 0.4299, Train Steps/Sec: 1.15
1013
+ [2025-10-28 22:34:46] (step=0089300) Train Loss: 0.4317, Train Steps/Sec: 1.15
1014
+ [2025-10-28 22:36:13] (step=0089400) Train Loss: 0.4320, Train Steps/Sec: 1.15
1015
+ [2025-10-28 22:37:40] (step=0089500) Train Loss: 0.4303, Train Steps/Sec: 1.15