xingjianleng committed on
Commit 21247ef · verified · 1 Parent(s): 854683e

Upload folder using huggingface_hub

stage2/lightningdit-xl-dinov3-vit-s16-bf16/log.txt ADDED
@@ -0,0 +1,357 @@
1
+ [2025-10-29 05:36:56] Experiment directory created at results/stage2/hfdata/lightningdit-xl-dinov3-vit-s16-bf16
2
+ [2025-10-29 05:36:58] using base=100 for rope new
3
+ [2025-10-29 05:36:58] using min_period=None for rope new
4
+ [2025-10-29 05:36:58] using max_period=None for rope new
5
+ [2025-10-29 05:36:58] using normalize_coords=separate for rope new
6
+ [2025-10-29 05:36:58] using shift_coords=None for rope new
7
+ [2025-10-29 05:36:58] using rescale_coords=2 for rope new
8
+ [2025-10-29 05:36:58] using jitter_coords=None for rope new
9
+ [2025-10-29 05:36:58] using dtype=fp32 for rope new
10
+ [2025-10-29 05:36:58] using mlp layer as FFN
11
+ [2025-10-29 05:37:13] Model Parameters: 1200.86M
12
+ [2025-10-29 05:37:17] Dataset contains 1,281,167 images (/scratch/xingjian.leng/data/train)
13
+ [2025-10-29 05:37:17] Gradient accumulation: steps=1, micro batch=128, per-GPU batch=128, global batch=1024
14
+ [2025-10-29 05:37:17] Precision mode: bf16
15
+ [2025-10-29 05:37:17] Training configured for 80 epochs, 1251 steps per epoch.
16
+ [2025-10-29 05:37:17] Optimizer: ADAMW with lr=0.0002, betas=(0.9, 0.95), weight_decay=0.0, eps=1e-08
17
+ Scheduler: linear with warmup_steps=0, decay_end_steps=0, final_lr=0.0002
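The batch sizes and step counts reported in the configuration lines above are mutually consistent, which the short check below reproduces. Two things in it are inferences rather than facts from the log: the process count of 8 (derived from global batch / per-GPU batch, since the log never states the GPU count) and the assumption that the last partial batch of each epoch is dropped.

```python
# Values copied from the log lines above.
dataset_size = 1_281_167
micro_batch = 128
grad_accum_steps = 1
global_batch = 1024
epochs = 80

# Per-GPU batch is the micro batch times the accumulation steps.
per_gpu_batch = micro_batch * grad_accum_steps

# Inferred, not logged: number of data-parallel processes.
world_size = global_batch // per_gpu_batch

# Assumes the final partial batch of each epoch is dropped (floor division).
steps_per_epoch = dataset_size // global_batch

# Planned length of the full run in optimizer steps.
total_steps = steps_per_epoch * epochs

print(per_gpu_batch, world_size, steps_per_epoch, total_steps)
```

Under these assumptions the floor division matches the logged 1251 steps per epoch, and the 80-epoch schedule corresponds to 100080 optimizer steps.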
18
+ [2025-10-29 05:37:17] Training for 80 epochs...
19
+ [2025-10-29 05:37:17] Beginning epoch 0...
20
+ [2025-10-29 05:37:22] Generating EMA samples...
21
+ [2025-10-29 05:37:50] Generating EMA samples done.
22
+ [2025-10-29 05:39:07] (step=0000100) Train Loss: 1.5261, Train Steps/Sec: 0.91
23
+ [2025-10-29 05:40:26] (step=0000200) Train Loss: 0.8134, Train Steps/Sec: 1.27
24
+ [2025-10-29 05:41:46] (step=0000300) Train Loss: 0.6489, Train Steps/Sec: 1.27
25
+ [2025-10-29 05:43:05] (step=0000400) Train Loss: 0.5884, Train Steps/Sec: 1.27
26
+ [2025-10-29 05:44:24] (step=0000500) Train Loss: 0.5466, Train Steps/Sec: 1.27
27
+ [2025-10-29 05:45:43] (step=0000600) Train Loss: 0.5207, Train Steps/Sec: 1.27
28
+ [2025-10-29 05:47:02] (step=0000700) Train Loss: 0.5021, Train Steps/Sec: 1.27
29
+ [2025-10-29 05:48:21] (step=0000800) Train Loss: 0.4870, Train Steps/Sec: 1.27
30
+ [2025-10-29 05:49:40] (step=0000900) Train Loss: 0.4769, Train Steps/Sec: 1.26
31
+ [2025-10-29 05:50:59] (step=0001000) Train Loss: 0.4682, Train Steps/Sec: 1.26
32
+ [2025-10-29 05:52:18] (step=0001100) Train Loss: 0.4592, Train Steps/Sec: 1.27
33
+ [2025-10-29 05:53:37] (step=0001200) Train Loss: 0.4531, Train Steps/Sec: 1.27
34
+ [2025-10-29 05:54:18] Beginning epoch 1...
35
+ [2025-10-29 05:54:59] (step=0001300) Train Loss: 0.4488, Train Steps/Sec: 1.22
36
+ [2025-10-29 05:56:18] (step=0001400) Train Loss: 0.4414, Train Steps/Sec: 1.27
37
+ [2025-10-29 05:57:37] (step=0001500) Train Loss: 0.4381, Train Steps/Sec: 1.27
38
+ [2025-10-29 05:58:56] (step=0001600) Train Loss: 0.4339, Train Steps/Sec: 1.27
39
+ [2025-10-29 06:00:15] (step=0001700) Train Loss: 0.4297, Train Steps/Sec: 1.27
40
+ [2025-10-29 06:01:34] (step=0001800) Train Loss: 0.4283, Train Steps/Sec: 1.27
41
+ [2025-10-29 06:02:53] (step=0001900) Train Loss: 0.4250, Train Steps/Sec: 1.27
42
+ [2025-10-29 06:04:12] (step=0002000) Train Loss: 0.4210, Train Steps/Sec: 1.27
43
+ [2025-10-29 06:05:31] (step=0002100) Train Loss: 0.4205, Train Steps/Sec: 1.26
44
+ [2025-10-29 06:06:50] (step=0002200) Train Loss: 0.4180, Train Steps/Sec: 1.27
45
+ [2025-10-29 06:08:10] (step=0002300) Train Loss: 0.4139, Train Steps/Sec: 1.26
46
+ [2025-10-29 06:09:29] (step=0002400) Train Loss: 0.4129, Train Steps/Sec: 1.26
47
+ [2025-10-29 06:10:48] (step=0002500) Train Loss: 0.4119, Train Steps/Sec: 1.27
48
+ [2025-10-29 06:10:50] Beginning epoch 2...
49
+ [2025-10-29 06:12:10] (step=0002600) Train Loss: 0.4087, Train Steps/Sec: 1.21
50
+ [2025-10-29 06:13:29] (step=0002700) Train Loss: 0.4066, Train Steps/Sec: 1.26
51
+ [2025-10-29 06:14:48] (step=0002800) Train Loss: 0.4051, Train Steps/Sec: 1.26
52
+ [2025-10-29 06:16:07] (step=0002900) Train Loss: 0.4051, Train Steps/Sec: 1.26
53
+ [2025-10-29 06:17:26] (step=0003000) Train Loss: 0.4023, Train Steps/Sec: 1.27
54
+ [2025-10-29 06:18:45] (step=0003100) Train Loss: 0.4009, Train Steps/Sec: 1.27
55
+ [2025-10-29 06:20:04] (step=0003200) Train Loss: 0.4005, Train Steps/Sec: 1.27
56
+ [2025-10-29 06:21:23] (step=0003300) Train Loss: 0.3994, Train Steps/Sec: 1.27
57
+ [2025-10-29 06:22:42] (step=0003400) Train Loss: 0.3972, Train Steps/Sec: 1.27
58
+ [2025-10-29 06:24:01] (step=0003500) Train Loss: 0.3970, Train Steps/Sec: 1.27
59
+ [2025-10-29 06:25:20] (step=0003600) Train Loss: 0.3948, Train Steps/Sec: 1.26
60
+ [2025-10-29 06:26:40] (step=0003700) Train Loss: 0.3932, Train Steps/Sec: 1.26
61
+ [2025-10-29 06:27:22] Beginning epoch 3...
62
+ [2025-10-29 06:28:01] (step=0003800) Train Loss: 0.3936, Train Steps/Sec: 1.22
63
+ [2025-10-29 06:29:21] (step=0003900) Train Loss: 0.3916, Train Steps/Sec: 1.26
64
+ [2025-10-29 06:30:40] (step=0004000) Train Loss: 0.3909, Train Steps/Sec: 1.27
65
+ [2025-10-29 06:31:59] (step=0004100) Train Loss: 0.3898, Train Steps/Sec: 1.27
66
+ [2025-10-29 06:33:18] (step=0004200) Train Loss: 0.3899, Train Steps/Sec: 1.27
67
+ [2025-10-29 06:34:38] (step=0004300) Train Loss: 0.3879, Train Steps/Sec: 1.25
68
+ [2025-10-29 06:35:57] (step=0004400) Train Loss: 0.3872, Train Steps/Sec: 1.26
69
+ [2025-10-29 06:37:16] (step=0004500) Train Loss: 0.3870, Train Steps/Sec: 1.26
70
+ [2025-10-29 06:38:35] (step=0004600) Train Loss: 0.3851, Train Steps/Sec: 1.26
71
+ [2025-10-29 06:39:54] (step=0004700) Train Loss: 0.3843, Train Steps/Sec: 1.26
72
+ [2025-10-29 06:41:13] (step=0004800) Train Loss: 0.3834, Train Steps/Sec: 1.27
73
+ [2025-10-29 06:42:32] (step=0004900) Train Loss: 0.3831, Train Steps/Sec: 1.26
74
+ [2025-10-29 06:43:51] (step=0005000) Train Loss: 0.3824, Train Steps/Sec: 1.27
75
+ [2025-10-29 06:43:55] Beginning epoch 4...
76
+ [2025-10-29 06:45:13] (step=0005100) Train Loss: 0.3830, Train Steps/Sec: 1.22
77
+ [2025-10-29 06:46:32] (step=0005200) Train Loss: 0.3811, Train Steps/Sec: 1.27
78
+ [2025-10-29 06:47:51] (step=0005300) Train Loss: 0.3813, Train Steps/Sec: 1.26
79
+ [2025-10-29 06:49:10] (step=0005400) Train Loss: 0.3807, Train Steps/Sec: 1.27
80
+ [2025-10-29 06:50:29] (step=0005500) Train Loss: 0.3799, Train Steps/Sec: 1.26
81
+ [2025-10-29 06:51:48] (step=0005600) Train Loss: 0.3792, Train Steps/Sec: 1.26
82
+ [2025-10-29 06:53:07] (step=0005700) Train Loss: 0.3780, Train Steps/Sec: 1.27
83
+ [2025-10-29 06:54:26] (step=0005800) Train Loss: 0.3787, Train Steps/Sec: 1.26
84
+ [2025-10-29 06:55:46] (step=0005900) Train Loss: 0.3774, Train Steps/Sec: 1.26
85
+ [2025-10-29 06:57:05] (step=0006000) Train Loss: 0.3771, Train Steps/Sec: 1.26
86
+ [2025-10-29 06:58:24] (step=0006100) Train Loss: 0.3760, Train Steps/Sec: 1.27
87
+ [2025-10-29 06:59:43] (step=0006200) Train Loss: 0.3753, Train Steps/Sec: 1.27
88
+ [2025-10-29 07:00:27] Beginning epoch 5...
89
+ [2025-10-29 07:01:05] (step=0006300) Train Loss: 0.3758, Train Steps/Sec: 1.22
90
+ [2025-10-29 07:02:24] (step=0006400) Train Loss: 0.3735, Train Steps/Sec: 1.27
91
+ [2025-10-29 07:03:43] (step=0006500) Train Loss: 0.3735, Train Steps/Sec: 1.26
92
+ [2025-10-29 07:05:02] (step=0006600) Train Loss: 0.3748, Train Steps/Sec: 1.27
93
+ [2025-10-29 07:06:21] (step=0006700) Train Loss: 0.3721, Train Steps/Sec: 1.27
94
+ [2025-10-29 07:07:41] (step=0006800) Train Loss: 0.3721, Train Steps/Sec: 1.27
95
+ [2025-10-29 07:09:00] (step=0006900) Train Loss: 0.3727, Train Steps/Sec: 1.27
96
+ [2025-10-29 07:10:19] (step=0007000) Train Loss: 0.3736, Train Steps/Sec: 1.26
97
+ [2025-10-29 07:11:38] (step=0007100) Train Loss: 0.3714, Train Steps/Sec: 1.27
98
+ [2025-10-29 07:12:57] (step=0007200) Train Loss: 0.3714, Train Steps/Sec: 1.26
99
+ [2025-10-29 07:14:16] (step=0007300) Train Loss: 0.3717, Train Steps/Sec: 1.26
100
+ [2025-10-29 07:15:35] (step=0007400) Train Loss: 0.3700, Train Steps/Sec: 1.26
101
+ [2025-10-29 07:16:54] (step=0007500) Train Loss: 0.3699, Train Steps/Sec: 1.26
102
+ [2025-10-29 07:16:59] Beginning epoch 6...
103
+ [2025-10-29 07:18:16] (step=0007600) Train Loss: 0.3703, Train Steps/Sec: 1.22
104
+ [2025-10-29 07:19:35] (step=0007700) Train Loss: 0.3705, Train Steps/Sec: 1.26
105
+ [2025-10-29 07:20:54] (step=0007800) Train Loss: 0.3697, Train Steps/Sec: 1.27
106
+ [2025-10-29 07:22:13] (step=0007900) Train Loss: 0.3684, Train Steps/Sec: 1.26
107
+ [2025-10-29 07:23:32] (step=0008000) Train Loss: 0.3688, Train Steps/Sec: 1.27
108
+ [2025-10-29 07:24:51] (step=0008100) Train Loss: 0.3672, Train Steps/Sec: 1.27
109
+ [2025-10-29 07:26:10] (step=0008200) Train Loss: 0.3674, Train Steps/Sec: 1.27
110
+ [2025-10-29 07:27:30] (step=0008300) Train Loss: 0.3659, Train Steps/Sec: 1.27
111
+ [2025-10-29 07:28:49] (step=0008400) Train Loss: 0.3665, Train Steps/Sec: 1.27
112
+ [2025-10-29 07:30:08] (step=0008500) Train Loss: 0.3671, Train Steps/Sec: 1.27
113
+ [2025-10-29 07:31:27] (step=0008600) Train Loss: 0.3655, Train Steps/Sec: 1.27
114
+ [2025-10-29 07:32:46] (step=0008700) Train Loss: 0.3662, Train Steps/Sec: 1.27
115
+ [2025-10-29 07:33:31] Beginning epoch 7...
116
+ [2025-10-29 07:34:07] (step=0008800) Train Loss: 0.3652, Train Steps/Sec: 1.23
117
+ [2025-10-29 07:35:26] (step=0008900) Train Loss: 0.3637, Train Steps/Sec: 1.27
118
+ [2025-10-29 07:36:45] (step=0009000) Train Loss: 0.3653, Train Steps/Sec: 1.27
119
+ [2025-10-29 07:38:04] (step=0009100) Train Loss: 0.3652, Train Steps/Sec: 1.27
120
+ [2025-10-29 07:39:23] (step=0009200) Train Loss: 0.3654, Train Steps/Sec: 1.27
121
+ [2025-10-29 07:40:43] (step=0009300) Train Loss: 0.3643, Train Steps/Sec: 1.25
122
+ [2025-10-29 07:42:02] (step=0009400) Train Loss: 0.3640, Train Steps/Sec: 1.26
123
+ [2025-10-29 07:43:21] (step=0009500) Train Loss: 0.3639, Train Steps/Sec: 1.27
124
+ [2025-10-29 07:44:40] (step=0009600) Train Loss: 0.3615, Train Steps/Sec: 1.26
125
+ [2025-10-29 07:45:59] (step=0009700) Train Loss: 0.3625, Train Steps/Sec: 1.26
126
+ [2025-10-29 07:47:18] (step=0009800) Train Loss: 0.3622, Train Steps/Sec: 1.27
127
+ [2025-10-29 07:48:38] (step=0009900) Train Loss: 0.3617, Train Steps/Sec: 1.27
128
+ [2025-10-29 07:49:57] (step=0010000) Train Loss: 0.3631, Train Steps/Sec: 1.27
129
+ [2025-10-29 07:50:03] Beginning epoch 8...
130
+ [2025-10-29 07:51:18] (step=0010100) Train Loss: 0.3631, Train Steps/Sec: 1.23
131
+ [2025-10-29 07:52:37] (step=0010200) Train Loss: 0.3622, Train Steps/Sec: 1.27
132
+ [2025-10-29 07:53:56] (step=0010300) Train Loss: 0.3620, Train Steps/Sec: 1.27
133
+ [2025-10-29 07:55:15] (step=0010400) Train Loss: 0.3606, Train Steps/Sec: 1.27
134
+ [2025-10-29 07:56:34] (step=0010500) Train Loss: 0.3616, Train Steps/Sec: 1.27
135
+ [2025-10-29 07:57:53] (step=0010600) Train Loss: 0.3610, Train Steps/Sec: 1.26
136
+ [2025-10-29 07:59:12] (step=0010700) Train Loss: 0.3610, Train Steps/Sec: 1.27
137
+ [2025-10-29 08:00:31] (step=0010800) Train Loss: 0.3613, Train Steps/Sec: 1.26
138
+ [2025-10-29 08:01:51] (step=0010900) Train Loss: 0.3599, Train Steps/Sec: 1.26
139
+ [2025-10-29 08:03:10] (step=0011000) Train Loss: 0.3606, Train Steps/Sec: 1.25
140
+ [2025-10-29 08:04:29] (step=0011100) Train Loss: 0.3602, Train Steps/Sec: 1.27
141
+ [2025-10-29 08:05:48] (step=0011200) Train Loss: 0.3608, Train Steps/Sec: 1.27
142
+ [2025-10-29 08:06:36] Beginning epoch 9...
143
+ [2025-10-29 08:07:10] (step=0011300) Train Loss: 0.3583, Train Steps/Sec: 1.23
144
+ [2025-10-29 08:08:29] (step=0011400) Train Loss: 0.3591, Train Steps/Sec: 1.27
145
+ [2025-10-29 08:09:48] (step=0011500) Train Loss: 0.3589, Train Steps/Sec: 1.27
146
+ [2025-10-29 08:11:07] (step=0011600) Train Loss: 0.3579, Train Steps/Sec: 1.27
147
+ [2025-10-29 08:12:26] (step=0011700) Train Loss: 0.3588, Train Steps/Sec: 1.27
148
+ [2025-10-29 08:13:45] (step=0011800) Train Loss: 0.3581, Train Steps/Sec: 1.27
149
+ [2025-10-29 08:15:04] (step=0011900) Train Loss: 0.3589, Train Steps/Sec: 1.27
150
+ [2025-10-29 08:16:23] (step=0012000) Train Loss: 0.3587, Train Steps/Sec: 1.27
151
+ [2025-10-29 08:17:42] (step=0012100) Train Loss: 0.3567, Train Steps/Sec: 1.27
152
+ [2025-10-29 08:19:01] (step=0012200) Train Loss: 0.3575, Train Steps/Sec: 1.26
153
+ [2025-10-29 08:20:20] (step=0012300) Train Loss: 0.3571, Train Steps/Sec: 1.27
154
+ [2025-10-29 08:21:39] (step=0012400) Train Loss: 0.3566, Train Steps/Sec: 1.27
155
+ [2025-10-29 08:22:58] (step=0012500) Train Loss: 0.3559, Train Steps/Sec: 1.27
156
+ [2025-10-29 08:23:07] Beginning epoch 10...
157
+ [2025-10-29 08:24:20] (step=0012600) Train Loss: 0.3557, Train Steps/Sec: 1.23
158
+ [2025-10-29 08:25:40] (step=0012700) Train Loss: 0.3553, Train Steps/Sec: 1.26
159
+ [2025-10-29 08:26:59] (step=0012800) Train Loss: 0.3564, Train Steps/Sec: 1.27
160
+ [2025-10-29 08:28:18] (step=0012900) Train Loss: 0.3562, Train Steps/Sec: 1.27
161
+ [2025-10-29 08:29:37] (step=0013000) Train Loss: 0.3553, Train Steps/Sec: 1.26
162
+ [2025-10-29 08:30:56] (step=0013100) Train Loss: 0.3547, Train Steps/Sec: 1.26
163
+ [2025-10-29 08:32:15] (step=0013200) Train Loss: 0.3549, Train Steps/Sec: 1.27
164
+ [2025-10-29 08:33:34] (step=0013300) Train Loss: 0.3551, Train Steps/Sec: 1.27
165
+ [2025-10-29 08:34:53] (step=0013400) Train Loss: 0.3537, Train Steps/Sec: 1.27
166
+ [2025-10-29 08:36:12] (step=0013500) Train Loss: 0.3561, Train Steps/Sec: 1.27
167
+ [2025-10-29 08:37:31] (step=0013600) Train Loss: 0.3551, Train Steps/Sec: 1.26
168
+ [2025-10-29 08:38:50] (step=0013700) Train Loss: 0.3546, Train Steps/Sec: 1.27
169
+ [2025-10-29 08:39:39] Beginning epoch 11...
170
+ [2025-10-29 08:40:11] (step=0013800) Train Loss: 0.3541, Train Steps/Sec: 1.23
171
+ [2025-10-29 08:41:30] (step=0013900) Train Loss: 0.3529, Train Steps/Sec: 1.27
172
+ [2025-10-29 08:42:49] (step=0014000) Train Loss: 0.3540, Train Steps/Sec: 1.27
173
+ [2025-10-29 08:44:08] (step=0014100) Train Loss: 0.3509, Train Steps/Sec: 1.27
174
+ [2025-10-29 08:45:27] (step=0014200) Train Loss: 0.3527, Train Steps/Sec: 1.27
175
+ [2025-10-29 08:46:47] (step=0014300) Train Loss: 0.3525, Train Steps/Sec: 1.25
176
+ [2025-10-29 08:48:06] (step=0014400) Train Loss: 0.3544, Train Steps/Sec: 1.26
177
+ [2025-10-29 08:49:26] (step=0014500) Train Loss: 0.3530, Train Steps/Sec: 1.26
178
+ [2025-10-29 08:50:45] (step=0014600) Train Loss: 0.3531, Train Steps/Sec: 1.27
179
+ [2025-10-29 08:52:04] (step=0014700) Train Loss: 0.3533, Train Steps/Sec: 1.27
180
+ [2025-10-29 08:53:23] (step=0014800) Train Loss: 0.3524, Train Steps/Sec: 1.27
181
+ [2025-10-29 08:54:42] (step=0014900) Train Loss: 0.3524, Train Steps/Sec: 1.27
182
+ [2025-10-29 08:56:01] (step=0015000) Train Loss: 0.3540, Train Steps/Sec: 1.27
183
+ [2025-10-29 08:56:11] Beginning epoch 12...
184
+ [2025-10-29 08:57:22] (step=0015100) Train Loss: 0.3517, Train Steps/Sec: 1.23
185
+ [2025-10-29 08:58:41] (step=0015200) Train Loss: 0.3506, Train Steps/Sec: 1.27
186
+ [2025-10-29 09:00:00] (step=0015300) Train Loss: 0.3528, Train Steps/Sec: 1.27
187
+ [2025-10-29 09:01:19] (step=0015400) Train Loss: 0.3516, Train Steps/Sec: 1.27
188
+ [2025-10-29 09:02:38] (step=0015500) Train Loss: 0.3524, Train Steps/Sec: 1.27
189
+ [2025-10-29 09:03:57] (step=0015600) Train Loss: 0.3522, Train Steps/Sec: 1.27
190
+ [2025-10-29 09:05:16] (step=0015700) Train Loss: 0.3526, Train Steps/Sec: 1.27
191
+ [2025-10-29 09:06:35] (step=0015800) Train Loss: 0.3503, Train Steps/Sec: 1.27
192
+ [2025-10-29 09:07:54] (step=0015900) Train Loss: 0.3501, Train Steps/Sec: 1.27
193
+ [2025-10-29 09:09:14] (step=0016000) Train Loss: 0.3513, Train Steps/Sec: 1.26
194
+ [2025-10-29 09:10:33] (step=0016100) Train Loss: 0.3509, Train Steps/Sec: 1.26
195
+ [2025-10-29 09:11:52] (step=0016200) Train Loss: 0.3505, Train Steps/Sec: 1.27
196
+ [2025-10-29 09:12:42] Beginning epoch 13...
197
+ [2025-10-29 09:13:13] (step=0016300) Train Loss: 0.3503, Train Steps/Sec: 1.23
198
+ [2025-10-29 09:14:32] (step=0016400) Train Loss: 0.3512, Train Steps/Sec: 1.27
199
+ [2025-10-29 09:15:51] (step=0016500) Train Loss: 0.3494, Train Steps/Sec: 1.26
200
+ [2025-10-29 09:17:10] (step=0016600) Train Loss: 0.3494, Train Steps/Sec: 1.27
201
+ [2025-10-29 09:18:29] (step=0016700) Train Loss: 0.3498, Train Steps/Sec: 1.27
202
+ [2025-10-29 09:19:48] (step=0016800) Train Loss: 0.3497, Train Steps/Sec: 1.27
203
+ [2025-10-29 09:21:07] (step=0016900) Train Loss: 0.3496, Train Steps/Sec: 1.27
204
+ [2025-10-29 09:22:26] (step=0017000) Train Loss: 0.3498, Train Steps/Sec: 1.27
205
+ [2025-10-29 09:23:45] (step=0017100) Train Loss: 0.3504, Train Steps/Sec: 1.27
206
+ [2025-10-29 09:25:04] (step=0017200) Train Loss: 0.3497, Train Steps/Sec: 1.27
207
+ [2025-10-29 09:26:23] (step=0017300) Train Loss: 0.3496, Train Steps/Sec: 1.27
208
+ [2025-10-29 09:27:42] (step=0017400) Train Loss: 0.3477, Train Steps/Sec: 1.27
209
+ [2025-10-29 09:29:01] (step=0017500) Train Loss: 0.3491, Train Steps/Sec: 1.27
210
+ [2025-10-29 09:29:13] Beginning epoch 14...
211
+ [2025-10-29 09:30:23] (step=0017600) Train Loss: 0.3483, Train Steps/Sec: 1.23
212
+ [2025-10-29 09:31:42] (step=0017700) Train Loss: 0.3479, Train Steps/Sec: 1.26
213
+ [2025-10-29 09:33:02] (step=0017800) Train Loss: 0.3479, Train Steps/Sec: 1.26
214
+ [2025-10-29 09:34:21] (step=0017900) Train Loss: 0.3481, Train Steps/Sec: 1.26
215
+ [2025-10-29 09:35:40] (step=0018000) Train Loss: 0.3490, Train Steps/Sec: 1.27
216
+ [2025-10-29 09:36:59] (step=0018100) Train Loss: 0.3487, Train Steps/Sec: 1.27
217
+ [2025-10-29 09:38:18] (step=0018200) Train Loss: 0.3477, Train Steps/Sec: 1.27
218
+ [2025-10-29 09:39:37] (step=0018300) Train Loss: 0.3478, Train Steps/Sec: 1.27
219
+ [2025-10-29 09:40:56] (step=0018400) Train Loss: 0.3485, Train Steps/Sec: 1.27
220
+ [2025-10-29 09:42:15] (step=0018500) Train Loss: 0.3477, Train Steps/Sec: 1.27
221
+ [2025-10-29 09:43:34] (step=0018600) Train Loss: 0.3486, Train Steps/Sec: 1.27
222
+ [2025-10-29 09:44:53] (step=0018700) Train Loss: 0.3481, Train Steps/Sec: 1.27
223
+ [2025-10-29 09:45:45] Beginning epoch 15...
224
+ [2025-10-29 09:46:14] (step=0018800) Train Loss: 0.3473, Train Steps/Sec: 1.23
225
+ [2025-10-29 09:47:33] (step=0018900) Train Loss: 0.3456, Train Steps/Sec: 1.27
226
+ [2025-10-29 09:48:52] (step=0019000) Train Loss: 0.3469, Train Steps/Sec: 1.27
227
+ [2025-10-29 09:50:11] (step=0019100) Train Loss: 0.3477, Train Steps/Sec: 1.27
228
+ [2025-10-29 09:51:30] (step=0019200) Train Loss: 0.3458, Train Steps/Sec: 1.27
229
+ [2025-10-29 09:52:49] (step=0019300) Train Loss: 0.3454, Train Steps/Sec: 1.26
230
+ [2025-10-29 09:54:09] (step=0019400) Train Loss: 0.3454, Train Steps/Sec: 1.26
231
+ [2025-10-29 09:55:28] (step=0019500) Train Loss: 0.3473, Train Steps/Sec: 1.26
232
+ [2025-10-29 09:56:47] (step=0019600) Train Loss: 0.3476, Train Steps/Sec: 1.27
233
+ [2025-10-29 09:58:06] (step=0019700) Train Loss: 0.3462, Train Steps/Sec: 1.27
234
+ [2025-10-29 09:59:25] (step=0019800) Train Loss: 0.3459, Train Steps/Sec: 1.27
235
+ [2025-10-29 10:00:44] (step=0019900) Train Loss: 0.3452, Train Steps/Sec: 1.27
236
+ [2025-10-29 10:02:03] (step=0020000) Train Loss: 0.3459, Train Steps/Sec: 1.27
237
+ [2025-10-29 10:02:16] Beginning epoch 16...
238
+ [2025-10-29 10:03:25] (step=0020100) Train Loss: 0.3461, Train Steps/Sec: 1.22
239
+ [2025-10-29 10:04:44] (step=0020200) Train Loss: 0.3453, Train Steps/Sec: 1.27
240
+ [2025-10-29 10:06:03] (step=0020300) Train Loss: 0.3463, Train Steps/Sec: 1.27
241
+ [2025-10-29 10:07:22] (step=0020400) Train Loss: 0.3453, Train Steps/Sec: 1.27
242
+ [2025-10-29 10:08:41] (step=0020500) Train Loss: 0.3462, Train Steps/Sec: 1.27
243
+ [2025-10-29 10:10:00] (step=0020600) Train Loss: 0.3454, Train Steps/Sec: 1.27
244
+ [2025-10-29 10:11:19] (step=0020700) Train Loss: 0.3461, Train Steps/Sec: 1.26
245
+ [2025-10-29 10:12:38] (step=0020800) Train Loss: 0.3454, Train Steps/Sec: 1.27
246
+ [2025-10-29 10:13:57] (step=0020900) Train Loss: 0.3440, Train Steps/Sec: 1.27
247
+ [2025-10-29 10:15:16] (step=0021000) Train Loss: 0.3442, Train Steps/Sec: 1.26
248
+ [2025-10-29 10:16:36] (step=0021100) Train Loss: 0.3452, Train Steps/Sec: 1.26
249
+ [2025-10-29 10:17:55] (step=0021200) Train Loss: 0.3452, Train Steps/Sec: 1.26
250
+ [2025-10-29 10:18:48] Beginning epoch 17...
251
+ [2025-10-29 10:19:17] (step=0021300) Train Loss: 0.3449, Train Steps/Sec: 1.22
252
+ [2025-10-29 10:20:36] (step=0021400) Train Loss: 0.3453, Train Steps/Sec: 1.27
253
+ [2025-10-29 10:21:55] (step=0021500) Train Loss: 0.3449, Train Steps/Sec: 1.27
254
+ [2025-10-29 10:23:14] (step=0021600) Train Loss: 0.3435, Train Steps/Sec: 1.27
255
+ [2025-10-29 10:24:33] (step=0021700) Train Loss: 0.3436, Train Steps/Sec: 1.27
256
+ [2025-10-29 10:25:52] (step=0021800) Train Loss: 0.3438, Train Steps/Sec: 1.27
257
+ [2025-10-29 10:27:11] (step=0021900) Train Loss: 0.3432, Train Steps/Sec: 1.27
258
+ [2025-10-29 10:28:30] (step=0022000) Train Loss: 0.3434, Train Steps/Sec: 1.27
259
+ [2025-10-29 10:29:49] (step=0022100) Train Loss: 0.3441, Train Steps/Sec: 1.27
260
+ [2025-10-29 10:31:08] (step=0022200) Train Loss: 0.3434, Train Steps/Sec: 1.26
261
+ [2025-10-29 10:32:27] (step=0022300) Train Loss: 0.3428, Train Steps/Sec: 1.27
262
+ [2025-10-29 10:33:46] (step=0022400) Train Loss: 0.3430, Train Steps/Sec: 1.27
263
+ [2025-10-29 10:35:05] (step=0022500) Train Loss: 0.3439, Train Steps/Sec: 1.27
264
+ [2025-10-29 10:35:19] Beginning epoch 18...
265
+ [2025-10-29 10:36:26] (step=0022600) Train Loss: 0.3429, Train Steps/Sec: 1.23
266
+ [2025-10-29 10:37:46] (step=0022700) Train Loss: 0.3462, Train Steps/Sec: 1.26
267
+ [2025-10-29 10:39:05] (step=0022800) Train Loss: 0.3756, Train Steps/Sec: 1.26
268
+ [2025-10-29 10:40:24] (step=0022900) Train Loss: 0.9067, Train Steps/Sec: 1.27
269
+ [2025-10-29 10:41:43] (step=0023000) Train Loss: 0.6716, Train Steps/Sec: 1.27
270
+ [2025-10-29 10:43:02] (step=0023100) Train Loss: 0.7666, Train Steps/Sec: 1.26
271
+ [2025-10-29 10:44:21] (step=0023200) Train Loss: 0.7574, Train Steps/Sec: 1.26
272
+ [2025-10-29 10:45:40] (step=0023300) Train Loss: 0.9820, Train Steps/Sec: 1.26
273
+ [2025-10-29 10:46:59] (step=0023400) Train Loss: 0.6974, Train Steps/Sec: 1.26
274
+ [2025-10-29 10:48:18] (step=0023500) Train Loss: 0.7070, Train Steps/Sec: 1.26
275
+ [2025-10-29 10:49:38] (step=0023600) Train Loss: 0.8931, Train Steps/Sec: 1.26
276
+ [2025-10-29 10:50:57] (step=0023700) Train Loss: 0.5795, Train Steps/Sec: 1.26
277
+ [2025-10-29 10:51:50] Beginning epoch 19...
278
+ [2025-10-29 10:52:16] (step=0023800) Train Loss: nan, Train Steps/Sec: 1.27
279
+ [2025-10-29 10:53:32] (step=0023900) Train Loss: nan, Train Steps/Sec: 1.31
280
+ [2025-10-29 10:54:48] (step=0024000) Train Loss: nan, Train Steps/Sec: 1.31
281
+ [2025-10-29 10:56:04] (step=0024100) Train Loss: nan, Train Steps/Sec: 1.31
282
+ [2025-10-29 10:57:21] (step=0024200) Train Loss: nan, Train Steps/Sec: 1.31
283
+ [2025-10-29 10:58:37] (step=0024300) Train Loss: nan, Train Steps/Sec: 1.31
284
+ [2025-10-29 10:59:54] (step=0024400) Train Loss: nan, Train Steps/Sec: 1.30
285
+ [2025-10-29 11:01:11] (step=0024500) Train Loss: nan, Train Steps/Sec: 1.31
286
+ [2025-10-29 11:02:27] (step=0024600) Train Loss: nan, Train Steps/Sec: 1.31
287
+ [2025-10-29 11:03:43] (step=0024700) Train Loss: nan, Train Steps/Sec: 1.31
288
+ [2025-10-29 11:05:00] (step=0024800) Train Loss: nan, Train Steps/Sec: 1.31
289
+ [2025-10-29 11:06:16] (step=0024900) Train Loss: nan, Train Steps/Sec: 1.31
290
+ [2025-10-29 11:07:32] (step=0025000) Train Loss: nan, Train Steps/Sec: 1.31
291
+ [2025-10-29 11:08:29] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-dinov3-vit-s16-bf16/checkpoints/0025000.pt
292
+ [2025-10-29 11:08:29] Generating EMA samples...
293
+ [2025-10-29 11:08:42] Generating EMA samples done.
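The checkpoint at step 25000 was written more than a thousand steps after the logged loss first became nan, so it may not be usable for resuming. A minimal sketch of a guard that a training loop could consult before stepping or saving is shown below. It is not part of the training code that produced this log: the class name `DivergenceGuard` and the `spike_factor` threshold are hypothetical, chosen only for illustration.

```python
import math


class DivergenceGuard:
    """Classifies each logged loss as 'ok', 'spike', or 'non_finite'.

    Hypothetical helper: spike_factor is an illustrative threshold,
    not a value taken from the run above.
    """

    def __init__(self, spike_factor=2.0):
        self.spike_factor = spike_factor
        self.best = math.inf  # lowest finite loss seen so far

    def check(self, loss):
        # nan and inf both fail math.isfinite.
        if not math.isfinite(loss):
            return "non_finite"
        # Flag a loss that jumps well above the best value seen so far.
        if self.best < math.inf and loss > self.spike_factor * self.best:
            return "spike"
        self.best = min(self.best, loss)
        return "ok"


# Losses copied from steps 22600 to 22900 of the log, followed by a nan.
guard = DivergenceGuard()
statuses = [guard.check(x) for x in [0.3429, 0.3462, 0.3756, 0.9067, float("nan")]]
print(statuses)
```

With the default threshold, the jump to 0.9067 at step 22900 is reported as a spike, which is roughly 900 steps before the first nan appears in the log. A loop using such a guard could stop, skip the checkpoint, or reload an earlier one at that point.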
294
+ [2025-10-29 11:08:58] Beginning epoch 20...
295
+ [2025-10-29 11:10:01] (step=0025100) Train Loss: nan, Train Steps/Sec: 0.67
296
+ [2025-10-29 11:11:18] (step=0025200) Train Loss: nan, Train Steps/Sec: 1.31
297
+ [2025-10-29 11:12:34] (step=0025300) Train Loss: nan, Train Steps/Sec: 1.31
298
+ [2025-10-29 11:13:50] (step=0025400) Train Loss: nan, Train Steps/Sec: 1.31
299
+ [2025-10-29 11:15:07] (step=0025500) Train Loss: nan, Train Steps/Sec: 1.31
300
+ [2025-10-29 11:16:23] (step=0025600) Train Loss: nan, Train Steps/Sec: 1.31
301
+ [2025-10-29 11:17:40] (step=0025700) Train Loss: nan, Train Steps/Sec: 1.31
302
+ [2025-10-29 11:18:56] (step=0025800) Train Loss: nan, Train Steps/Sec: 1.31
303
+ [2025-10-29 11:20:12] (step=0025900) Train Loss: nan, Train Steps/Sec: 1.31
304
+ [2025-10-29 11:21:29] (step=0026000) Train Loss: nan, Train Steps/Sec: 1.30
305
+ [2025-10-29 11:22:46] (step=0026100) Train Loss: nan, Train Steps/Sec: 1.30
306
+ [2025-10-29 11:24:02] (step=0026200) Train Loss: nan, Train Steps/Sec: 1.31
307
+ [2025-10-29 11:24:57] Beginning epoch 21...
308
+ [2025-10-29 11:25:21] (step=0026300) Train Loss: nan, Train Steps/Sec: 1.27
309
+ [2025-10-29 11:26:38] (step=0026400) Train Loss: nan, Train Steps/Sec: 1.31
310
+ [2025-10-29 11:27:54] (step=0026500) Train Loss: nan, Train Steps/Sec: 1.31
311
+ [2025-10-29 11:29:10] (step=0026600) Train Loss: nan, Train Steps/Sec: 1.31
312
+ [2025-10-29 11:30:27] (step=0026700) Train Loss: nan, Train Steps/Sec: 1.31
313
+ [2025-10-29 11:31:43] (step=0026800) Train Loss: nan, Train Steps/Sec: 1.31
314
+ [2025-10-29 11:33:00] (step=0026900) Train Loss: nan, Train Steps/Sec: 1.31
315
+ [2025-10-29 11:34:16] (step=0027000) Train Loss: nan, Train Steps/Sec: 1.31
316
+ [2025-10-29 11:35:32] (step=0027100) Train Loss: nan, Train Steps/Sec: 1.31
317
+ [2025-10-29 11:36:49] (step=0027200) Train Loss: nan, Train Steps/Sec: 1.31
318
+ [2025-10-29 11:38:05] (step=0027300) Train Loss: nan, Train Steps/Sec: 1.31
319
+ [2025-10-29 11:39:21] (step=0027400) Train Loss: nan, Train Steps/Sec: 1.31
320
+ [2025-10-29 11:40:38] (step=0027500) Train Loss: nan, Train Steps/Sec: 1.31
321
+ [2025-10-29 11:40:55] Beginning epoch 22...
322
+ [2025-10-29 11:41:57] (step=0027600) Train Loss: nan, Train Steps/Sec: 1.26
323
+ [2025-10-29 11:43:14] (step=0027700) Train Loss: nan, Train Steps/Sec: 1.30
324
+ [2025-10-29 11:44:31] (step=0027800) Train Loss: nan, Train Steps/Sec: 1.30
325
+ [2025-10-29 11:45:47] (step=0027900) Train Loss: nan, Train Steps/Sec: 1.31
326
+ [2025-10-29 11:47:03] (step=0028000) Train Loss: nan, Train Steps/Sec: 1.31
327
+ [2025-10-29 11:48:20] (step=0028100) Train Loss: nan, Train Steps/Sec: 1.31
328
+ [2025-10-29 11:49:36] (step=0028200) Train Loss: nan, Train Steps/Sec: 1.31
329
+ [2025-10-29 11:50:53] (step=0028300) Train Loss: nan, Train Steps/Sec: 1.31
330
+ [2025-10-29 11:52:09] (step=0028400) Train Loss: nan, Train Steps/Sec: 1.31
331
+ [2025-10-29 11:53:25] (step=0028500) Train Loss: nan, Train Steps/Sec: 1.31
332
+ [2025-10-29 11:54:42] (step=0028600) Train Loss: nan, Train Steps/Sec: 1.31
333
+ [2025-10-29 11:55:58] (step=0028700) Train Loss: nan, Train Steps/Sec: 1.31
334
+ [2025-10-29 11:56:54] Beginning epoch 23...
335
+ [2025-10-29 11:57:17] (step=0028800) Train Loss: nan, Train Steps/Sec: 1.27
336
+ [2025-10-29 11:58:33] (step=0028900) Train Loss: nan, Train Steps/Sec: 1.31
337
+ [2025-10-29 11:59:50] (step=0029000) Train Loss: nan, Train Steps/Sec: 1.31
338
+ [2025-10-29 12:01:06] (step=0029100) Train Loss: nan, Train Steps/Sec: 1.31
339
+ [2025-10-29 12:02:22] (step=0029200) Train Loss: nan, Train Steps/Sec: 1.31
340
+ [2025-10-29 12:03:39] (step=0029300) Train Loss: nan, Train Steps/Sec: 1.31
341
+ [2025-10-29 12:04:56] (step=0029400) Train Loss: nan, Train Steps/Sec: 1.30
342
+ [2025-10-29 12:06:13] (step=0029500) Train Loss: nan, Train Steps/Sec: 1.30
343
+ [2025-10-29 12:07:29] (step=0029600) Train Loss: nan, Train Steps/Sec: 1.31
344
+ [2025-10-29 12:08:45] (step=0029700) Train Loss: nan, Train Steps/Sec: 1.31
345
+ [2025-10-29 12:10:02] (step=0029800) Train Loss: nan, Train Steps/Sec: 1.31
346
+ [2025-10-29 12:11:19] (step=0029900) Train Loss: nan, Train Steps/Sec: 1.31
347
+ [2025-10-29 12:12:35] (step=0030000) Train Loss: nan, Train Steps/Sec: 1.31
348
+ [2025-10-29 12:12:54] Beginning epoch 24...
349
+ [2025-10-29 12:13:54] (step=0030100) Train Loss: nan, Train Steps/Sec: 1.27
350
+ [2025-10-29 12:15:10] (step=0030200) Train Loss: nan, Train Steps/Sec: 1.31
351
+ [2025-10-29 12:16:26] (step=0030300) Train Loss: nan, Train Steps/Sec: 1.31
352
+ [2025-10-29 12:17:43] (step=0030400) Train Loss: nan, Train Steps/Sec: 1.31
353
+ [2025-10-29 12:18:59] (step=0030500) Train Loss: nan, Train Steps/Sec: 1.31
354
+ [2025-10-29 12:20:16] (step=0030600) Train Loss: nan, Train Steps/Sec: 1.31
355
+ [2025-10-29 12:21:32] (step=0030700) Train Loss: nan, Train Steps/Sec: 1.31
356
+ [2025-10-29 12:22:48] (step=0030800) Train Loss: nan, Train Steps/Sec: 1.31
357
+ [2025-10-29 12:24:05] (step=0030900) Train Loss: nan, Train Steps/Sec: 1.31
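To locate the point of divergence in a log like this without reading it by eye, the step lines can be parsed with a regular expression. The sketch below assumes the line format seen above, `(step=N) Train Loss: X, Train Steps/Sec: Y`; the function names are illustrative and the three sample lines are copied from this log.

```python
import math
import re

# Matches the per-step lines of the log; other lines are ignored.
LINE_RE = re.compile(
    r"\(step=(\d+)\) Train Loss: ([^,]+), Train Steps/Sec: ([\d.]+)"
)


def parse_log(lines):
    """Return a list of (step, loss, steps_per_sec) tuples."""
    records = []
    for line in lines:
        m = LINE_RE.search(line)
        if m:
            # float("nan") parses the literal 'nan' written by the logger.
            records.append((int(m.group(1)), float(m.group(2)), float(m.group(3))))
    return records


def first_non_finite(records):
    """Return the first step whose loss is nan or inf, or None."""
    for step, loss, _ in records:
        if not math.isfinite(loss):
            return step
    return None


sample = [
    "+ [2025-10-29 10:36:26] (step=0022600) Train Loss: 0.3429, Train Steps/Sec: 1.23",
    "+ [2025-10-29 10:40:24] (step=0022900) Train Loss: 0.9067, Train Steps/Sec: 1.27",
    "+ [2025-10-29 10:52:16] (step=0023800) Train Loss: nan, Train Steps/Sec: 1.27",
]
records = parse_log(sample)
print(first_non_finite(records))
```

Run over the full file, the same functions would report step 23800 as the first non-finite entry, since every earlier step line carries a finite loss.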