xingjianleng committed · Commit e7935a1 · verified · 1 Parent(s): 6a0ae0d

Upload folder using huggingface_hub

stage2/lightningdit-xl-pe-vit-g-bf16/checkpoints/0025000.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:abe4d50e7a4f319a753f1774c7b806f3cb30842c882ac7f44f5f9c48bbee394e
+ size 19268192626
stage2/lightningdit-xl-pe-vit-g-bf16/checkpoints/0050000.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9e2eb438b33107845bc020061c54e86e512b54a3846befa13f196c6090e65652
+ size 19268192626
stage2/lightningdit-xl-pe-vit-g-bf16/checkpoints/0075000.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0a67f5879c02cbd15bdf345700dbc74a82d88f504bce130415f2eade86639436
+ size 19268192626
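
The three checkpoint entries above are Git LFS pointer files: each records only a spec version, a SHA-256 object id, and a byte size, while the roughly 19.3 GB payload is stored separately. The sketch below shows one way a downloaded checkpoint could be checked against its pointer. The helper names `parse_lfs_pointer` and `verify_file` are hypothetical and not part of `huggingface_hub` or Git LFS; the pointer text is copied from the `0025000.pt` hunk.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid line has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_file(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` matches the pointer's size and digest."""
    path = Path(path)
    # Cheap check first: a size mismatch avoids hashing a multi-gigabyte file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the 0025000.pt hunk in this commit.
POINTER_0025000 = """\
version https://git-lfs.github.com/spec/v1
oid sha256:abe4d50e7a4f319a753f1774c7b806f3cb30842c882ac7f44f5f9c48bbee394e
size 19268192626
"""

pointer = parse_lfs_pointer(POINTER_0025000)
print(f"{pointer['algo']} digest, {pointer['size'] / 1e9:.2f} GB")
```

Calling `verify_file("0025000.pt", pointer)` on a local copy would then confirm that the download is complete and uncorrupted; the local path is an assumption about where the file was saved.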
stage2/lightningdit-xl-pe-vit-g-bf16/log.txt ADDED
@@ -0,0 +1,990 @@
1
+ [2025-10-28 00:22:14] Experiment directory created at results/stage2/hfdata/lightningdit-xl-pe-vit-g-bf16
2
+ [2025-10-28 00:22:47] Missing keys for loading vision encoder: []
3
+ [2025-10-28 00:22:47] Unexpected keys for loading vision encoder: []
4
+ [2025-10-28 00:23:04] Model Parameters: 1204.40M
5
+ [2025-10-28 00:23:12] Dataset contains 1,281,167 images (/scratch/xingjian.leng/data/train)
6
+ [2025-10-28 00:23:12] Gradient accumulation: steps=1, micro batch=128, per-GPU batch=128, global batch=1024
7
+ [2025-10-28 00:23:12] Precision mode: bf16
8
+ [2025-10-28 00:23:12] Training configured for 80 epochs, 1251 steps per epoch.
9
+ [2025-10-28 00:23:12] Optimizer: ADAMW with lr=0.0002, betas=(0.9, 0.95), weight_decay=0.0, eps=1e-08
10
+ Scheduler: linear with warmup_steps=0, decay_end_steps=0, final_lr=0.0002
11
+ [2025-10-28 00:23:12] Training for 80 epochs...
12
+ [2025-10-28 00:23:12] Beginning epoch 0...
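
The configuration lines above can be cross-checked with a little arithmetic. The sketch below copies the dataset size, batch sizes, and epoch count from the log and recomputes the steps per epoch and the total step count. Two things are assumptions rather than logged facts: that the final partial batch of each epoch is dropped, and that the number of data-parallel processes equals global batch divided by per-GPU batch.

```python
# Values copied from the startup lines of the log.
num_images = 1_281_167
global_batch = 1024
per_gpu_batch = 128
epochs = 80

# Floor division matches the logged 1251 steps per epoch; dropping the
# final partial batch is an assumption about the data loader.
steps_per_epoch = num_images // global_batch
images_left_over = num_images - steps_per_epoch * global_batch

total_steps = epochs * steps_per_epoch

# Inferred, not logged: with gradient accumulation steps=1, the ratio of
# global batch to per-GPU batch suggests the number of processes.
inferred_world_size = global_batch // per_gpu_batch

print(f"steps per epoch: {steps_per_epoch} ({images_left_over} images unused)")
print(f"total steps over {epochs} epochs: {total_steps}")
print(f"inferred data-parallel processes: {inferred_world_size}")
```

Under these assumptions the run would finish at step 100,080, which is consistent with the epoch boundaries in the log falling just after every multiple of 1251 steps.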
13
+ [2025-10-28 00:54:00] Experiment directory created at results/stage2/hfdata/lightningdit-xl-pe-vit-g-bf16
14
+ [2025-10-28 00:54:32] Missing keys for loading vision encoder: []
15
+ [2025-10-28 00:54:32] Unexpected keys for loading vision encoder: []
16
+ [2025-10-28 00:54:49] Model Parameters: 1204.40M
17
+ [2025-10-28 00:54:54] Dataset contains 1,281,167 images (/scratch/xingjian.leng/data/pe-vit-g_hfdataset_precentercrop_True_train_bfloat16)
18
+ [2025-10-28 00:54:54] Gradient accumulation: steps=1, micro batch=128, per-GPU batch=128, global batch=1024
19
+ [2025-10-28 00:54:54] Precision mode: bf16
20
+ [2025-10-28 00:54:54] Training configured for 80 epochs, 1251 steps per epoch.
21
+ [2025-10-28 00:54:54] Optimizer: ADAMW with lr=0.0002, betas=(0.9, 0.95), weight_decay=0.0, eps=1e-08
22
+ Scheduler: linear with warmup_steps=0, decay_end_steps=0, final_lr=0.0002
23
+ [2025-10-28 00:54:54] Training for 80 epochs...
24
+ [2025-10-28 00:54:54] Beginning epoch 0...
25
+ [2025-10-28 00:55:04] Generating EMA samples...
26
+ [2025-10-28 00:55:34] Generating EMA samples done.
27
+ [2025-10-28 00:57:01] (step=0000100) Train Loss: 1.7139, Train Steps/Sec: 0.79
28
+ [2025-10-28 00:58:28] (step=0000200) Train Loss: 1.2748, Train Steps/Sec: 1.14
29
+ [2025-10-28 00:59:56] (step=0000300) Train Loss: 1.1312, Train Steps/Sec: 1.14
30
+ [2025-10-28 01:01:24] (step=0000400) Train Loss: 1.0568, Train Steps/Sec: 1.14
31
+ [2025-10-28 01:02:52] (step=0000500) Train Loss: 1.0123, Train Steps/Sec: 1.14
32
+ [2025-10-28 01:04:19] (step=0000600) Train Loss: 0.9803, Train Steps/Sec: 1.14
33
+ [2025-10-28 01:05:47] (step=0000700) Train Loss: 0.9564, Train Steps/Sec: 1.14
34
+ [2025-10-28 01:07:15] (step=0000800) Train Loss: 0.9386, Train Steps/Sec: 1.14
35
+ [2025-10-28 01:08:43] (step=0000900) Train Loss: 0.9206, Train Steps/Sec: 1.14
36
+ [2025-10-28 01:10:11] (step=0001000) Train Loss: 0.9097, Train Steps/Sec: 1.14
37
+ [2025-10-28 01:11:38] (step=0001100) Train Loss: 0.8984, Train Steps/Sec: 1.14
38
+ [2025-10-28 01:13:06] (step=0001200) Train Loss: 0.8896, Train Steps/Sec: 1.14
39
+ [2025-10-28 01:13:52] Beginning epoch 1...
40
+ [2025-10-28 01:14:38] (step=0001300) Train Loss: 0.8825, Train Steps/Sec: 1.09
41
+ [2025-10-28 01:16:06] (step=0001400) Train Loss: 0.8756, Train Steps/Sec: 1.14
42
+ [2025-10-28 01:17:33] (step=0001500) Train Loss: 0.8713, Train Steps/Sec: 1.14
43
+ [2025-10-28 01:19:01] (step=0001600) Train Loss: 0.8637, Train Steps/Sec: 1.14
44
+ [2025-10-28 01:20:29] (step=0001700) Train Loss: 0.8599, Train Steps/Sec: 1.14
45
+ [2025-10-28 01:21:57] (step=0001800) Train Loss: 0.8552, Train Steps/Sec: 1.13
46
+ [2025-10-28 01:23:25] (step=0001900) Train Loss: 0.8508, Train Steps/Sec: 1.14
47
+ [2025-10-28 01:24:53] (step=0002000) Train Loss: 0.8476, Train Steps/Sec: 1.14
48
+ [2025-10-28 01:26:20] (step=0002100) Train Loss: 0.8453, Train Steps/Sec: 1.14
49
+ [2025-10-28 01:27:48] (step=0002200) Train Loss: 0.8421, Train Steps/Sec: 1.14
50
+ [2025-10-28 01:29:16] (step=0002300) Train Loss: 0.8391, Train Steps/Sec: 1.14
51
+ [2025-10-28 01:30:44] (step=0002400) Train Loss: 0.8365, Train Steps/Sec: 1.14
52
+ [2025-10-28 01:32:11] (step=0002500) Train Loss: 0.8330, Train Steps/Sec: 1.14
53
+ [2025-10-28 01:32:14] Beginning epoch 2...
54
+ [2025-10-28 01:33:43] (step=0002600) Train Loss: 0.8308, Train Steps/Sec: 1.09
55
+ [2025-10-28 01:35:11] (step=0002700) Train Loss: 0.8268, Train Steps/Sec: 1.14
56
+ [2025-10-28 01:36:39] (step=0002800) Train Loss: 0.8258, Train Steps/Sec: 1.14
57
+ [2025-10-28 01:38:07] (step=0002900) Train Loss: 0.8248, Train Steps/Sec: 1.14
58
+ [2025-10-28 01:39:34] (step=0003000) Train Loss: 0.8217, Train Steps/Sec: 1.14
59
+ [2025-10-28 01:41:02] (step=0003100) Train Loss: 0.8208, Train Steps/Sec: 1.14
60
+ [2025-10-28 01:42:30] (step=0003200) Train Loss: 0.8191, Train Steps/Sec: 1.14
61
+ [2025-10-28 01:43:58] (step=0003300) Train Loss: 0.8181, Train Steps/Sec: 1.14
62
+ [2025-10-28 01:45:26] (step=0003400) Train Loss: 0.8156, Train Steps/Sec: 1.14
63
+ [2025-10-28 01:46:54] (step=0003500) Train Loss: 0.8125, Train Steps/Sec: 1.14
64
+ [2025-10-28 01:48:22] (step=0003600) Train Loss: 0.8130, Train Steps/Sec: 1.14
65
+ [2025-10-28 01:49:49] (step=0003700) Train Loss: 0.8110, Train Steps/Sec: 1.14
66
+ [2025-10-28 01:50:36] Beginning epoch 3...
67
+ [2025-10-28 01:51:21] (step=0003800) Train Loss: 0.8101, Train Steps/Sec: 1.09
68
+ [2025-10-28 01:52:48] (step=0003900) Train Loss: 0.8086, Train Steps/Sec: 1.14
69
+ [2025-10-28 01:54:16] (step=0004000) Train Loss: 0.8060, Train Steps/Sec: 1.14
70
+ [2025-10-28 01:55:44] (step=0004100) Train Loss: 0.8054, Train Steps/Sec: 1.14
71
+ [2025-10-28 01:57:12] (step=0004200) Train Loss: 0.8032, Train Steps/Sec: 1.14
72
+ [2025-10-28 01:58:40] (step=0004300) Train Loss: 0.8037, Train Steps/Sec: 1.14
73
+ [2025-10-28 02:00:08] (step=0004400) Train Loss: 0.8010, Train Steps/Sec: 1.13
74
+ [2025-10-28 02:01:36] (step=0004500) Train Loss: 0.8001, Train Steps/Sec: 1.14
75
+ [2025-10-28 02:03:04] (step=0004600) Train Loss: 0.7997, Train Steps/Sec: 1.14
76
+ [2025-10-28 02:04:31] (step=0004700) Train Loss: 0.8002, Train Steps/Sec: 1.14
77
+ [2025-10-28 02:05:59] (step=0004800) Train Loss: 0.7968, Train Steps/Sec: 1.14
78
+ [2025-10-28 02:07:27] (step=0004900) Train Loss: 0.7976, Train Steps/Sec: 1.14
79
+ [2025-10-28 02:08:55] (step=0005000) Train Loss: 0.7974, Train Steps/Sec: 1.14
80
+ [2025-10-28 02:08:59] Beginning epoch 4...
81
+ [2025-10-28 02:10:26] (step=0005100) Train Loss: 0.7964, Train Steps/Sec: 1.09
82
+ [2025-10-28 02:11:54] (step=0005200) Train Loss: 0.7938, Train Steps/Sec: 1.13
83
+ [2025-10-28 02:13:22] (step=0005300) Train Loss: 0.7931, Train Steps/Sec: 1.14
84
+ [2025-10-28 02:14:50] (step=0005400) Train Loss: 0.7926, Train Steps/Sec: 1.14
85
+ [2025-10-28 02:16:18] (step=0005500) Train Loss: 0.7930, Train Steps/Sec: 1.14
86
+ [2025-10-28 02:17:46] (step=0005600) Train Loss: 0.7915, Train Steps/Sec: 1.14
87
+ [2025-10-28 02:19:13] (step=0005700) Train Loss: 0.7891, Train Steps/Sec: 1.14
88
+ [2025-10-28 02:20:41] (step=0005800) Train Loss: 0.7895, Train Steps/Sec: 1.14
89
+ [2025-10-28 02:22:09] (step=0005900) Train Loss: 0.7898, Train Steps/Sec: 1.14
90
+ [2025-10-28 02:23:37] (step=0006000) Train Loss: 0.7876, Train Steps/Sec: 1.14
91
+ [2025-10-28 02:25:05] (step=0006100) Train Loss: 0.7875, Train Steps/Sec: 1.14
92
+ [2025-10-28 02:26:33] (step=0006200) Train Loss: 0.7860, Train Steps/Sec: 1.14
93
+ [2025-10-28 02:27:22] Beginning epoch 5...
94
+ [2025-10-28 02:28:05] (step=0006300) Train Loss: 0.7868, Train Steps/Sec: 1.09
95
+ [2025-10-28 02:29:32] (step=0006400) Train Loss: 0.7862, Train Steps/Sec: 1.14
96
+ [2025-10-28 02:31:00] (step=0006500) Train Loss: 0.7843, Train Steps/Sec: 1.14
97
+ [2025-10-28 02:32:28] (step=0006600) Train Loss: 0.7850, Train Steps/Sec: 1.14
98
+ [2025-10-28 02:33:56] (step=0006700) Train Loss: 0.7833, Train Steps/Sec: 1.14
99
+ [2025-10-28 02:35:24] (step=0006800) Train Loss: 0.7836, Train Steps/Sec: 1.14
100
+ [2025-10-28 02:36:52] (step=0006900) Train Loss: 0.7824, Train Steps/Sec: 1.14
101
+ [2025-10-28 02:38:20] (step=0007000) Train Loss: 0.7820, Train Steps/Sec: 1.13
102
+ [2025-10-28 02:39:48] (step=0007100) Train Loss: 0.7823, Train Steps/Sec: 1.14
103
+ [2025-10-28 02:41:15] (step=0007200) Train Loss: 0.7815, Train Steps/Sec: 1.14
104
+ [2025-10-28 02:42:43] (step=0007300) Train Loss: 0.7800, Train Steps/Sec: 1.14
105
+ [2025-10-28 02:44:11] (step=0007400) Train Loss: 0.7798, Train Steps/Sec: 1.14
106
+ [2025-10-28 02:45:39] (step=0007500) Train Loss: 0.7793, Train Steps/Sec: 1.14
107
+ [2025-10-28 02:45:45] Beginning epoch 6...
108
+ [2025-10-28 02:47:10] (step=0007600) Train Loss: 0.7795, Train Steps/Sec: 1.09
109
+ [2025-10-28 02:48:38] (step=0007700) Train Loss: 0.7778, Train Steps/Sec: 1.14
110
+ [2025-10-28 02:50:06] (step=0007800) Train Loss: 0.7774, Train Steps/Sec: 1.13
111
+ [2025-10-28 02:51:34] (step=0007900) Train Loss: 0.7782, Train Steps/Sec: 1.14
112
+ [2025-10-28 02:53:02] (step=0008000) Train Loss: 0.7778, Train Steps/Sec: 1.14
113
+ [2025-10-28 02:54:30] (step=0008100) Train Loss: 0.7773, Train Steps/Sec: 1.14
114
+ [2025-10-28 02:55:57] (step=0008200) Train Loss: 0.7773, Train Steps/Sec: 1.14
115
+ [2025-10-28 02:57:25] (step=0008300) Train Loss: 0.7759, Train Steps/Sec: 1.14
116
+ [2025-10-28 02:58:53] (step=0008400) Train Loss: 0.7753, Train Steps/Sec: 1.14
117
+ [2025-10-28 03:00:21] (step=0008500) Train Loss: 0.7754, Train Steps/Sec: 1.14
118
+ [2025-10-28 03:01:49] (step=0008600) Train Loss: 0.7740, Train Steps/Sec: 1.14
119
+ [2025-10-28 03:03:17] (step=0008700) Train Loss: 0.7747, Train Steps/Sec: 1.13
120
+ [2025-10-28 03:04:07] Beginning epoch 7...
121
+ [2025-10-28 03:04:49] (step=0008800) Train Loss: 0.7741, Train Steps/Sec: 1.09
122
+ [2025-10-28 03:06:16] (step=0008900) Train Loss: 0.7736, Train Steps/Sec: 1.14
123
+ [2025-10-28 03:07:44] (step=0009000) Train Loss: 0.7735, Train Steps/Sec: 1.14
124
+ [2025-10-28 03:09:12] (step=0009100) Train Loss: 0.7727, Train Steps/Sec: 1.14
125
+ [2025-10-28 03:10:40] (step=0009200) Train Loss: 0.7733, Train Steps/Sec: 1.14
126
+ [2025-10-28 03:12:07] (step=0009300) Train Loss: 0.7726, Train Steps/Sec: 1.14
127
+ [2025-10-28 03:13:35] (step=0009400) Train Loss: 0.7711, Train Steps/Sec: 1.14
128
+ [2025-10-28 03:15:03] (step=0009500) Train Loss: 0.7719, Train Steps/Sec: 1.13
129
+ [2025-10-28 03:16:31] (step=0009600) Train Loss: 0.7705, Train Steps/Sec: 1.14
130
+ [2025-10-28 03:17:59] (step=0009700) Train Loss: 0.7706, Train Steps/Sec: 1.14
131
+ [2025-10-28 03:19:27] (step=0009800) Train Loss: 0.7695, Train Steps/Sec: 1.14
132
+ [2025-10-28 03:20:55] (step=0009900) Train Loss: 0.7699, Train Steps/Sec: 1.14
133
+ [2025-10-28 03:22:22] (step=0010000) Train Loss: 0.7698, Train Steps/Sec: 1.14
134
+ [2025-10-28 03:22:30] Beginning epoch 8...
135
+ [2025-10-28 03:23:55] (step=0010100) Train Loss: 0.7689, Train Steps/Sec: 1.09
136
+ [2025-10-28 03:25:22] (step=0010200) Train Loss: 0.7690, Train Steps/Sec: 1.14
137
+ [2025-10-28 03:26:50] (step=0010300) Train Loss: 0.7684, Train Steps/Sec: 1.14
138
+ [2025-10-28 03:28:19] (step=0010400) Train Loss: 0.7684, Train Steps/Sec: 1.13
139
+ [2025-10-28 03:29:46] (step=0010500) Train Loss: 0.7682, Train Steps/Sec: 1.14
140
+ [2025-10-28 03:31:14] (step=0010600) Train Loss: 0.7685, Train Steps/Sec: 1.14
141
+ [2025-10-28 03:32:42] (step=0010700) Train Loss: 0.7672, Train Steps/Sec: 1.14
142
+ [2025-10-28 03:34:10] (step=0010800) Train Loss: 0.7673, Train Steps/Sec: 1.14
143
+ [2025-10-28 03:35:38] (step=0010900) Train Loss: 0.7661, Train Steps/Sec: 1.14
144
+ [2025-10-28 03:37:05] (step=0011000) Train Loss: 0.7663, Train Steps/Sec: 1.14
145
+ [2025-10-28 03:38:33] (step=0011100) Train Loss: 0.7654, Train Steps/Sec: 1.14
146
+ [2025-10-28 03:40:01] (step=0011200) Train Loss: 0.7659, Train Steps/Sec: 1.14
147
+ [2025-10-28 03:40:54] Beginning epoch 9...
148
+ [2025-10-28 03:41:33] (step=0011300) Train Loss: 0.7656, Train Steps/Sec: 1.09
149
+ [2025-10-28 03:43:00] (step=0011400) Train Loss: 0.7664, Train Steps/Sec: 1.14
150
+ [2025-10-28 03:44:28] (step=0011500) Train Loss: 0.7654, Train Steps/Sec: 1.14
151
+ [2025-10-28 03:45:56] (step=0011600) Train Loss: 0.7648, Train Steps/Sec: 1.14
152
+ [2025-10-28 03:47:24] (step=0011700) Train Loss: 0.7654, Train Steps/Sec: 1.14
153
+ [2025-10-28 03:48:52] (step=0011800) Train Loss: 0.7642, Train Steps/Sec: 1.14
154
+ [2025-10-28 03:50:19] (step=0011900) Train Loss: 0.7643, Train Steps/Sec: 1.14
155
+ [2025-10-28 03:51:47] (step=0012000) Train Loss: 0.7636, Train Steps/Sec: 1.14
156
+ [2025-10-28 03:53:15] (step=0012100) Train Loss: 0.7636, Train Steps/Sec: 1.13
157
+ [2025-10-28 03:54:43] (step=0012200) Train Loss: 0.7629, Train Steps/Sec: 1.14
158
+ [2025-10-28 03:56:11] (step=0012300) Train Loss: 0.7630, Train Steps/Sec: 1.14
159
+ [2025-10-28 03:57:39] (step=0012400) Train Loss: 0.7636, Train Steps/Sec: 1.14
160
+ [2025-10-28 03:59:07] (step=0012500) Train Loss: 0.7628, Train Steps/Sec: 1.14
161
+ [2025-10-28 03:59:16] Beginning epoch 10...
162
+ [2025-10-28 04:00:38] (step=0012600) Train Loss: 0.7626, Train Steps/Sec: 1.09
163
+ [2025-10-28 04:02:06] (step=0012700) Train Loss: 0.7631, Train Steps/Sec: 1.14
164
+ [2025-10-28 04:03:34] (step=0012800) Train Loss: 0.7616, Train Steps/Sec: 1.14
165
+ [2025-10-28 04:05:01] (step=0012900) Train Loss: 0.7618, Train Steps/Sec: 1.14
166
+ [2025-10-28 04:06:30] (step=0013000) Train Loss: 0.7611, Train Steps/Sec: 1.13
167
+ [2025-10-28 04:07:57] (step=0013100) Train Loss: 0.7607, Train Steps/Sec: 1.14
168
+ [2025-10-28 04:09:25] (step=0013200) Train Loss: 0.7618, Train Steps/Sec: 1.14
169
+ [2025-10-28 04:10:53] (step=0013300) Train Loss: 0.7613, Train Steps/Sec: 1.14
170
+ [2025-10-28 04:12:21] (step=0013400) Train Loss: 0.7614, Train Steps/Sec: 1.14
171
+ [2025-10-28 04:13:49] (step=0013500) Train Loss: 0.7605, Train Steps/Sec: 1.14
172
+ [2025-10-28 04:15:16] (step=0013600) Train Loss: 0.7608, Train Steps/Sec: 1.14
173
+ [2025-10-28 04:16:44] (step=0013700) Train Loss: 0.7606, Train Steps/Sec: 1.14
174
+ [2025-10-28 04:17:38] Beginning epoch 11...
175
+ [2025-10-28 04:18:16] (step=0013800) Train Loss: 0.7602, Train Steps/Sec: 1.09
176
+ [2025-10-28 04:19:44] (step=0013900) Train Loss: 0.7599, Train Steps/Sec: 1.13
177
+ [2025-10-28 04:21:12] (step=0014000) Train Loss: 0.7592, Train Steps/Sec: 1.14
178
+ [2025-10-28 04:22:40] (step=0014100) Train Loss: 0.7587, Train Steps/Sec: 1.14
179
+ [2025-10-28 04:24:08] (step=0014200) Train Loss: 0.7586, Train Steps/Sec: 1.14
180
+ [2025-10-28 04:25:36] (step=0014300) Train Loss: 0.7590, Train Steps/Sec: 1.14
181
+ [2025-10-28 04:27:03] (step=0014400) Train Loss: 0.7574, Train Steps/Sec: 1.14
182
+ [2025-10-28 04:28:31] (step=0014500) Train Loss: 0.7582, Train Steps/Sec: 1.14
183
+ [2025-10-28 04:29:59] (step=0014600) Train Loss: 0.7599, Train Steps/Sec: 1.14
184
+ [2025-10-28 04:31:27] (step=0014700) Train Loss: 0.7590, Train Steps/Sec: 1.13
185
+ [2025-10-28 04:32:55] (step=0014800) Train Loss: 0.7577, Train Steps/Sec: 1.14
186
+ [2025-10-28 04:34:23] (step=0014900) Train Loss: 0.7589, Train Steps/Sec: 1.14
187
+ [2025-10-28 04:35:50] (step=0015000) Train Loss: 0.7585, Train Steps/Sec: 1.14
188
+ [2025-10-28 04:36:02] Beginning epoch 12...
189
+ [2025-10-28 04:37:23] (step=0015100) Train Loss: 0.7563, Train Steps/Sec: 1.08
190
+ [2025-10-28 04:38:51] (step=0015200) Train Loss: 0.7578, Train Steps/Sec: 1.14
191
+ [2025-10-28 04:40:18] (step=0015300) Train Loss: 0.7576, Train Steps/Sec: 1.14
192
+ [2025-10-28 04:41:46] (step=0015400) Train Loss: 0.7569, Train Steps/Sec: 1.14
193
+ [2025-10-28 04:43:14] (step=0015500) Train Loss: 0.7558, Train Steps/Sec: 1.14
194
+ [2025-10-28 04:44:42] (step=0015600) Train Loss: 0.7572, Train Steps/Sec: 1.13
195
+ [2025-10-28 04:46:10] (step=0015700) Train Loss: 0.7566, Train Steps/Sec: 1.14
196
+ [2025-10-28 04:47:38] (step=0015800) Train Loss: 0.7564, Train Steps/Sec: 1.14
197
+ [2025-10-28 04:49:06] (step=0015900) Train Loss: 0.7556, Train Steps/Sec: 1.14
198
+ [2025-10-28 04:50:34] (step=0016000) Train Loss: 0.7574, Train Steps/Sec: 1.14
199
+ [2025-10-28 04:52:01] (step=0016100) Train Loss: 0.7562, Train Steps/Sec: 1.14
200
+ [2025-10-28 04:53:29] (step=0016200) Train Loss: 0.7552, Train Steps/Sec: 1.14
201
+ [2025-10-28 04:54:25] Beginning epoch 13...
202
+ [2025-10-28 04:55:01] (step=0016300) Train Loss: 0.7562, Train Steps/Sec: 1.10
203
+ [2025-10-28 04:56:29] (step=0016400) Train Loss: 0.7558, Train Steps/Sec: 1.14
204
+ [2025-10-28 04:57:57] (step=0016500) Train Loss: 0.7559, Train Steps/Sec: 1.13
205
+ [2025-10-28 04:59:25] (step=0016600) Train Loss: 0.7553, Train Steps/Sec: 1.14
206
+ [2025-10-28 05:00:52] (step=0016700) Train Loss: 0.7549, Train Steps/Sec: 1.14
207
+ [2025-10-28 05:02:20] (step=0016800) Train Loss: 0.7546, Train Steps/Sec: 1.14
208
+ [2025-10-28 05:03:48] (step=0016900) Train Loss: 0.7547, Train Steps/Sec: 1.14
209
+ [2025-10-28 05:05:16] (step=0017000) Train Loss: 0.7541, Train Steps/Sec: 1.14
210
+ [2025-10-28 05:06:43] (step=0017100) Train Loss: 0.7534, Train Steps/Sec: 1.14
211
+ [2025-10-28 05:08:11] (step=0017200) Train Loss: 0.7537, Train Steps/Sec: 1.14
212
+ [2025-10-28 05:09:40] (step=0017300) Train Loss: 0.7537, Train Steps/Sec: 1.13
213
+ [2025-10-28 05:11:07] (step=0017400) Train Loss: 0.7529, Train Steps/Sec: 1.14
214
+ [2025-10-28 05:12:35] (step=0017500) Train Loss: 0.7545, Train Steps/Sec: 1.14
215
+ [2025-10-28 05:12:48] Beginning epoch 14...
216
+ [2025-10-28 05:14:07] (step=0017600) Train Loss: 0.7515, Train Steps/Sec: 1.09
217
+ [2025-10-28 05:15:34] (step=0017700) Train Loss: 0.7528, Train Steps/Sec: 1.14
218
+ [2025-10-28 05:17:02] (step=0017800) Train Loss: 0.7536, Train Steps/Sec: 1.14
219
+ [2025-10-28 05:18:30] (step=0017900) Train Loss: 0.7535, Train Steps/Sec: 1.14
220
+ [2025-10-28 05:19:58] (step=0018000) Train Loss: 0.7536, Train Steps/Sec: 1.14
221
+ [2025-10-28 05:21:26] (step=0018100) Train Loss: 0.7527, Train Steps/Sec: 1.14
222
+ [2025-10-28 05:22:54] (step=0018200) Train Loss: 0.7528, Train Steps/Sec: 1.13
223
+ [2025-10-28 05:24:22] (step=0018300) Train Loss: 0.7527, Train Steps/Sec: 1.14
224
+ [2025-10-28 05:25:50] (step=0018400) Train Loss: 0.7534, Train Steps/Sec: 1.14
225
+ [2025-10-28 05:27:17] (step=0018500) Train Loss: 0.7521, Train Steps/Sec: 1.14
226
+ [2025-10-28 05:28:45] (step=0018600) Train Loss: 0.7521, Train Steps/Sec: 1.14
227
+ [2025-10-28 05:30:13] (step=0018700) Train Loss: 0.7521, Train Steps/Sec: 1.14
228
+ [2025-10-28 05:31:10] Beginning epoch 15...
229
+ [2025-10-28 05:31:45] (step=0018800) Train Loss: 0.7521, Train Steps/Sec: 1.09
230
+ [2025-10-28 05:33:12] (step=0018900) Train Loss: 0.7506, Train Steps/Sec: 1.14
231
+ [2025-10-28 05:34:41] (step=0019000) Train Loss: 0.7515, Train Steps/Sec: 1.13
232
+ [2025-10-28 05:36:08] (step=0019100) Train Loss: 0.7512, Train Steps/Sec: 1.14
233
+ [2025-10-28 05:37:36] (step=0019200) Train Loss: 0.7516, Train Steps/Sec: 1.14
234
+ [2025-10-28 05:39:04] (step=0019300) Train Loss: 0.7509, Train Steps/Sec: 1.14
235
+ [2025-10-28 05:40:32] (step=0019400) Train Loss: 0.7508, Train Steps/Sec: 1.14
236
+ [2025-10-28 05:41:59] (step=0019500) Train Loss: 0.7503, Train Steps/Sec: 1.14
237
+ [2025-10-28 05:43:27] (step=0019600) Train Loss: 0.7523, Train Steps/Sec: 1.14
238
+ [2025-10-28 05:44:55] (step=0019700) Train Loss: 0.7512, Train Steps/Sec: 1.14
239
+ [2025-10-28 05:46:23] (step=0019800) Train Loss: 0.7507, Train Steps/Sec: 1.14
240
+ [2025-10-28 05:47:51] (step=0019900) Train Loss: 0.7505, Train Steps/Sec: 1.13
241
+ [2025-10-28 05:49:19] (step=0020000) Train Loss: 0.7495, Train Steps/Sec: 1.14
242
+ [2025-10-28 05:49:34] Beginning epoch 16...
243
+ [2025-10-28 05:50:51] (step=0020100) Train Loss: 0.7496, Train Steps/Sec: 1.09
244
+ [2025-10-28 05:52:18] (step=0020200) Train Loss: 0.7492, Train Steps/Sec: 1.14
245
+ [2025-10-28 05:53:46] (step=0020300) Train Loss: 0.7499, Train Steps/Sec: 1.14
246
+ [2025-10-28 05:55:14] (step=0020400) Train Loss: 0.7490, Train Steps/Sec: 1.14
247
+ [2025-10-28 05:56:42] (step=0020500) Train Loss: 0.7492, Train Steps/Sec: 1.14
248
+ [2025-10-28 05:58:10] (step=0020600) Train Loss: 0.7504, Train Steps/Sec: 1.14
249
+ [2025-10-28 05:59:37] (step=0020700) Train Loss: 0.7500, Train Steps/Sec: 1.14
250
+ [2025-10-28 06:01:06] (step=0020800) Train Loss: 0.7488, Train Steps/Sec: 1.13
251
+ [2025-10-28 06:02:34] (step=0020900) Train Loss: 0.7500, Train Steps/Sec: 1.14
252
+ [2025-10-28 06:04:01] (step=0021000) Train Loss: 0.7497, Train Steps/Sec: 1.14
253
+ [2025-10-28 06:05:29] (step=0021100) Train Loss: 0.7489, Train Steps/Sec: 1.14
254
+ [2025-10-28 06:06:57] (step=0021200) Train Loss: 0.7488, Train Steps/Sec: 1.14
255
+ [2025-10-28 06:07:56] Beginning epoch 17...
256
+ [2025-10-28 06:08:28] (step=0021300) Train Loss: 0.7479, Train Steps/Sec: 1.09
257
+ [2025-10-28 06:09:56] (step=0021400) Train Loss: 0.7494, Train Steps/Sec: 1.14
258
+ [2025-10-28 06:11:24] (step=0021500) Train Loss: 0.7487, Train Steps/Sec: 1.14
259
+ [2025-10-28 06:12:52] (step=0021600) Train Loss: 0.7478, Train Steps/Sec: 1.13
260
+ [2025-10-28 06:14:20] (step=0021700) Train Loss: 0.7483, Train Steps/Sec: 1.14
261
+ [2025-10-28 06:15:48] (step=0021800) Train Loss: 0.7478, Train Steps/Sec: 1.14
262
+ [2025-10-28 06:17:15] (step=0021900) Train Loss: 0.7472, Train Steps/Sec: 1.14
263
+ [2025-10-28 06:18:43] (step=0022000) Train Loss: 0.7478, Train Steps/Sec: 1.14
264
+ [2025-10-28 06:20:11] (step=0022100) Train Loss: 0.7477, Train Steps/Sec: 1.14
265
+ [2025-10-28 06:21:39] (step=0022200) Train Loss: 0.7477, Train Steps/Sec: 1.14
266
+ [2025-10-28 06:23:07] (step=0022300) Train Loss: 0.7480, Train Steps/Sec: 1.14
267
+ [2025-10-28 06:24:34] (step=0022400) Train Loss: 0.7483, Train Steps/Sec: 1.14
268
+ [2025-10-28 06:26:03] (step=0022500) Train Loss: 0.7488, Train Steps/Sec: 1.13
269
+ [2025-10-28 06:26:19] Beginning epoch 18...
270
+ [2025-10-28 06:27:34] (step=0022600) Train Loss: 0.7477, Train Steps/Sec: 1.09
271
+ [2025-10-28 06:29:02] (step=0022700) Train Loss: 0.7472, Train Steps/Sec: 1.14
272
+ [2025-10-28 06:30:30] (step=0022800) Train Loss: 0.7484, Train Steps/Sec: 1.14
273
+ [2025-10-28 06:31:57] (step=0022900) Train Loss: 0.7467, Train Steps/Sec: 1.14
274
+ [2025-10-28 06:33:25] (step=0023000) Train Loss: 0.7467, Train Steps/Sec: 1.14
275
+ [2025-10-28 06:34:53] (step=0023100) Train Loss: 0.7468, Train Steps/Sec: 1.14
276
+ [2025-10-28 06:36:21] (step=0023200) Train Loss: 0.7467, Train Steps/Sec: 1.14
277
+ [2025-10-28 06:37:49] (step=0023300) Train Loss: 0.7465, Train Steps/Sec: 1.14
278
+ [2025-10-28 06:39:17] (step=0023400) Train Loss: 0.7464, Train Steps/Sec: 1.13
279
+ [2025-10-28 06:40:45] (step=0023500) Train Loss: 0.7462, Train Steps/Sec: 1.14
280
+ [2025-10-28 06:42:12] (step=0023600) Train Loss: 0.7472, Train Steps/Sec: 1.14
281
+ [2025-10-28 06:43:40] (step=0023700) Train Loss: 0.7467, Train Steps/Sec: 1.14
282
+ [2025-10-28 06:44:41] Beginning epoch 19...
283
+ [2025-10-28 06:45:12] (step=0023800) Train Loss: 0.7464, Train Steps/Sec: 1.09
284
+ [2025-10-28 06:46:39] (step=0023900) Train Loss: 0.7467, Train Steps/Sec: 1.14
285
+ [2025-10-28 06:48:07] (step=0024000) Train Loss: 0.7475, Train Steps/Sec: 1.14
286
+ [2025-10-28 06:49:35] (step=0024100) Train Loss: 0.7448, Train Steps/Sec: 1.14
287
+ [2025-10-28 06:51:03] (step=0024200) Train Loss: 0.7459, Train Steps/Sec: 1.13
288
+ [2025-10-28 06:52:31] (step=0024300) Train Loss: 0.7465, Train Steps/Sec: 1.14
289
+ [2025-10-28 06:53:59] (step=0024400) Train Loss: 0.7463, Train Steps/Sec: 1.14
290
+ [2025-10-28 06:55:27] (step=0024500) Train Loss: 0.7462, Train Steps/Sec: 1.14
291
+ [2025-10-28 06:56:55] (step=0024600) Train Loss: 0.7461, Train Steps/Sec: 1.14
292
+ [2025-10-28 06:58:22] (step=0024700) Train Loss: 0.7467, Train Steps/Sec: 1.14
293
+ [2025-10-28 06:59:50] (step=0024800) Train Loss: 0.7465, Train Steps/Sec: 1.14
294
+ [2025-10-28 07:01:18] (step=0024900) Train Loss: 0.7455, Train Steps/Sec: 1.14
295
+ [2025-10-28 07:02:46] (step=0025000) Train Loss: 0.7446, Train Steps/Sec: 1.14
296
+ [2025-10-28 07:03:37] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-pe-vit-g-bf16/checkpoints/0025000.pt
297
+ [2025-10-28 07:03:37] Generating EMA samples...
298
+ [2025-10-28 07:04:05] Generating EMA samples done.
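
The step lines in this log follow a fixed pattern, so they can be pulled into records for plotting or for sanity checks. The sketch below is one possible parser: the regular expression and the function name `parse_step_lines` are illustrative, not taken from the training code. The sample lines are copied from the log on either side of the step-25000 checkpoint save.

```python
import re
from datetime import datetime

# Matches lines such as:
# [2025-10-28 07:02:46] (step=0025000) Train Loss: 0.7446, Train Steps/Sec: 1.14
STEP_RE = re.compile(
    r"\[(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\] "
    r"\(step=(?P<step>\d+)\) Train Loss: (?P<loss>[\d.]+), "
    r"Train Steps/Sec: (?P<rate>[\d.]+)"
)


def parse_step_lines(lines):
    """Return one record per training-step line, skipping all other lines."""
    records = []
    for line in lines:
        match = STEP_RE.search(line)
        if match is None:
            continue
        records.append({
            "time": datetime.strptime(match["ts"], "%Y-%m-%d %H:%M:%S"),
            "step": int(match["step"]),
            "loss": float(match["loss"]),
            "steps_per_sec": float(match["rate"]),
        })
    return records


sample = [
    "+ [2025-10-28 07:02:46] (step=0025000) Train Loss: 0.7446, Train Steps/Sec: 1.14",
    "+ [2025-10-28 07:03:37] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-pe-vit-g-bf16/checkpoints/0025000.pt",
    "+ [2025-10-28 07:05:39] (step=0025100) Train Loss: 0.7453, Train Steps/Sec: 0.58",
]

records = parse_step_lines(sample)
elapsed = (records[1]["time"] - records[0]["time"]).total_seconds()
steps = records[1]["step"] - records[0]["step"]
print(f"{steps} steps in {elapsed:.0f} s = {steps / elapsed:.2f} steps/sec")
```

Recomputing the rate from the timestamps gives about 0.58 steps/sec for this window, matching the logged value. That suggests the dip from the usual 1.14 comes from the checkpoint save and EMA sampling falling inside the 100-step measurement window, not from slower training steps.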
299
+ [2025-10-28 07:04:23] Beginning epoch 20...
300
+ [2025-10-28 07:05:39] (step=0025100) Train Loss: 0.7453, Train Steps/Sec: 0.58
301
+ [2025-10-28 07:07:06] (step=0025200) Train Loss: 0.7448, Train Steps/Sec: 1.14
302
+ [2025-10-28 07:08:34] (step=0025300) Train Loss: 0.7446, Train Steps/Sec: 1.14
303
+ [2025-10-28 07:10:02] (step=0025400) Train Loss: 0.7443, Train Steps/Sec: 1.14
304
+ [2025-10-28 07:11:30] (step=0025500) Train Loss: 0.7446, Train Steps/Sec: 1.14
305
+ [2025-10-28 07:12:58] (step=0025600) Train Loss: 0.7441, Train Steps/Sec: 1.14
306
+ [2025-10-28 07:14:25] (step=0025700) Train Loss: 0.7435, Train Steps/Sec: 1.14
307
+ [2025-10-28 07:15:53] (step=0025800) Train Loss: 0.7443, Train Steps/Sec: 1.14
308
+ [2025-10-28 07:17:21] (step=0025900) Train Loss: 0.7452, Train Steps/Sec: 1.13
309
+ [2025-10-28 07:18:49] (step=0026000) Train Loss: 0.7446, Train Steps/Sec: 1.14
310
+ [2025-10-28 07:20:17] (step=0026100) Train Loss: 0.7452, Train Steps/Sec: 1.14
311
+ [2025-10-28 07:21:45] (step=0026200) Train Loss: 0.7433, Train Steps/Sec: 1.14
312
+ [2025-10-28 07:22:48] Beginning epoch 21...
313
+ [2025-10-28 07:23:18] (step=0026300) Train Loss: 0.7440, Train Steps/Sec: 1.07
314
+ [2025-10-28 07:24:46] (step=0026400) Train Loss: 0.7437, Train Steps/Sec: 1.14
315
+ [2025-10-28 07:26:14] (step=0026500) Train Loss: 0.7427, Train Steps/Sec: 1.14
316
+ [2025-10-28 07:27:42] (step=0026600) Train Loss: 0.7429, Train Steps/Sec: 1.14
317
+ [2025-10-28 07:29:10] (step=0026700) Train Loss: 0.7438, Train Steps/Sec: 1.14
318
+ [2025-10-28 07:30:38] (step=0026800) Train Loss: 0.7442, Train Steps/Sec: 1.13
319
+ [2025-10-28 07:32:06] (step=0026900) Train Loss: 0.7446, Train Steps/Sec: 1.14
320
+ [2025-10-28 07:33:33] (step=0027000) Train Loss: 0.7443, Train Steps/Sec: 1.14
321
+ [2025-10-28 07:35:01] (step=0027100) Train Loss: 0.7444, Train Steps/Sec: 1.14
322
+ [2025-10-28 07:36:29] (step=0027200) Train Loss: 0.7432, Train Steps/Sec: 1.14
323
+ [2025-10-28 07:37:57] (step=0027300) Train Loss: 0.7438, Train Steps/Sec: 1.14
324
+ [2025-10-28 07:39:25] (step=0027400) Train Loss: 0.7443, Train Steps/Sec: 1.14
325
+ [2025-10-28 07:40:52] (step=0027500) Train Loss: 0.7440, Train Steps/Sec: 1.14
326
+ [2025-10-28 07:41:12] Beginning epoch 22...
327
+ [2025-10-28 07:42:26] (step=0027600) Train Loss: 0.7430, Train Steps/Sec: 1.07
328
+ [2025-10-28 07:43:54] (step=0027700) Train Loss: 0.7438, Train Steps/Sec: 1.13
329
+ [2025-10-28 07:45:22] (step=0027800) Train Loss: 0.7423, Train Steps/Sec: 1.14
330
+ [2025-10-28 07:46:50] (step=0027900) Train Loss: 0.7432, Train Steps/Sec: 1.14
331
+ [2025-10-28 07:48:18] (step=0028000) Train Loss: 0.7436, Train Steps/Sec: 1.14
332
+ [2025-10-28 07:49:45] (step=0028100) Train Loss: 0.7423, Train Steps/Sec: 1.14
333
+ [2025-10-28 07:51:13] (step=0028200) Train Loss: 0.7432, Train Steps/Sec: 1.14
334
+ [2025-10-28 07:52:41] (step=0028300) Train Loss: 0.7420, Train Steps/Sec: 1.14
335
+ [2025-10-28 07:54:09] (step=0028400) Train Loss: 0.7413, Train Steps/Sec: 1.14
336
+ [2025-10-28 07:55:37] (step=0028500) Train Loss: 0.7430, Train Steps/Sec: 1.14
337
+ [2025-10-28 07:57:05] (step=0028600) Train Loss: 0.7427, Train Steps/Sec: 1.14
338
+ [2025-10-28 07:58:32] (step=0028700) Train Loss: 0.7430, Train Steps/Sec: 1.14
339
+ [2025-10-28 07:59:37] Beginning epoch 23...
340
+ [2025-10-28 08:00:05] (step=0028800) Train Loss: 0.7438, Train Steps/Sec: 1.07
341
+ [2025-10-28 08:01:33] (step=0028900) Train Loss: 0.7428, Train Steps/Sec: 1.14
342
+ [2025-10-28 08:03:01] (step=0029000) Train Loss: 0.7412, Train Steps/Sec: 1.14
343
+ [2025-10-28 08:04:29] (step=0029100) Train Loss: 0.7418, Train Steps/Sec: 1.14
344
+ [2025-10-28 08:05:57] (step=0029200) Train Loss: 0.7417, Train Steps/Sec: 1.14
345
+ [2025-10-28 08:07:25] (step=0029300) Train Loss: 0.7408, Train Steps/Sec: 1.14
346
+ [2025-10-28 08:08:53] (step=0029400) Train Loss: 0.7415, Train Steps/Sec: 1.13
347
+ [2025-10-28 08:10:21] (step=0029500) Train Loss: 0.7424, Train Steps/Sec: 1.14
348
+ [2025-10-28 08:11:48] (step=0029600) Train Loss: 0.7434, Train Steps/Sec: 1.14
349
+ [2025-10-28 08:13:16] (step=0029700) Train Loss: 0.7416, Train Steps/Sec: 1.14
350
+ [2025-10-28 08:14:44] (step=0029800) Train Loss: 0.7423, Train Steps/Sec: 1.14
351
+ [2025-10-28 08:16:12] (step=0029900) Train Loss: 0.7421, Train Steps/Sec: 1.14
352
+ [2025-10-28 08:17:40] (step=0030000) Train Loss: 0.7410, Train Steps/Sec: 1.14
353
+ [2025-10-28 08:18:01] Beginning epoch 24...
354
+ [2025-10-28 08:19:13] (step=0030100) Train Loss: 0.7414, Train Steps/Sec: 1.07
355
+ [2025-10-28 08:20:41] (step=0030200) Train Loss: 0.7405, Train Steps/Sec: 1.14
356
+ [2025-10-28 08:22:09] (step=0030300) Train Loss: 0.7410, Train Steps/Sec: 1.13
357
+ [2025-10-28 08:23:37] (step=0030400) Train Loss: 0.7416, Train Steps/Sec: 1.14
358
+ [2025-10-28 08:25:05] (step=0030500) Train Loss: 0.7409, Train Steps/Sec: 1.14
359
+ [2025-10-28 08:26:33] (step=0030600) Train Loss: 0.7416, Train Steps/Sec: 1.14
360
+ [2025-10-28 08:28:00] (step=0030700) Train Loss: 0.7407, Train Steps/Sec: 1.14
361
+ [2025-10-28 08:29:28] (step=0030800) Train Loss: 0.7419, Train Steps/Sec: 1.14
362
+ [2025-10-28 08:30:56] (step=0030900) Train Loss: 0.7406, Train Steps/Sec: 1.14
363
+ [2025-10-28 08:32:24] (step=0031000) Train Loss: 0.7406, Train Steps/Sec: 1.14
364
+ [2025-10-28 08:33:52] (step=0031100) Train Loss: 0.7411, Train Steps/Sec: 1.14
365
+ [2025-10-28 08:35:19] (step=0031200) Train Loss: 0.7409, Train Steps/Sec: 1.14
366
+ [2025-10-28 08:36:26] Beginning epoch 25...
367
+ [2025-10-28 08:36:52] (step=0031300) Train Loss: 0.7401, Train Steps/Sec: 1.08
368
+ [2025-10-28 08:38:20] (step=0031400) Train Loss: 0.7404, Train Steps/Sec: 1.14
369
+ [2025-10-28 08:39:48] (step=0031500) Train Loss: 0.7415, Train Steps/Sec: 1.14
370
+ [2025-10-28 08:41:16] (step=0031600) Train Loss: 0.7406, Train Steps/Sec: 1.14
371
+ [2025-10-28 08:42:43] (step=0031700) Train Loss: 0.7400, Train Steps/Sec: 1.14
372
+ [2025-10-28 08:44:11] (step=0031800) Train Loss: 0.7399, Train Steps/Sec: 1.14
373
+ [2025-10-28 08:45:39] (step=0031900) Train Loss: 0.7404, Train Steps/Sec: 1.14
374
+ [2025-10-28 08:47:07] (step=0032000) Train Loss: 0.7404, Train Steps/Sec: 1.13
375
+ [2025-10-28 08:48:35] (step=0032100) Train Loss: 0.7393, Train Steps/Sec: 1.14
376
+ [2025-10-28 08:50:03] (step=0032200) Train Loss: 0.7394, Train Steps/Sec: 1.14
377
+ [2025-10-28 08:51:31] (step=0032300) Train Loss: 0.7401, Train Steps/Sec: 1.14
378
+ [2025-10-28 08:52:58] (step=0032400) Train Loss: 0.7394, Train Steps/Sec: 1.14
379
+ [2025-10-28 08:54:26] (step=0032500) Train Loss: 0.7398, Train Steps/Sec: 1.14
380
+ [2025-10-28 08:54:50] Beginning epoch 26...
381
+ [2025-10-28 08:55:59] (step=0032600) Train Loss: 0.7395, Train Steps/Sec: 1.07
382
+ [2025-10-28 08:57:27] (step=0032700) Train Loss: 0.7391, Train Steps/Sec: 1.14
383
+ [2025-10-28 08:58:55] (step=0032800) Train Loss: 0.7394, Train Steps/Sec: 1.13
384
+ [2025-10-28 09:00:23] (step=0032900) Train Loss: 0.7381, Train Steps/Sec: 1.14
385
+ [2025-10-28 09:01:51] (step=0033000) Train Loss: 0.7400, Train Steps/Sec: 1.14
386
+ [2025-10-28 09:03:19] (step=0033100) Train Loss: 0.7395, Train Steps/Sec: 1.14
387
+ [2025-10-28 09:04:46] (step=0033200) Train Loss: 0.7393, Train Steps/Sec: 1.14
388
+ [2025-10-28 09:06:14] (step=0033300) Train Loss: 0.7402, Train Steps/Sec: 1.14
389
+ [2025-10-28 09:07:42] (step=0033400) Train Loss: 0.7403, Train Steps/Sec: 1.14
390
+ [2025-10-28 09:09:10] (step=0033500) Train Loss: 0.7394, Train Steps/Sec: 1.14
391
+ [2025-10-28 09:10:37] (step=0033600) Train Loss: 0.7382, Train Steps/Sec: 1.14
392
+ [2025-10-28 09:12:06] (step=0033700) Train Loss: 0.7400, Train Steps/Sec: 1.13
393
+ [2025-10-28 09:13:14] Beginning epoch 27...
394
+ [2025-10-28 09:13:39] (step=0033800) Train Loss: 0.7393, Train Steps/Sec: 1.07
395
+ [2025-10-28 09:15:07] (step=0033900) Train Loss: 0.7385, Train Steps/Sec: 1.14
396
+ [2025-10-28 09:16:34] (step=0034000) Train Loss: 0.7399, Train Steps/Sec: 1.14
397
+ [2025-10-28 09:18:02] (step=0034100) Train Loss: 0.7387, Train Steps/Sec: 1.14
398
+ [2025-10-28 09:19:30] (step=0034200) Train Loss: 0.7391, Train Steps/Sec: 1.14
399
+ [2025-10-28 09:20:58] (step=0034300) Train Loss: 0.7392, Train Steps/Sec: 1.14
400
+ [2025-10-28 09:22:25] (step=0034400) Train Loss: 0.7382, Train Steps/Sec: 1.14
401
+ [2025-10-28 09:23:53] (step=0034500) Train Loss: 0.7388, Train Steps/Sec: 1.13
402
+ [2025-10-28 09:25:22] (step=0034600) Train Loss: 0.7383, Train Steps/Sec: 1.13
403
+ [2025-10-28 09:26:49] (step=0034700) Train Loss: 0.7386, Train Steps/Sec: 1.14
404
+ [2025-10-28 09:28:17] (step=0034800) Train Loss: 0.7375, Train Steps/Sec: 1.14
405
+ [2025-10-28 09:29:45] (step=0034900) Train Loss: 0.7383, Train Steps/Sec: 1.14
406
+ [2025-10-28 09:31:13] (step=0035000) Train Loss: 0.7390, Train Steps/Sec: 1.14
407
+ [2025-10-28 09:31:38] Beginning epoch 28...
408
+ [2025-10-28 09:32:46] (step=0035100) Train Loss: 0.7385, Train Steps/Sec: 1.07
409
+ [2025-10-28 09:34:14] (step=0035200) Train Loss: 0.7384, Train Steps/Sec: 1.14
410
+ [2025-10-28 09:35:42] (step=0035300) Train Loss: 0.7377, Train Steps/Sec: 1.14
411
+ [2025-10-28 09:37:10] (step=0035400) Train Loss: 0.7382, Train Steps/Sec: 1.13
412
+ [2025-10-28 09:38:38] (step=0035500) Train Loss: 0.7383, Train Steps/Sec: 1.14
413
+ [2025-10-28 09:40:05] (step=0035600) Train Loss: 0.7385, Train Steps/Sec: 1.14
414
+ [2025-10-28 09:41:33] (step=0035700) Train Loss: 0.7381, Train Steps/Sec: 1.14
415
+ [2025-10-28 09:43:01] (step=0035800) Train Loss: 0.7369, Train Steps/Sec: 1.14
416
+ [2025-10-28 09:44:29] (step=0035900) Train Loss: 0.7384, Train Steps/Sec: 1.14
417
+ [2025-10-28 09:45:57] (step=0036000) Train Loss: 0.7377, Train Steps/Sec: 1.14
418
+ [2025-10-28 09:47:24] (step=0036100) Train Loss: 0.7386, Train Steps/Sec: 1.14
419
+ [2025-10-28 09:48:52] (step=0036200) Train Loss: 0.7372, Train Steps/Sec: 1.14
420
+ [2025-10-28 09:50:02] Beginning epoch 29...
421
+ [2025-10-28 09:50:26] (step=0036300) Train Loss: 0.7374, Train Steps/Sec: 1.07
422
+ [2025-10-28 09:51:54] (step=0036400) Train Loss: 0.7370, Train Steps/Sec: 1.14
423
+ [2025-10-28 09:53:21] (step=0036500) Train Loss: 0.7368, Train Steps/Sec: 1.14
424
+ [2025-10-28 09:54:49] (step=0036600) Train Loss: 0.7374, Train Steps/Sec: 1.14
425
+ [2025-10-28 09:56:17] (step=0036700) Train Loss: 0.7375, Train Steps/Sec: 1.14
426
+ [2025-10-28 09:57:45] (step=0036800) Train Loss: 0.7366, Train Steps/Sec: 1.14
427
+ [2025-10-28 09:59:12] (step=0036900) Train Loss: 0.7369, Train Steps/Sec: 1.14
428
+ [2025-10-28 10:00:40] (step=0037000) Train Loss: 0.7371, Train Steps/Sec: 1.14
429
+ [2025-10-28 10:02:08] (step=0037100) Train Loss: 0.7371, Train Steps/Sec: 1.14
430
+ [2025-10-28 10:03:36] (step=0037200) Train Loss: 0.7379, Train Steps/Sec: 1.14
431
+ [2025-10-28 10:05:04] (step=0037300) Train Loss: 0.7374, Train Steps/Sec: 1.14
432
+ [2025-10-28 10:06:32] (step=0037400) Train Loss: 0.7378, Train Steps/Sec: 1.14
433
+ [2025-10-28 10:08:00] (step=0037500) Train Loss: 0.7372, Train Steps/Sec: 1.14
434
+ [2025-10-28 10:08:27] Beginning epoch 30...
435
+ [2025-10-28 10:09:33] (step=0037600) Train Loss: 0.7365, Train Steps/Sec: 1.07
436
+ [2025-10-28 10:11:01] (step=0037700) Train Loss: 0.7371, Train Steps/Sec: 1.14
437
+ [2025-10-28 10:12:29] (step=0037800) Train Loss: 0.7379, Train Steps/Sec: 1.14
438
+ [2025-10-28 10:13:57] (step=0037900) Train Loss: 0.7370, Train Steps/Sec: 1.14
439
+ [2025-10-28 10:15:25] (step=0038000) Train Loss: 0.7360, Train Steps/Sec: 1.13
440
+ [2025-10-28 10:16:53] (step=0038100) Train Loss: 0.7363, Train Steps/Sec: 1.14
441
+ [2025-10-28 10:18:21] (step=0038200) Train Loss: 0.7366, Train Steps/Sec: 1.14
442
+ [2025-10-28 10:19:48] (step=0038300) Train Loss: 0.7365, Train Steps/Sec: 1.14
443
+ [2025-10-28 10:21:16] (step=0038400) Train Loss: 0.7376, Train Steps/Sec: 1.14
444
+ [2025-10-28 10:22:44] (step=0038500) Train Loss: 0.7352, Train Steps/Sec: 1.14
445
+ [2025-10-28 10:24:12] (step=0038600) Train Loss: 0.7360, Train Steps/Sec: 1.14
446
+ [2025-10-28 10:25:40] (step=0038700) Train Loss: 0.7362, Train Steps/Sec: 1.14
447
+ [2025-10-28 10:26:51] Beginning epoch 31...
448
+ [2025-10-28 10:27:13] (step=0038800) Train Loss: 0.7359, Train Steps/Sec: 1.07
449
+ [2025-10-28 10:28:41] (step=0038900) Train Loss: 0.7342, Train Steps/Sec: 1.13
450
+ [2025-10-28 10:30:09] (step=0039000) Train Loss: 0.7370, Train Steps/Sec: 1.14
451
+ [2025-10-28 10:31:37] (step=0039100) Train Loss: 0.7358, Train Steps/Sec: 1.14
452
+ [2025-10-28 10:33:05] (step=0039200) Train Loss: 0.7357, Train Steps/Sec: 1.14
453
+ [2025-10-28 10:34:32] (step=0039300) Train Loss: 0.7371, Train Steps/Sec: 1.14
454
+ [2025-10-28 10:36:00] (step=0039400) Train Loss: 0.7353, Train Steps/Sec: 1.14
455
+ [2025-10-28 10:37:28] (step=0039500) Train Loss: 0.7355, Train Steps/Sec: 1.14
456
+ [2025-10-28 10:38:56] (step=0039600) Train Loss: 0.7366, Train Steps/Sec: 1.14
457
+ [2025-10-28 10:40:24] (step=0039700) Train Loss: 0.7364, Train Steps/Sec: 1.14
458
+ [2025-10-28 10:41:52] (step=0039800) Train Loss: 0.7362, Train Steps/Sec: 1.14
459
+ [2025-10-28 10:43:20] (step=0039900) Train Loss: 0.7366, Train Steps/Sec: 1.14
460
+ [2025-10-28 10:44:47] (step=0040000) Train Loss: 0.7355, Train Steps/Sec: 1.14
461
+ [2025-10-28 10:45:16] Beginning epoch 32...
462
+ [2025-10-28 10:46:20] (step=0040100) Train Loss: 0.7357, Train Steps/Sec: 1.07
463
+ [2025-10-28 10:47:48] (step=0040200) Train Loss: 0.7353, Train Steps/Sec: 1.14
464
+ [2025-10-28 10:49:16] (step=0040300) Train Loss: 0.7348, Train Steps/Sec: 1.14
465
+ [2025-10-28 10:50:44] (step=0040400) Train Loss: 0.7356, Train Steps/Sec: 1.14
466
+ [2025-10-28 10:52:11] (step=0040500) Train Loss: 0.7353, Train Steps/Sec: 1.14
467
+ [2025-10-28 10:53:40] (step=0040600) Train Loss: 0.7358, Train Steps/Sec: 1.13
468
+ [2025-10-28 10:55:08] (step=0040700) Train Loss: 0.7359, Train Steps/Sec: 1.14
469
+ [2025-10-28 10:56:35] (step=0040800) Train Loss: 0.7365, Train Steps/Sec: 1.14
470
+ [2025-10-28 10:58:03] (step=0040900) Train Loss: 0.7349, Train Steps/Sec: 1.14
471
+ [2025-10-28 10:59:31] (step=0041000) Train Loss: 0.7336, Train Steps/Sec: 1.14
472
+ [2025-10-28 11:00:59] (step=0041100) Train Loss: 0.7354, Train Steps/Sec: 1.14
473
+ [2025-10-28 11:02:26] (step=0041200) Train Loss: 0.7339, Train Steps/Sec: 1.14
474
+ [2025-10-28 11:03:40] Beginning epoch 33...
475
+ [2025-10-28 11:03:59] (step=0041300) Train Loss: 0.7357, Train Steps/Sec: 1.08
476
+ [2025-10-28 11:05:27] (step=0041400) Train Loss: 0.7355, Train Steps/Sec: 1.14
477
+ [2025-10-28 11:06:56] (step=0041500) Train Loss: 0.7343, Train Steps/Sec: 1.13
478
+ [2025-10-28 11:08:23] (step=0041600) Train Loss: 0.7338, Train Steps/Sec: 1.14
479
+ [2025-10-28 11:09:51] (step=0041700) Train Loss: 0.7345, Train Steps/Sec: 1.14
480
+ [2025-10-28 11:11:19] (step=0041800) Train Loss: 0.7364, Train Steps/Sec: 1.14
481
+ [2025-10-28 11:12:47] (step=0041900) Train Loss: 0.7342, Train Steps/Sec: 1.14
482
+ [2025-10-28 11:14:14] (step=0042000) Train Loss: 0.7349, Train Steps/Sec: 1.14
483
+ [2025-10-28 11:15:42] (step=0042100) Train Loss: 0.7361, Train Steps/Sec: 1.14
484
+ [2025-10-28 11:17:10] (step=0042200) Train Loss: 0.7351, Train Steps/Sec: 1.14
485
+ [2025-10-28 11:18:38] (step=0042300) Train Loss: 0.7347, Train Steps/Sec: 1.13
486
+ [2025-10-28 11:20:06] (step=0042400) Train Loss: 0.7352, Train Steps/Sec: 1.14
487
+ [2025-10-28 11:21:34] (step=0042500) Train Loss: 0.7336, Train Steps/Sec: 1.14
488
+ [2025-10-28 11:22:04] Beginning epoch 34...
489
+ [2025-10-28 11:23:07] (step=0042600) Train Loss: 0.7341, Train Steps/Sec: 1.07
490
+ [2025-10-28 11:24:35] (step=0042700) Train Loss: 0.7338, Train Steps/Sec: 1.14
491
+ [2025-10-28 11:26:02] (step=0042800) Train Loss: 0.7340, Train Steps/Sec: 1.14
492
+ [2025-10-28 11:27:30] (step=0042900) Train Loss: 0.7348, Train Steps/Sec: 1.14
493
+ [2025-10-28 11:28:58] (step=0043000) Train Loss: 0.7339, Train Steps/Sec: 1.14
494
+ [2025-10-28 11:30:26] (step=0043100) Train Loss: 0.7355, Train Steps/Sec: 1.14
495
+ [2025-10-28 11:31:54] (step=0043200) Train Loss: 0.7340, Train Steps/Sec: 1.13
496
+ [2025-10-28 11:33:22] (step=0043300) Train Loss: 0.7342, Train Steps/Sec: 1.14
497
+ [2025-10-28 11:34:50] (step=0043400) Train Loss: 0.7346, Train Steps/Sec: 1.14
498
+ [2025-10-28 11:36:17] (step=0043500) Train Loss: 0.7341, Train Steps/Sec: 1.14
499
+ [2025-10-28 11:37:45] (step=0043600) Train Loss: 0.7341, Train Steps/Sec: 1.14
500
+ [2025-10-28 11:39:13] (step=0043700) Train Loss: 0.7345, Train Steps/Sec: 1.14
501
+ [2025-10-28 11:40:28] Beginning epoch 35...
502
+ [2025-10-28 11:40:46] (step=0043800) Train Loss: 0.7348, Train Steps/Sec: 1.07
503
+ [2025-10-28 11:42:14] (step=0043900) Train Loss: 0.7342, Train Steps/Sec: 1.14
504
+ [2025-10-28 11:43:42] (step=0044000) Train Loss: 0.7335, Train Steps/Sec: 1.14
505
+ [2025-10-28 11:45:10] (step=0044100) Train Loss: 0.7330, Train Steps/Sec: 1.14
506
+ [2025-10-28 11:46:38] (step=0044200) Train Loss: 0.7336, Train Steps/Sec: 1.14
507
+ [2025-10-28 11:48:06] (step=0044300) Train Loss: 0.7343, Train Steps/Sec: 1.14
508
+ [2025-10-28 11:49:34] (step=0044400) Train Loss: 0.7327, Train Steps/Sec: 1.14
509
+ [2025-10-28 11:51:01] (step=0044500) Train Loss: 0.7338, Train Steps/Sec: 1.14
510
+ [2025-10-28 11:52:29] (step=0044600) Train Loss: 0.7349, Train Steps/Sec: 1.14
511
+ [2025-10-28 11:53:57] (step=0044700) Train Loss: 0.7342, Train Steps/Sec: 1.14
512
+ [2025-10-28 11:55:25] (step=0044800) Train Loss: 0.7341, Train Steps/Sec: 1.14
513
+ [2025-10-28 11:56:53] (step=0044900) Train Loss: 0.7333, Train Steps/Sec: 1.14
514
+ [2025-10-28 11:58:21] (step=0045000) Train Loss: 0.7342, Train Steps/Sec: 1.14
515
+ [2025-10-28 11:58:53] Beginning epoch 36...
516
+ [2025-10-28 11:59:54] (step=0045100) Train Loss: 0.7333, Train Steps/Sec: 1.07
517
+ [2025-10-28 12:01:22] (step=0045200) Train Loss: 0.7341, Train Steps/Sec: 1.14
518
+ [2025-10-28 12:02:49] (step=0045300) Train Loss: 0.7339, Train Steps/Sec: 1.14
519
+ [2025-10-28 12:04:17] (step=0045400) Train Loss: 0.7333, Train Steps/Sec: 1.14
520
+ [2025-10-28 12:05:45] (step=0045500) Train Loss: 0.7337, Train Steps/Sec: 1.14
521
+ [2025-10-28 12:07:13] (step=0045600) Train Loss: 0.7330, Train Steps/Sec: 1.14
522
+ [2025-10-28 12:08:40] (step=0045700) Train Loss: 0.7330, Train Steps/Sec: 1.14
523
+ [2025-10-28 12:10:09] (step=0045800) Train Loss: 0.7329, Train Steps/Sec: 1.13
524
+ [2025-10-28 12:11:37] (step=0045900) Train Loss: 0.7335, Train Steps/Sec: 1.14
525
+ [2025-10-28 12:13:04] (step=0046000) Train Loss: 0.7332, Train Steps/Sec: 1.14
526
+ [2025-10-28 12:14:32] (step=0046100) Train Loss: 0.7326, Train Steps/Sec: 1.14
527
+ [2025-10-28 12:16:00] (step=0046200) Train Loss: 0.7338, Train Steps/Sec: 1.14
528
+ [2025-10-28 12:17:17] Beginning epoch 37...
529
+ [2025-10-28 12:17:33] (step=0046300) Train Loss: 0.7338, Train Steps/Sec: 1.07
530
+ [2025-10-28 12:19:01] (step=0046400) Train Loss: 0.7342, Train Steps/Sec: 1.14
531
+ [2025-10-28 12:20:29] (step=0046500) Train Loss: 0.7330, Train Steps/Sec: 1.14
532
+ [2025-10-28 12:21:57] (step=0046600) Train Loss: 0.7333, Train Steps/Sec: 1.13
533
+ [2025-10-28 12:23:25] (step=0046700) Train Loss: 0.7340, Train Steps/Sec: 1.14
534
+ [2025-10-28 12:24:53] (step=0046800) Train Loss: 0.7331, Train Steps/Sec: 1.14
535
+ [2025-10-28 12:26:20] (step=0046900) Train Loss: 0.7328, Train Steps/Sec: 1.14
536
+ [2025-10-28 12:27:48] (step=0047000) Train Loss: 0.7328, Train Steps/Sec: 1.14
537
+ [2025-10-28 12:29:16] (step=0047100) Train Loss: 0.7327, Train Steps/Sec: 1.14
538
+ [2025-10-28 12:30:44] (step=0047200) Train Loss: 0.7335, Train Steps/Sec: 1.14
539
+ [2025-10-28 12:32:12] (step=0047300) Train Loss: 0.7318, Train Steps/Sec: 1.14
540
+ [2025-10-28 12:33:39] (step=0047400) Train Loss: 0.7330, Train Steps/Sec: 1.14
541
+ [2025-10-28 12:35:08] (step=0047500) Train Loss: 0.7314, Train Steps/Sec: 1.13
542
+ [2025-10-28 12:35:42] Beginning epoch 38...
543
+ [2025-10-28 12:36:40] (step=0047600) Train Loss: 0.7331, Train Steps/Sec: 1.08
544
+ [2025-10-28 12:38:08] (step=0047700) Train Loss: 0.7325, Train Steps/Sec: 1.14
545
+ [2025-10-28 12:39:36] (step=0047800) Train Loss: 0.7322, Train Steps/Sec: 1.14
546
+ [2025-10-28 12:41:04] (step=0047900) Train Loss: 0.7328, Train Steps/Sec: 1.14
547
+ [2025-10-28 12:42:32] (step=0048000) Train Loss: 0.7311, Train Steps/Sec: 1.14
548
+ [2025-10-28 12:44:00] (step=0048100) Train Loss: 0.7320, Train Steps/Sec: 1.14
549
+ [2025-10-28 12:45:27] (step=0048200) Train Loss: 0.7326, Train Steps/Sec: 1.14
550
+ [2025-10-28 12:46:55] (step=0048300) Train Loss: 0.7322, Train Steps/Sec: 1.14
551
+ [2025-10-28 12:48:24] (step=0048400) Train Loss: 0.7325, Train Steps/Sec: 1.13
552
+ [2025-10-28 12:49:51] (step=0048500) Train Loss: 0.7326, Train Steps/Sec: 1.14
553
+ [2025-10-28 12:51:19] (step=0048600) Train Loss: 0.7313, Train Steps/Sec: 1.14
554
+ [2025-10-28 12:52:47] (step=0048700) Train Loss: 0.7325, Train Steps/Sec: 1.14
555
+ [2025-10-28 12:54:06] Beginning epoch 39...
556
+ [2025-10-28 12:54:20] (step=0048800) Train Loss: 0.7318, Train Steps/Sec: 1.08
557
+ [2025-10-28 12:55:48] (step=0048900) Train Loss: 0.7313, Train Steps/Sec: 1.14
558
+ [2025-10-28 12:57:15] (step=0049000) Train Loss: 0.7312, Train Steps/Sec: 1.14
559
+ [2025-10-28 12:58:43] (step=0049100) Train Loss: 0.7320, Train Steps/Sec: 1.14
560
+ [2025-10-28 13:00:12] (step=0049200) Train Loss: 0.7313, Train Steps/Sec: 1.13
561
+ [2025-10-28 13:01:39] (step=0049300) Train Loss: 0.7321, Train Steps/Sec: 1.14
562
+ [2025-10-28 13:03:07] (step=0049400) Train Loss: 0.7325, Train Steps/Sec: 1.14
563
+ [2025-10-28 13:04:35] (step=0049500) Train Loss: 0.7307, Train Steps/Sec: 1.14
564
+ [2025-10-28 13:06:03] (step=0049600) Train Loss: 0.7314, Train Steps/Sec: 1.14
565
+ [2025-10-28 13:07:31] (step=0049700) Train Loss: 0.7312, Train Steps/Sec: 1.14
566
+ [2025-10-28 13:08:58] (step=0049800) Train Loss: 0.7308, Train Steps/Sec: 1.14
567
+ [2025-10-28 13:10:26] (step=0049900) Train Loss: 0.7331, Train Steps/Sec: 1.14
568
+ [2025-10-28 13:11:54] (step=0050000) Train Loss: 0.7325, Train Steps/Sec: 1.14
569
+ [2025-10-28 13:12:46] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-pe-vit-g-bf16/checkpoints/0050000.pt
570
+ [2025-10-28 13:12:46] Generating EMA samples...
571
+ [2025-10-28 13:13:15] Generating EMA samples done.
572
+ [2025-10-28 13:13:51] Beginning epoch 40...
573
+ [2025-10-28 13:14:49] (step=0050100) Train Loss: 0.7302, Train Steps/Sec: 0.57
574
+ [2025-10-28 13:16:17] (step=0050200) Train Loss: 0.7319, Train Steps/Sec: 1.14
575
+ [2025-10-28 13:17:44] (step=0050300) Train Loss: 0.7314, Train Steps/Sec: 1.14
576
+ [2025-10-28 13:19:12] (step=0050400) Train Loss: 0.7323, Train Steps/Sec: 1.14
577
+ [2025-10-28 13:20:40] (step=0050500) Train Loss: 0.7311, Train Steps/Sec: 1.14
578
+ [2025-10-28 13:22:08] (step=0050600) Train Loss: 0.7313, Train Steps/Sec: 1.14
579
+ [2025-10-28 13:23:36] (step=0050700) Train Loss: 0.7320, Train Steps/Sec: 1.14
580
+ [2025-10-28 13:25:03] (step=0050800) Train Loss: 0.7317, Train Steps/Sec: 1.14
581
+ [2025-10-28 13:26:31] (step=0050900) Train Loss: 0.7310, Train Steps/Sec: 1.14
582
+ [2025-10-28 13:27:59] (step=0051000) Train Loss: 0.7314, Train Steps/Sec: 1.14
583
+ [2025-10-28 13:29:27] (step=0051100) Train Loss: 0.7307, Train Steps/Sec: 1.14
584
+ [2025-10-28 13:30:55] (step=0051200) Train Loss: 0.7322, Train Steps/Sec: 1.14
585
+ [2025-10-28 13:32:15] Beginning epoch 41...
586
+ [2025-10-28 13:32:28] (step=0051300) Train Loss: 0.7306, Train Steps/Sec: 1.08
587
+ [2025-10-28 13:33:55] (step=0051400) Train Loss: 0.7318, Train Steps/Sec: 1.14
588
+ [2025-10-28 13:35:23] (step=0051500) Train Loss: 0.7307, Train Steps/Sec: 1.14
589
+ [2025-10-28 13:36:51] (step=0051600) Train Loss: 0.7315, Train Steps/Sec: 1.14
590
+ [2025-10-28 13:38:19] (step=0051700) Train Loss: 0.7310, Train Steps/Sec: 1.14
591
+ [2025-10-28 13:39:47] (step=0051800) Train Loss: 0.7313, Train Steps/Sec: 1.13
592
+ [2025-10-28 13:41:15] (step=0051900) Train Loss: 0.7313, Train Steps/Sec: 1.14
593
+ [2025-10-28 13:42:43] (step=0052000) Train Loss: 0.7312, Train Steps/Sec: 1.14
594
+ [2025-10-28 13:44:11] (step=0052100) Train Loss: 0.7304, Train Steps/Sec: 1.14
595
+ [2025-10-28 13:45:39] (step=0052200) Train Loss: 0.7320, Train Steps/Sec: 1.14
596
+ [2025-10-28 13:47:06] (step=0052300) Train Loss: 0.7309, Train Steps/Sec: 1.14
597
+ [2025-10-28 13:48:34] (step=0052400) Train Loss: 0.7312, Train Steps/Sec: 1.14
598
+ [2025-10-28 13:50:02] (step=0052500) Train Loss: 0.7309, Train Steps/Sec: 1.14
599
+ [2025-10-28 13:50:39] Beginning epoch 42...
600
+ [2025-10-28 13:51:35] (step=0052600) Train Loss: 0.7302, Train Steps/Sec: 1.07
601
+ [2025-10-28 13:53:04] (step=0052700) Train Loss: 0.7312, Train Steps/Sec: 1.13
602
+ [2025-10-28 13:54:32] (step=0052800) Train Loss: 0.7304, Train Steps/Sec: 1.14
603
+ [2025-10-28 13:56:00] (step=0052900) Train Loss: 0.7307, Train Steps/Sec: 1.14
604
+ [2025-10-28 13:57:27] (step=0053000) Train Loss: 0.7308, Train Steps/Sec: 1.14
605
+ [2025-10-28 13:58:55] (step=0053100) Train Loss: 0.7292, Train Steps/Sec: 1.14
606
+ [2025-10-28 14:00:23] (step=0053200) Train Loss: 0.7314, Train Steps/Sec: 1.14
607
+ [2025-10-28 14:01:51] (step=0053300) Train Loss: 0.7302, Train Steps/Sec: 1.14
608
+ [2025-10-28 14:03:18] (step=0053400) Train Loss: 0.7293, Train Steps/Sec: 1.14
609
+ [2025-10-28 14:04:46] (step=0053500) Train Loss: 0.7302, Train Steps/Sec: 1.14
610
+ [2025-10-28 14:06:14] (step=0053600) Train Loss: 0.7303, Train Steps/Sec: 1.14
611
+ [2025-10-28 14:07:42] (step=0053700) Train Loss: 0.7297, Train Steps/Sec: 1.14
612
+ [2025-10-28 14:09:04] Beginning epoch 43...
613
+ [2025-10-28 14:09:15] (step=0053800) Train Loss: 0.7303, Train Steps/Sec: 1.07
614
+ [2025-10-28 14:10:43] (step=0053900) Train Loss: 0.7311, Train Steps/Sec: 1.14
615
+ [2025-10-28 14:12:11] (step=0054000) Train Loss: 0.7305, Train Steps/Sec: 1.14
616
+ [2025-10-28 14:13:39] (step=0054100) Train Loss: 0.7320, Train Steps/Sec: 1.14
617
+ [2025-10-28 14:15:07] (step=0054200) Train Loss: 0.7304, Train Steps/Sec: 1.14
618
+ [2025-10-28 14:16:34] (step=0054300) Train Loss: 0.7285, Train Steps/Sec: 1.14
619
+ [2025-10-28 14:18:03] (step=0054400) Train Loss: 0.7299, Train Steps/Sec: 1.13
620
+ [2025-10-28 14:19:31] (step=0054500) Train Loss: 0.7298, Train Steps/Sec: 1.14
621
+ [2025-10-28 14:20:59] (step=0054600) Train Loss: 0.7306, Train Steps/Sec: 1.14
622
+ [2025-10-28 14:22:26] (step=0054700) Train Loss: 0.7300, Train Steps/Sec: 1.14
623
+ [2025-10-28 14:23:54] (step=0054800) Train Loss: 0.7279, Train Steps/Sec: 1.14
624
+ [2025-10-28 14:25:22] (step=0054900) Train Loss: 0.7291, Train Steps/Sec: 1.14
625
+ [2025-10-28 14:26:50] (step=0055000) Train Loss: 0.7300, Train Steps/Sec: 1.14
626
+ [2025-10-28 14:27:29] Beginning epoch 44...
627
+ [2025-10-28 14:28:23] (step=0055100) Train Loss: 0.7297, Train Steps/Sec: 1.07
628
+ [2025-10-28 14:29:51] (step=0055200) Train Loss: 0.7303, Train Steps/Sec: 1.14
629
+ [2025-10-28 14:31:19] (step=0055300) Train Loss: 0.7300, Train Steps/Sec: 1.13
630
+ [2025-10-28 14:32:47] (step=0055400) Train Loss: 0.7304, Train Steps/Sec: 1.14
631
+ [2025-10-28 14:34:14] (step=0055500) Train Loss: 0.7295, Train Steps/Sec: 1.14
632
+ [2025-10-28 14:35:42] (step=0055600) Train Loss: 0.7289, Train Steps/Sec: 1.14
633
+ [2025-10-28 14:37:10] (step=0055700) Train Loss: 0.7296, Train Steps/Sec: 1.14
634
+ [2025-10-28 14:38:38] (step=0055800) Train Loss: 0.7302, Train Steps/Sec: 1.14
635
+ [2025-10-28 14:40:06] (step=0055900) Train Loss: 0.7294, Train Steps/Sec: 1.14
636
+ [2025-10-28 14:41:33] (step=0056000) Train Loss: 0.7298, Train Steps/Sec: 1.14
637
+ [2025-10-28 14:43:01] (step=0056100) Train Loss: 0.7295, Train Steps/Sec: 1.13
638
+ [2025-10-28 14:44:29] (step=0056200) Train Loss: 0.7288, Train Steps/Sec: 1.14
639
+ [2025-10-28 14:45:54] Beginning epoch 45...
640
+ [2025-10-28 14:46:03] (step=0056300) Train Loss: 0.7289, Train Steps/Sec: 1.07
641
+ [2025-10-28 14:47:30] (step=0056400) Train Loss: 0.7288, Train Steps/Sec: 1.14
642
+ [2025-10-28 14:48:58] (step=0056500) Train Loss: 0.7310, Train Steps/Sec: 1.14
643
+ [2025-10-28 14:50:26] (step=0056600) Train Loss: 0.7297, Train Steps/Sec: 1.14
644
+ [2025-10-28 14:51:54] (step=0056700) Train Loss: 0.7286, Train Steps/Sec: 1.14
645
+ [2025-10-28 14:53:22] (step=0056800) Train Loss: 0.7296, Train Steps/Sec: 1.14
646
+ [2025-10-28 14:54:50] (step=0056900) Train Loss: 0.7292, Train Steps/Sec: 1.14
647
+ [2025-10-28 14:56:18] (step=0057000) Train Loss: 0.7293, Train Steps/Sec: 1.13
648
+ [2025-10-28 14:57:46] (step=0057100) Train Loss: 0.7285, Train Steps/Sec: 1.14
649
+ [2025-10-28 14:59:14] (step=0057200) Train Loss: 0.7292, Train Steps/Sec: 1.14
650
+ [2025-10-28 15:00:42] (step=0057300) Train Loss: 0.7285, Train Steps/Sec: 1.14
651
+ [2025-10-28 15:02:09] (step=0057400) Train Loss: 0.7284, Train Steps/Sec: 1.14
652
+ [2025-10-28 15:03:37] (step=0057500) Train Loss: 0.7289, Train Steps/Sec: 1.14
653
+ [2025-10-28 15:04:18] Beginning epoch 46...
654
+ [2025-10-28 15:05:10] (step=0057600) Train Loss: 0.7287, Train Steps/Sec: 1.08
655
+ [2025-10-28 15:06:38] (step=0057700) Train Loss: 0.7290, Train Steps/Sec: 1.14
656
+ [2025-10-28 15:08:05] (step=0057800) Train Loss: 0.7285, Train Steps/Sec: 1.14
657
+ [2025-10-28 15:09:34] (step=0057900) Train Loss: 0.7286, Train Steps/Sec: 1.13
658
+ [2025-10-28 15:11:02] (step=0058000) Train Loss: 0.7290, Train Steps/Sec: 1.14
659
+ [2025-10-28 15:12:29] (step=0058100) Train Loss: 0.7288, Train Steps/Sec: 1.14
660
+ [2025-10-28 15:13:57] (step=0058200) Train Loss: 0.7284, Train Steps/Sec: 1.14
661
+ [2025-10-28 15:15:25] (step=0058300) Train Loss: 0.7297, Train Steps/Sec: 1.14
662
+ [2025-10-28 15:16:53] (step=0058400) Train Loss: 0.7302, Train Steps/Sec: 1.14
663
+ [2025-10-28 15:18:21] (step=0058500) Train Loss: 0.7296, Train Steps/Sec: 1.14
664
+ [2025-10-28 15:19:48] (step=0058600) Train Loss: 0.7277, Train Steps/Sec: 1.14
665
+ [2025-10-28 15:21:16] (step=0058700) Train Loss: 0.7290, Train Steps/Sec: 1.13
666
+ [2025-10-28 15:22:43] Beginning epoch 47...
667
+ [2025-10-28 15:22:50] (step=0058800) Train Loss: 0.7288, Train Steps/Sec: 1.07
668
+ [2025-10-28 15:24:18] (step=0058900) Train Loss: 0.7288, Train Steps/Sec: 1.14
669
+ [2025-10-28 15:25:46] (step=0059000) Train Loss: 0.7280, Train Steps/Sec: 1.14
670
+ [2025-10-28 15:27:13] (step=0059100) Train Loss: 0.7288, Train Steps/Sec: 1.14
671
+ [2025-10-28 15:28:41] (step=0059200) Train Loss: 0.7283, Train Steps/Sec: 1.14
672
+ [2025-10-28 15:30:09] (step=0059300) Train Loss: 0.7294, Train Steps/Sec: 1.14
673
+ [2025-10-28 15:31:37] (step=0059400) Train Loss: 0.7287, Train Steps/Sec: 1.14
674
+ [2025-10-28 15:33:05] (step=0059500) Train Loss: 0.7274, Train Steps/Sec: 1.14
675
+ [2025-10-28 15:34:33] (step=0059600) Train Loss: 0.7283, Train Steps/Sec: 1.13
676
+ [2025-10-28 15:36:01] (step=0059700) Train Loss: 0.7290, Train Steps/Sec: 1.14
677
+ [2025-10-28 15:37:29] (step=0059800) Train Loss: 0.7288, Train Steps/Sec: 1.14
678
+ [2025-10-28 15:38:56] (step=0059900) Train Loss: 0.7288, Train Steps/Sec: 1.14
679
+ [2025-10-28 15:40:24] (step=0060000) Train Loss: 0.7284, Train Steps/Sec: 1.14
680
+ [2025-10-28 15:41:07] Beginning epoch 48...
681
+ [2025-10-28 15:41:58] (step=0060100) Train Loss: 0.7288, Train Steps/Sec: 1.07
682
+ [2025-10-28 15:43:25] (step=0060200) Train Loss: 0.7266, Train Steps/Sec: 1.14
683
+ [2025-10-28 15:44:53] (step=0060300) Train Loss: 0.7290, Train Steps/Sec: 1.14
684
+ [2025-10-28 15:46:22] (step=0060400) Train Loss: 0.7287, Train Steps/Sec: 1.13
685
+ [2025-10-28 15:47:50] (step=0060500) Train Loss: 0.7286, Train Steps/Sec: 1.13
686
+ [2025-10-28 15:49:18] (step=0060600) Train Loss: 0.7280, Train Steps/Sec: 1.14
687
+ [2025-10-28 15:50:45] (step=0060700) Train Loss: 0.7277, Train Steps/Sec: 1.14
688
+ [2025-10-28 15:52:13] (step=0060800) Train Loss: 0.7276, Train Steps/Sec: 1.14
689
+ [2025-10-28 15:53:41] (step=0060900) Train Loss: 0.7272, Train Steps/Sec: 1.14
690
+ [2025-10-28 15:55:09] (step=0061000) Train Loss: 0.7288, Train Steps/Sec: 1.14
691
+ [2025-10-28 15:56:36] (step=0061100) Train Loss: 0.7287, Train Steps/Sec: 1.14
692
+ [2025-10-28 15:58:04] (step=0061200) Train Loss: 0.7288, Train Steps/Sec: 1.14
693
+ [2025-10-28 15:59:32] Beginning epoch 49...
694
+ [2025-10-28 15:59:38] (step=0061300) Train Loss: 0.7277, Train Steps/Sec: 1.07
695
+ [2025-10-28 16:01:05] (step=0061400) Train Loss: 0.7275, Train Steps/Sec: 1.14
696
+ [2025-10-28 16:02:33] (step=0061500) Train Loss: 0.7286, Train Steps/Sec: 1.14
697
+ [2025-10-28 16:04:01] (step=0061600) Train Loss: 0.7283, Train Steps/Sec: 1.14
698
+ [2025-10-28 16:05:29] (step=0061700) Train Loss: 0.7279, Train Steps/Sec: 1.14
699
+ [2025-10-28 16:06:56] (step=0061800) Train Loss: 0.7280, Train Steps/Sec: 1.14
700
+ [2025-10-28 16:08:24] (step=0061900) Train Loss: 0.7281, Train Steps/Sec: 1.14
701
+ [2025-10-28 16:09:52] (step=0062000) Train Loss: 0.7285, Train Steps/Sec: 1.14
702
+ [2025-10-28 16:11:20] (step=0062100) Train Loss: 0.7259, Train Steps/Sec: 1.14
703
+ [2025-10-28 16:12:48] (step=0062200) Train Loss: 0.7281, Train Steps/Sec: 1.13
704
+ [2025-10-28 16:14:16] (step=0062300) Train Loss: 0.7281, Train Steps/Sec: 1.14
705
+ [2025-10-28 16:15:44] (step=0062400) Train Loss: 0.7288, Train Steps/Sec: 1.14
706
+ [2025-10-28 16:17:12] (step=0062500) Train Loss: 0.7277, Train Steps/Sec: 1.14
707
+ [2025-10-28 16:17:56] Beginning epoch 50...
708
+ [2025-10-28 16:18:46] (step=0062600) Train Loss: 0.7281, Train Steps/Sec: 1.06
709
+ [2025-10-28 16:20:13] (step=0062700) Train Loss: 0.7262, Train Steps/Sec: 1.14
710
+ [2025-10-28 16:21:41] (step=0062800) Train Loss: 0.7288, Train Steps/Sec: 1.14
711
+ [2025-10-28 16:23:09] (step=0062900) Train Loss: 0.7269, Train Steps/Sec: 1.14
712
+ [2025-10-28 16:24:37] (step=0063000) Train Loss: 0.7289, Train Steps/Sec: 1.14
713
+ [2025-10-28 16:26:05] (step=0063100) Train Loss: 0.7277, Train Steps/Sec: 1.13
714
+ [2025-10-28 16:27:33] (step=0063200) Train Loss: 0.7269, Train Steps/Sec: 1.14
715
+ [2025-10-28 16:29:01] (step=0063300) Train Loss: 0.7261, Train Steps/Sec: 1.14
716
+ [2025-10-28 16:30:29] (step=0063400) Train Loss: 0.7268, Train Steps/Sec: 1.14
717
+ [2025-10-28 16:31:56] (step=0063500) Train Loss: 0.7273, Train Steps/Sec: 1.14
718
+ [2025-10-28 16:33:24] (step=0063600) Train Loss: 0.7280, Train Steps/Sec: 1.14
719
+ [2025-10-28 16:34:52] (step=0063700) Train Loss: 0.7277, Train Steps/Sec: 1.14
720
+ [2025-10-28 16:36:20] (step=0063800) Train Loss: 0.7276, Train Steps/Sec: 1.14
721
+ [2025-10-28 16:36:21] Beginning epoch 51...
722
+ [2025-10-28 16:37:53] (step=0063900) Train Loss: 0.7265, Train Steps/Sec: 1.07
723
+ [2025-10-28 16:39:21] (step=0064000) Train Loss: 0.7276, Train Steps/Sec: 1.14
724
+ [2025-10-28 16:40:49] (step=0064100) Train Loss: 0.7263, Train Steps/Sec: 1.14
725
+ [2025-10-28 16:42:17] (step=0064200) Train Loss: 0.7282, Train Steps/Sec: 1.14
726
+ [2025-10-28 16:43:45] (step=0064300) Train Loss: 0.7273, Train Steps/Sec: 1.14
727
+ [2025-10-28 16:45:13] (step=0064400) Train Loss: 0.7258, Train Steps/Sec: 1.14
728
+ [2025-10-28 16:46:40] (step=0064500) Train Loss: 0.7269, Train Steps/Sec: 1.14
729
+ [2025-10-28 16:48:08] (step=0064600) Train Loss: 0.7267, Train Steps/Sec: 1.14
730
+ [2025-10-28 16:49:36] (step=0064700) Train Loss: 0.7267, Train Steps/Sec: 1.14
731
+ [2025-10-28 16:51:04] (step=0064800) Train Loss: 0.7279, Train Steps/Sec: 1.14
732
+ [2025-10-28 16:52:32] (step=0064900) Train Loss: 0.7269, Train Steps/Sec: 1.14
733
+ [2025-10-28 16:54:00] (step=0065000) Train Loss: 0.7275, Train Steps/Sec: 1.14
734
+ [2025-10-28 16:54:46] Beginning epoch 52...
735
+ [2025-10-28 16:55:33] (step=0065100) Train Loss: 0.7276, Train Steps/Sec: 1.07
736
+ [2025-10-28 16:57:01] (step=0065200) Train Loss: 0.7264, Train Steps/Sec: 1.14
737
+ [2025-10-28 16:58:29] (step=0065300) Train Loss: 0.7278, Train Steps/Sec: 1.14
738
+ [2025-10-28 16:59:57] (step=0065400) Train Loss: 0.7278, Train Steps/Sec: 1.14
739
+ [2025-10-28 17:01:25] (step=0065500) Train Loss: 0.7266, Train Steps/Sec: 1.14
740
+ [2025-10-28 17:02:53] (step=0065600) Train Loss: 0.7271, Train Steps/Sec: 1.14
741
+ [2025-10-28 17:04:21] (step=0065700) Train Loss: 0.7286, Train Steps/Sec: 1.14
742
+ [2025-10-28 17:05:48] (step=0065800) Train Loss: 0.7273, Train Steps/Sec: 1.14
743
+ [2025-10-28 17:07:16] (step=0065900) Train Loss: 0.7271, Train Steps/Sec: 1.14
744
+ [2025-10-28 17:08:44] (step=0066000) Train Loss: 0.7273, Train Steps/Sec: 1.14
745
+ [2025-10-28 17:10:12] (step=0066100) Train Loss: 0.7269, Train Steps/Sec: 1.14
746
+ [2025-10-28 17:11:40] (step=0066200) Train Loss: 0.7269, Train Steps/Sec: 1.14
747
+ [2025-10-28 17:13:07] (step=0066300) Train Loss: 0.7267, Train Steps/Sec: 1.14
748
+ [2025-10-28 17:13:11] Beginning epoch 53...
749
+ [2025-10-28 17:14:41] (step=0066400) Train Loss: 0.7264, Train Steps/Sec: 1.07
750
+ [2025-10-28 17:16:09] (step=0066500) Train Loss: 0.7271, Train Steps/Sec: 1.13
751
+ [2025-10-28 17:17:37] (step=0066600) Train Loss: 0.7265, Train Steps/Sec: 1.14
752
+ [2025-10-28 17:19:05] (step=0066700) Train Loss: 0.7269, Train Steps/Sec: 1.14
753
+ [2025-10-28 17:20:33] (step=0066800) Train Loss: 0.7259, Train Steps/Sec: 1.14
754
+ [2025-10-28 17:22:01] (step=0066900) Train Loss: 0.7257, Train Steps/Sec: 1.14
755
+ [2025-10-28 17:23:29] (step=0067000) Train Loss: 0.7264, Train Steps/Sec: 1.14
756
+ [2025-10-28 17:24:56] (step=0067100) Train Loss: 0.7283, Train Steps/Sec: 1.14
757
+ [2025-10-28 17:26:24] (step=0067200) Train Loss: 0.7276, Train Steps/Sec: 1.14
758
+ [2025-10-28 17:27:52] (step=0067300) Train Loss: 0.7270, Train Steps/Sec: 1.14
759
+ [2025-10-28 17:29:20] (step=0067400) Train Loss: 0.7259, Train Steps/Sec: 1.14
760
+ [2025-10-28 17:30:48] (step=0067500) Train Loss: 0.7275, Train Steps/Sec: 1.14
761
+ [2025-10-28 17:31:36] Beginning epoch 54...
762
+ [2025-10-28 17:32:21] (step=0067600) Train Loss: 0.7259, Train Steps/Sec: 1.07
763
+ [2025-10-28 17:33:49] (step=0067700) Train Loss: 0.7270, Train Steps/Sec: 1.14
764
+ [2025-10-28 17:35:17] (step=0067800) Train Loss: 0.7265, Train Steps/Sec: 1.14
765
+ [2025-10-28 17:36:44] (step=0067900) Train Loss: 0.7274, Train Steps/Sec: 1.14
766
+ [2025-10-28 17:38:12] (step=0068000) Train Loss: 0.7273, Train Steps/Sec: 1.14
767
+ [2025-10-28 17:39:40] (step=0068100) Train Loss: 0.7278, Train Steps/Sec: 1.14
768
+ [2025-10-28 17:41:08] (step=0068200) Train Loss: 0.7276, Train Steps/Sec: 1.13
769
+ [2025-10-28 17:42:36] (step=0068300) Train Loss: 0.7258, Train Steps/Sec: 1.14
770
+ [2025-10-28 17:44:04] (step=0068400) Train Loss: 0.7265, Train Steps/Sec: 1.14
771
+ [2025-10-28 17:45:32] (step=0068500) Train Loss: 0.7260, Train Steps/Sec: 1.14
772
+ [2025-10-28 17:47:00] (step=0068600) Train Loss: 0.7259, Train Steps/Sec: 1.14
773
+ [2025-10-28 17:48:28] (step=0068700) Train Loss: 0.7269, Train Steps/Sec: 1.14
774
+ [2025-10-28 17:49:55] (step=0068800) Train Loss: 0.7258, Train Steps/Sec: 1.14
775
+ [2025-10-28 17:50:00] Beginning epoch 55...
776
+ [2025-10-28 17:51:29] (step=0068900) Train Loss: 0.7255, Train Steps/Sec: 1.07
777
+ [2025-10-28 17:52:57] (step=0069000) Train Loss: 0.7273, Train Steps/Sec: 1.14
778
+ [2025-10-28 17:54:25] (step=0069100) Train Loss: 0.7260, Train Steps/Sec: 1.13
779
+ [2025-10-28 17:55:53] (step=0069200) Train Loss: 0.7263, Train Steps/Sec: 1.14
780
+ [2025-10-28 17:57:21] (step=0069300) Train Loss: 0.7276, Train Steps/Sec: 1.14
781
+ [2025-10-28 17:58:49] (step=0069400) Train Loss: 0.7269, Train Steps/Sec: 1.14
782
+ [2025-10-28 18:00:17] (step=0069500) Train Loss: 0.7265, Train Steps/Sec: 1.14
783
+ [2025-10-28 18:01:44] (step=0069600) Train Loss: 0.7249, Train Steps/Sec: 1.14
784
+ [2025-10-28 18:03:12] (step=0069700) Train Loss: 0.7253, Train Steps/Sec: 1.14
785
+ [2025-10-28 18:04:40] (step=0069800) Train Loss: 0.7250, Train Steps/Sec: 1.14
786
+ [2025-10-28 18:06:08] (step=0069900) Train Loss: 0.7262, Train Steps/Sec: 1.14
787
+ [2025-10-28 18:07:36] (step=0070000) Train Loss: 0.7264, Train Steps/Sec: 1.13
788
+ [2025-10-28 18:08:26] Beginning epoch 56...
789
+ [2025-10-28 18:09:10] (step=0070100) Train Loss: 0.7259, Train Steps/Sec: 1.06
790
+ [2025-10-28 18:10:38] (step=0070200) Train Loss: 0.7263, Train Steps/Sec: 1.14
791
+ [2025-10-28 18:12:06] (step=0070300) Train Loss: 0.7253, Train Steps/Sec: 1.14
792
+ [2025-10-28 18:13:33] (step=0070400) Train Loss: 0.7246, Train Steps/Sec: 1.14
793
+ [2025-10-28 18:15:01] (step=0070500) Train Loss: 0.7248, Train Steps/Sec: 1.14
794
+ [2025-10-28 18:16:29] (step=0070600) Train Loss: 0.7255, Train Steps/Sec: 1.14
795
+ [2025-10-28 18:17:57] (step=0070700) Train Loss: 0.7274, Train Steps/Sec: 1.14
796
+ [2025-10-28 18:19:25] (step=0070800) Train Loss: 0.7259, Train Steps/Sec: 1.13
797
+ [2025-10-28 18:20:53] (step=0070900) Train Loss: 0.7268, Train Steps/Sec: 1.14
798
+ [2025-10-28 18:22:21] (step=0071000) Train Loss: 0.7253, Train Steps/Sec: 1.14
799
+ [2025-10-28 18:23:48] (step=0071100) Train Loss: 0.7266, Train Steps/Sec: 1.14
800
+ [2025-10-28 18:25:16] (step=0071200) Train Loss: 0.7255, Train Steps/Sec: 1.14
801
+ [2025-10-28 18:26:44] (step=0071300) Train Loss: 0.7262, Train Steps/Sec: 1.14
802
+ [2025-10-28 18:26:51] Beginning epoch 57...
803
+ [2025-10-28 18:28:17] (step=0071400) Train Loss: 0.7253, Train Steps/Sec: 1.07
804
+ [2025-10-28 18:29:45] (step=0071500) Train Loss: 0.7248, Train Steps/Sec: 1.14
805
+ [2025-10-28 18:31:13] (step=0071600) Train Loss: 0.7243, Train Steps/Sec: 1.14
806
+ [2025-10-28 18:32:41] (step=0071700) Train Loss: 0.7266, Train Steps/Sec: 1.13
807
+ [2025-10-28 18:34:09] (step=0071800) Train Loss: 0.7259, Train Steps/Sec: 1.14
808
+ [2025-10-28 18:35:37] (step=0071900) Train Loss: 0.7264, Train Steps/Sec: 1.14
809
+ [2025-10-28 18:37:05] (step=0072000) Train Loss: 0.7253, Train Steps/Sec: 1.14
810
+ [2025-10-28 18:38:32] (step=0072100) Train Loss: 0.7251, Train Steps/Sec: 1.14
811
+ [2025-10-28 18:40:00] (step=0072200) Train Loss: 0.7240, Train Steps/Sec: 1.14
812
+ [2025-10-28 18:41:28] (step=0072300) Train Loss: 0.7254, Train Steps/Sec: 1.14
813
+ [2025-10-28 18:42:56] (step=0072400) Train Loss: 0.7248, Train Steps/Sec: 1.14
814
+ [2025-10-28 18:44:23] (step=0072500) Train Loss: 0.7263, Train Steps/Sec: 1.14
815
+ [2025-10-28 18:45:15] Beginning epoch 58...
816
+ [2025-10-28 18:45:58] (step=0072600) Train Loss: 0.7248, Train Steps/Sec: 1.05
817
+ [2025-10-28 18:47:26] (step=0072700) Train Loss: 0.7260, Train Steps/Sec: 1.14
818
+ [2025-10-28 18:48:54] (step=0072800) Train Loss: 0.7268, Train Steps/Sec: 1.14
819
+ [2025-10-28 18:50:21] (step=0072900) Train Loss: 0.7253, Train Steps/Sec: 1.14
820
+ [2025-10-28 18:51:49] (step=0073000) Train Loss: 0.7246, Train Steps/Sec: 1.14
821
+ [2025-10-28 18:53:17] (step=0073100) Train Loss: 0.7258, Train Steps/Sec: 1.14
822
+ [2025-10-28 18:54:45] (step=0073200) Train Loss: 0.7255, Train Steps/Sec: 1.14
823
+ [2025-10-28 18:56:13] (step=0073300) Train Loss: 0.7242, Train Steps/Sec: 1.14
824
+ [2025-10-28 18:57:41] (step=0073400) Train Loss: 0.7255, Train Steps/Sec: 1.13
825
+ [2025-10-28 18:59:09] (step=0073500) Train Loss: 0.7243, Train Steps/Sec: 1.14
826
+ [2025-10-28 19:00:37] (step=0073600) Train Loss: 0.7254, Train Steps/Sec: 1.14
827
+ [2025-10-28 19:02:04] (step=0073700) Train Loss: 0.7247, Train Steps/Sec: 1.14
828
+ [2025-10-28 19:03:32] (step=0073800) Train Loss: 0.7252, Train Steps/Sec: 1.14
829
+ [2025-10-28 19:03:41] Beginning epoch 59...
830
+ [2025-10-28 19:05:05] (step=0073900) Train Loss: 0.7253, Train Steps/Sec: 1.07
831
+ [2025-10-28 19:06:33] (step=0074000) Train Loss: 0.7244, Train Steps/Sec: 1.14
832
+ [2025-10-28 19:08:01] (step=0074100) Train Loss: 0.7243, Train Steps/Sec: 1.14
833
+ [2025-10-28 19:09:29] (step=0074200) Train Loss: 0.7242, Train Steps/Sec: 1.14
834
+ [2025-10-28 19:10:57] (step=0074300) Train Loss: 0.7240, Train Steps/Sec: 1.13
835
+ [2025-10-28 19:12:25] (step=0074400) Train Loss: 0.7251, Train Steps/Sec: 1.14
836
+ [2025-10-28 19:13:52] (step=0074500) Train Loss: 0.7262, Train Steps/Sec: 1.14
837
+ [2025-10-28 19:15:20] (step=0074600) Train Loss: 0.7257, Train Steps/Sec: 1.14
838
+ [2025-10-28 19:16:48] (step=0074700) Train Loss: 0.7252, Train Steps/Sec: 1.14
839
+ [2025-10-28 19:18:16] (step=0074800) Train Loss: 0.7256, Train Steps/Sec: 1.14
840
+ [2025-10-28 19:19:44] (step=0074900) Train Loss: 0.7251, Train Steps/Sec: 1.14
841
+ [2025-10-28 19:21:11] (step=0075000) Train Loss: 0.7251, Train Steps/Sec: 1.14
842
+ [2025-10-28 19:22:03] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-pe-vit-g-bf16/checkpoints/0075000.pt
843
+ [2025-10-28 19:22:03] Generating EMA samples...
844
+ [2025-10-28 19:22:32] Generating EMA samples done.
845
+ [2025-10-28 19:23:25] Beginning epoch 60...
846
+ [2025-10-28 19:24:05] (step=0075100) Train Loss: 0.7244, Train Steps/Sec: 0.58
847
+ [2025-10-28 19:25:33] (step=0075200) Train Loss: 0.7250, Train Steps/Sec: 1.13
848
+ [2025-10-28 19:27:01] (step=0075300) Train Loss: 0.7249, Train Steps/Sec: 1.14
849
+ [2025-10-28 19:28:29] (step=0075400) Train Loss: 0.7244, Train Steps/Sec: 1.14
850
+ [2025-10-28 19:29:57] (step=0075500) Train Loss: 0.7261, Train Steps/Sec: 1.14
851
+ [2025-10-28 19:31:24] (step=0075600) Train Loss: 0.7244, Train Steps/Sec: 1.14
852
+ [2025-10-28 19:32:52] (step=0075700) Train Loss: 0.7251, Train Steps/Sec: 1.14
853
+ [2025-10-28 19:34:20] (step=0075800) Train Loss: 0.7262, Train Steps/Sec: 1.14
854
+ [2025-10-28 19:35:48] (step=0075900) Train Loss: 0.7241, Train Steps/Sec: 1.14
855
+ [2025-10-28 19:37:16] (step=0076000) Train Loss: 0.7244, Train Steps/Sec: 1.14
856
+ [2025-10-28 19:38:44] (step=0076100) Train Loss: 0.7253, Train Steps/Sec: 1.14
857
+ [2025-10-28 19:40:11] (step=0076200) Train Loss: 0.7248, Train Steps/Sec: 1.14
858
+ [2025-10-28 19:41:39] (step=0076300) Train Loss: 0.7251, Train Steps/Sec: 1.14
859
+ [2025-10-28 19:41:49] Beginning epoch 61...
860
+ [2025-10-28 19:43:13] (step=0076400) Train Loss: 0.7239, Train Steps/Sec: 1.06
861
+ [2025-10-28 19:44:41] (step=0076500) Train Loss: 0.7240, Train Steps/Sec: 1.14
862
+ [2025-10-28 19:46:09] (step=0076600) Train Loss: 0.7246, Train Steps/Sec: 1.14
863
+ [2025-10-28 19:47:37] (step=0076700) Train Loss: 0.7235, Train Steps/Sec: 1.14
864
+ [2025-10-28 19:49:05] (step=0076800) Train Loss: 0.7247, Train Steps/Sec: 1.13
865
+ [2025-10-28 19:50:33] (step=0076900) Train Loss: 0.7236, Train Steps/Sec: 1.13
866
+ [2025-10-28 19:52:01] (step=0077000) Train Loss: 0.7251, Train Steps/Sec: 1.14
867
+ [2025-10-28 19:53:29] (step=0077100) Train Loss: 0.7253, Train Steps/Sec: 1.14
868
+ [2025-10-28 19:54:57] (step=0077200) Train Loss: 0.7244, Train Steps/Sec: 1.14
869
+ [2025-10-28 19:56:24] (step=0077300) Train Loss: 0.7246, Train Steps/Sec: 1.14
870
+ [2025-10-28 19:57:52] (step=0077400) Train Loss: 0.7252, Train Steps/Sec: 1.14
871
+ [2025-10-28 19:59:20] (step=0077500) Train Loss: 0.7255, Train Steps/Sec: 1.14
872
+ [2025-10-28 20:00:15] Beginning epoch 62...
873
+ [2025-10-28 20:00:53] (step=0077600) Train Loss: 0.7245, Train Steps/Sec: 1.07
874
+ [2025-10-28 20:02:22] (step=0077700) Train Loss: 0.7236, Train Steps/Sec: 1.13
875
+ [2025-10-28 20:03:50] (step=0077800) Train Loss: 0.7241, Train Steps/Sec: 1.14
876
+ [2025-10-28 20:05:17] (step=0077900) Train Loss: 0.7243, Train Steps/Sec: 1.14
877
+ [2025-10-28 20:06:45] (step=0078000) Train Loss: 0.7239, Train Steps/Sec: 1.14
878
+ [2025-10-28 20:08:13] (step=0078100) Train Loss: 0.7246, Train Steps/Sec: 1.14
879
+ [2025-10-28 20:09:41] (step=0078200) Train Loss: 0.7235, Train Steps/Sec: 1.14
880
+ [2025-10-28 20:11:08] (step=0078300) Train Loss: 0.7251, Train Steps/Sec: 1.14
881
+ [2025-10-28 20:12:36] (step=0078400) Train Loss: 0.7235, Train Steps/Sec: 1.14
882
+ [2025-10-28 20:14:04] (step=0078500) Train Loss: 0.7242, Train Steps/Sec: 1.14
883
+ [2025-10-28 20:15:32] (step=0078600) Train Loss: 0.7238, Train Steps/Sec: 1.14
884
+ [2025-10-28 20:17:00] (step=0078700) Train Loss: 0.7248, Train Steps/Sec: 1.14
885
+ [2025-10-28 20:18:28] (step=0078800) Train Loss: 0.7246, Train Steps/Sec: 1.14
886
+ [2025-10-28 20:18:40] Beginning epoch 63...
887
+ [2025-10-28 20:20:01] (step=0078900) Train Loss: 0.7248, Train Steps/Sec: 1.07
888
+ [2025-10-28 20:21:29] (step=0079000) Train Loss: 0.7229, Train Steps/Sec: 1.14
889
+ [2025-10-28 20:22:57] (step=0079100) Train Loss: 0.7248, Train Steps/Sec: 1.14
890
+ [2025-10-28 20:24:24] (step=0079200) Train Loss: 0.7245, Train Steps/Sec: 1.14
891
+ [2025-10-28 20:25:52] (step=0079300) Train Loss: 0.7253, Train Steps/Sec: 1.14
892
+ [2025-10-28 20:27:20] (step=0079400) Train Loss: 0.7239, Train Steps/Sec: 1.13
893
+ [2025-10-28 20:28:48] (step=0079500) Train Loss: 0.7247, Train Steps/Sec: 1.14
894
+ [2025-10-28 20:30:16] (step=0079600) Train Loss: 0.7238, Train Steps/Sec: 1.14
895
+ [2025-10-28 20:31:44] (step=0079700) Train Loss: 0.7246, Train Steps/Sec: 1.14
896
+ [2025-10-28 20:33:11] (step=0079800) Train Loss: 0.7234, Train Steps/Sec: 1.14
897
+ [2025-10-28 20:34:39] (step=0079900) Train Loss: 0.7247, Train Steps/Sec: 1.14
898
+ [2025-10-28 20:36:07] (step=0080000) Train Loss: 0.7245, Train Steps/Sec: 1.14
899
+ [2025-10-28 20:37:04] Beginning epoch 64...
900
+ [2025-10-28 20:37:41] (step=0080100) Train Loss: 0.7233, Train Steps/Sec: 1.06
901
+ [2025-10-28 20:39:09] (step=0080200) Train Loss: 0.7226, Train Steps/Sec: 1.14
902
+ [2025-10-28 20:40:38] (step=0080300) Train Loss: 0.7237, Train Steps/Sec: 1.13
903
+ [2025-10-28 20:42:05] (step=0080400) Train Loss: 0.7251, Train Steps/Sec: 1.14
904
+ [2025-10-28 20:43:33] (step=0080500) Train Loss: 0.7235, Train Steps/Sec: 1.14
905
+ [2025-10-28 20:45:01] (step=0080600) Train Loss: 0.7233, Train Steps/Sec: 1.14
906
+ [2025-10-28 20:46:29] (step=0080700) Train Loss: 0.7226, Train Steps/Sec: 1.14
907
+ [2025-10-28 20:47:56] (step=0080800) Train Loss: 0.7240, Train Steps/Sec: 1.14
908
+ [2025-10-28 20:49:24] (step=0080900) Train Loss: 0.7228, Train Steps/Sec: 1.14
909
+ [2025-10-28 20:50:52] (step=0081000) Train Loss: 0.7239, Train Steps/Sec: 1.14
910
+ [2025-10-28 20:52:20] (step=0081100) Train Loss: 0.7248, Train Steps/Sec: 1.14
911
+ [2025-10-28 20:53:48] (step=0081200) Train Loss: 0.7237, Train Steps/Sec: 1.13
912
+ [2025-10-28 20:55:16] (step=0081300) Train Loss: 0.7237, Train Steps/Sec: 1.14
913
+ [2025-10-28 20:55:30] Beginning epoch 65...
914
+ [2025-10-28 20:56:49] (step=0081400) Train Loss: 0.7236, Train Steps/Sec: 1.07
915
+ [2025-10-28 20:58:17] (step=0081500) Train Loss: 0.7233, Train Steps/Sec: 1.14
916
+ [2025-10-28 20:59:45] (step=0081600) Train Loss: 0.7243, Train Steps/Sec: 1.14
917
+ [2025-10-28 21:01:12] (step=0081700) Train Loss: 0.7240, Train Steps/Sec: 1.14
918
+ [2025-10-28 21:02:40] (step=0081800) Train Loss: 0.7238, Train Steps/Sec: 1.14
919
+ [2025-10-28 21:04:08] (step=0081900) Train Loss: 0.7230, Train Steps/Sec: 1.14
920
+ [2025-10-28 21:05:36] (step=0082000) Train Loss: 0.7239, Train Steps/Sec: 1.13
921
+ [2025-10-28 21:07:04] (step=0082100) Train Loss: 0.7226, Train Steps/Sec: 1.14
922
+ [2025-10-28 21:08:32] (step=0082200) Train Loss: 0.7238, Train Steps/Sec: 1.14
923
+ [2025-10-28 21:10:00] (step=0082300) Train Loss: 0.7236, Train Steps/Sec: 1.14
924
+ [2025-10-28 21:11:27] (step=0082400) Train Loss: 0.7243, Train Steps/Sec: 1.14
925
+ [2025-10-28 21:12:55] (step=0082500) Train Loss: 0.7236, Train Steps/Sec: 1.14
926
+ [2025-10-28 21:13:54] Beginning epoch 66...
927
+ [2025-10-28 21:14:28] (step=0082600) Train Loss: 0.7232, Train Steps/Sec: 1.07
928
+ [2025-10-28 21:15:56] (step=0082700) Train Loss: 0.7215, Train Steps/Sec: 1.14
929
+ [2025-10-28 21:17:24] (step=0082800) Train Loss: 0.7233, Train Steps/Sec: 1.14
930
+ [2025-10-28 21:18:52] (step=0082900) Train Loss: 0.7233, Train Steps/Sec: 1.13
931
+ [2025-10-28 21:20:20] (step=0083000) Train Loss: 0.7228, Train Steps/Sec: 1.14
932
+ [2025-10-28 21:21:48] (step=0083100) Train Loss: 0.7223, Train Steps/Sec: 1.14
933
+ [2025-10-28 21:23:16] (step=0083200) Train Loss: 0.7238, Train Steps/Sec: 1.14
934
+ [2025-10-28 21:24:43] (step=0083300) Train Loss: 0.7241, Train Steps/Sec: 1.14
935
+ [2025-10-28 21:26:11] (step=0083400) Train Loss: 0.7242, Train Steps/Sec: 1.14
936
+ [2025-10-28 21:27:39] (step=0083500) Train Loss: 0.7237, Train Steps/Sec: 1.14
937
+ [2025-10-28 21:29:06] (step=0083600) Train Loss: 0.7236, Train Steps/Sec: 1.14
938
+ [2025-10-28 21:30:34] (step=0083700) Train Loss: 0.7229, Train Steps/Sec: 1.14
939
+ [2025-10-28 21:32:02] (step=0083800) Train Loss: 0.7237, Train Steps/Sec: 1.14
940
+ [2025-10-28 21:32:18] Beginning epoch 67...
941
+ [2025-10-28 21:33:35] (step=0083900) Train Loss: 0.7234, Train Steps/Sec: 1.08
942
+ [2025-10-28 21:35:03] (step=0084000) Train Loss: 0.7233, Train Steps/Sec: 1.14
943
+ [2025-10-28 21:36:31] (step=0084100) Train Loss: 0.7227, Train Steps/Sec: 1.14
944
+ [2025-10-28 21:37:59] (step=0084200) Train Loss: 0.7224, Train Steps/Sec: 1.14
945
+ [2025-10-28 21:39:26] (step=0084300) Train Loss: 0.7227, Train Steps/Sec: 1.14
946
+ [2025-10-28 21:40:54] (step=0084400) Train Loss: 0.7237, Train Steps/Sec: 1.14
947
+ [2025-10-28 21:42:22] (step=0084500) Train Loss: 0.7233, Train Steps/Sec: 1.14
948
+ [2025-10-28 21:43:50] (step=0084600) Train Loss: 0.7230, Train Steps/Sec: 1.13
949
+ [2025-10-28 21:45:18] (step=0084700) Train Loss: 0.7235, Train Steps/Sec: 1.14
950
+ [2025-10-28 21:46:46] (step=0084800) Train Loss: 0.7243, Train Steps/Sec: 1.14
951
+ [2025-10-28 21:48:14] (step=0084900) Train Loss: 0.7236, Train Steps/Sec: 1.14
952
+ [2025-10-28 21:49:41] (step=0085000) Train Loss: 0.7234, Train Steps/Sec: 1.14
953
+ [2025-10-28 21:50:42] Beginning epoch 68...
954
+ [2025-10-28 21:51:14] (step=0085100) Train Loss: 0.7233, Train Steps/Sec: 1.08
955
+ [2025-10-28 21:52:42] (step=0085200) Train Loss: 0.7236, Train Steps/Sec: 1.14
956
+ [2025-10-28 21:54:10] (step=0085300) Train Loss: 0.7233, Train Steps/Sec: 1.14
957
+ [2025-10-28 21:55:37] (step=0085400) Train Loss: 0.7224, Train Steps/Sec: 1.14
958
+ [2025-10-28 21:57:06] (step=0085500) Train Loss: 0.7227, Train Steps/Sec: 1.13
959
+ [2025-10-28 21:58:34] (step=0085600) Train Loss: 0.7238, Train Steps/Sec: 1.14
960
+ [2025-10-28 22:00:02] (step=0085700) Train Loss: 0.7233, Train Steps/Sec: 1.14
961
+ [2025-10-28 22:01:29] (step=0085800) Train Loss: 0.7240, Train Steps/Sec: 1.14
962
+ [2025-10-28 22:02:57] (step=0085900) Train Loss: 0.7221, Train Steps/Sec: 1.14
963
+ [2025-10-28 22:04:25] (step=0086000) Train Loss: 0.7217, Train Steps/Sec: 1.14
964
+ [2025-10-28 22:05:52] (step=0086100) Train Loss: 0.7225, Train Steps/Sec: 1.14
965
+ [2025-10-28 22:07:20] (step=0086200) Train Loss: 0.7221, Train Steps/Sec: 1.14
966
+ [2025-10-28 22:08:48] (step=0086300) Train Loss: 0.7229, Train Steps/Sec: 1.14
967
+ [2025-10-28 22:09:05] Beginning epoch 69...
968
+ [2025-10-28 22:10:22] (step=0086400) Train Loss: 0.7226, Train Steps/Sec: 1.07
969
+ [2025-10-28 22:11:49] (step=0086500) Train Loss: 0.7223, Train Steps/Sec: 1.14
970
+ [2025-10-28 22:13:17] (step=0086600) Train Loss: 0.7223, Train Steps/Sec: 1.14
971
+ [2025-10-28 22:14:45] (step=0086700) Train Loss: 0.7238, Train Steps/Sec: 1.14
972
+ [2025-10-28 22:16:13] (step=0086800) Train Loss: 0.7231, Train Steps/Sec: 1.14
973
+ [2025-10-28 22:17:41] (step=0086900) Train Loss: 0.7224, Train Steps/Sec: 1.14
974
+ [2025-10-28 22:19:08] (step=0087000) Train Loss: 0.7236, Train Steps/Sec: 1.14
975
+ [2025-10-28 22:20:36] (step=0087100) Train Loss: 0.7237, Train Steps/Sec: 1.14
976
+ [2025-10-28 22:22:05] (step=0087200) Train Loss: 0.7220, Train Steps/Sec: 1.13
977
+ [2025-10-28 22:23:32] (step=0087300) Train Loss: 0.7235, Train Steps/Sec: 1.14
978
+ [2025-10-28 22:25:00] (step=0087400) Train Loss: 0.7230, Train Steps/Sec: 1.14
979
+ [2025-10-28 22:26:28] (step=0087500) Train Loss: 0.7232, Train Steps/Sec: 1.14
980
+ [2025-10-28 22:27:30] Beginning epoch 70...
981
+ [2025-10-28 22:28:01] (step=0087600) Train Loss: 0.7236, Train Steps/Sec: 1.07
982
+ [2025-10-28 22:29:29] (step=0087700) Train Loss: 0.7223, Train Steps/Sec: 1.14
983
+ [2025-10-28 22:30:56] (step=0087800) Train Loss: 0.7220, Train Steps/Sec: 1.14
984
+ [2025-10-28 22:32:24] (step=0087900) Train Loss: 0.7232, Train Steps/Sec: 1.14
985
+ [2025-10-28 22:33:52] (step=0088000) Train Loss: 0.7227, Train Steps/Sec: 1.14
986
+ [2025-10-28 22:35:21] (step=0088100) Train Loss: 0.7219, Train Steps/Sec: 1.13
987
+ [2025-10-28 22:36:48] (step=0088200) Train Loss: 0.7225, Train Steps/Sec: 1.14
988
+ [2025-10-28 22:38:16] (step=0088300) Train Loss: 0.7229, Train Steps/Sec: 1.14
989
+ [2025-10-28 22:39:44] (step=0088400) Train Loss: 0.7234, Train Steps/Sec: 1.14
990
+ [2025-10-28 22:41:12] (step=0088500) Train Loss: 0.7229, Train Steps/Sec: 1.14