xingjianleng committed on
Commit 6a0ae0d · verified · 1 Parent(s): f1755ad

Upload folder using huggingface_hub

stage2/lightningdit-xl-pe-vit-b-bf16/checkpoints/0025000.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d540005ebc084282f849ffd7312f49840b358f21ca3813943180559d22701f71
+ size 19230431602
stage2/lightningdit-xl-pe-vit-b-bf16/checkpoints/0050000.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:335a051bc7bcf0b16bebf3b1fb9d8f5f8daba6c800bb4b970a416315021c2306
+ size 19230431602
stage2/lightningdit-xl-pe-vit-b-bf16/checkpoints/0075000.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5a641f46d4a6757186572e2433cf4548a8e9b6faf8820745adbb88597c1ead8e
+ size 19230431602
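
Each `.pt` entry above is a Git LFS pointer rather than the checkpoint itself: three `key value` lines giving the spec version, the SHA-256 object id, and the size in bytes. A minimal Python sketch of reading such a pointer follows. `parse_lfs_pointer` is a hypothetical helper written for illustration; it is not part of `huggingface_hub` or Git LFS.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file (illustrative helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


# Pointer contents copied from the 0025000.pt hunk above.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:d540005ebc084282f849ffd7312f49840b358f21ca3813943180559d22701f71
size 19230431602
"""

info = parse_lfs_pointer(pointer)
print(info["hash_algo"], f"{info['size_bytes'] / 1e9:.2f} GB")
```

All three checkpoints report the same size, which is what one would expect if each file stores the same set of tensors at a different training step.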
stage2/lightningdit-xl-pe-vit-b-bf16/log.txt ADDED
@@ -0,0 +1,1116 @@
+ [2025-10-27 23:30:09] Experiment directory created at results/stage2/hfdata/lightningdit-xl-pe-vit-b-bf16
+ [2025-10-27 23:30:15] Missing keys for loading vision encoder: []
+ [2025-10-27 23:30:15] Unexpected keys for loading vision encoder: []
+ [2025-10-27 23:30:30] Model Parameters: 1202.04M
+ [2025-10-27 23:30:33] Dataset contains 1,281,167 images (/scratch/xingjian.leng/data/train)
+ [2025-10-27 23:30:33] Gradient accumulation: steps=1, micro batch=128, per-GPU batch=128, global batch=1024
+ [2025-10-27 23:30:33] Precision mode: bf16
+ [2025-10-27 23:30:33] Training configured for 80 epochs, 1251 steps per epoch.
+ [2025-10-27 23:30:33] Optimizer: ADAMW with lr=0.0002, betas=(0.9, 0.95), weight_decay=0.0, eps=1e-08
+ Scheduler: linear with warmup_steps=0, decay_end_steps=0, final_lr=0.0002
+ [2025-10-27 23:30:33] Training for 80 epochs...
+ [2025-10-27 23:30:33] Beginning epoch 0...
+ [2025-10-27 23:30:39] Generating EMA samples...
+ [2025-10-27 23:31:08] Generating EMA samples done.
+ [2025-10-27 23:32:30] (step=0000100) Train Loss: 1.6477, Train Steps/Sec: 0.86
+ [2025-10-27 23:33:54] (step=0000200) Train Loss: 1.1862, Train Steps/Sec: 1.20
+ [2025-10-27 23:35:17] (step=0000300) Train Loss: 1.0226, Train Steps/Sec: 1.20
+ [2025-10-27 23:36:41] (step=0000400) Train Loss: 0.9462, Train Steps/Sec: 1.20
+ [2025-10-27 23:38:04] (step=0000500) Train Loss: 0.9013, Train Steps/Sec: 1.20
+ [2025-10-27 23:39:28] (step=0000600) Train Loss: 0.8715, Train Steps/Sec: 1.20
+ [2025-10-27 23:40:51] (step=0000700) Train Loss: 0.8490, Train Steps/Sec: 1.20
+ [2025-10-27 23:42:15] (step=0000800) Train Loss: 0.8317, Train Steps/Sec: 1.19
+ [2025-10-27 23:43:39] (step=0000900) Train Loss: 0.8191, Train Steps/Sec: 1.20
+ [2025-10-27 23:45:02] (step=0001000) Train Loss: 0.8082, Train Steps/Sec: 1.20
+ [2025-10-27 23:46:26] (step=0001100) Train Loss: 0.7983, Train Steps/Sec: 1.20
+ [2025-10-27 23:47:49] (step=0001200) Train Loss: 0.7907, Train Steps/Sec: 1.20
+ [2025-10-27 23:48:33] Beginning epoch 1...
+ [2025-10-27 23:49:16] (step=0001300) Train Loss: 0.7838, Train Steps/Sec: 1.16
+ [2025-10-27 23:50:39] (step=0001400) Train Loss: 0.7766, Train Steps/Sec: 1.20
+ [2025-10-27 23:52:03] (step=0001500) Train Loss: 0.7712, Train Steps/Sec: 1.20
+ [2025-10-27 23:53:26] (step=0001600) Train Loss: 0.7650, Train Steps/Sec: 1.20
+ [2025-10-27 23:54:50] (step=0001700) Train Loss: 0.7623, Train Steps/Sec: 1.20
+ [2025-10-27 23:56:14] (step=0001800) Train Loss: 0.7565, Train Steps/Sec: 1.20
+ [2025-10-28 00:02:06] Experiment directory created at results/stage2/hfdata/lightningdit-xl-pe-vit-b-bf16
+ [2025-10-28 00:02:10] Missing keys for loading vision encoder: []
+ [2025-10-28 00:02:10] Unexpected keys for loading vision encoder: []
+ [2025-10-28 00:02:25] Model Parameters: 1202.04M
+ [2025-10-28 00:02:30] Dataset contains 1,281,167 images (/scratch/xingjian.leng/data/train)
+ [2025-10-28 00:02:30] Gradient accumulation: steps=1, micro batch=128, per-GPU batch=128, global batch=1024
+ [2025-10-28 00:02:30] Precision mode: bf16
+ [2025-10-28 00:02:30] Training configured for 80 epochs, 1251 steps per epoch.
+ [2025-10-28 00:02:30] Optimizer: ADAMW with lr=0.0002, betas=(0.9, 0.95), weight_decay=0.0, eps=1e-08
+ Scheduler: linear with warmup_steps=0, decay_end_steps=0, final_lr=0.0002
+ [2025-10-28 00:02:30] Training for 80 epochs...
+ [2025-10-28 00:02:30] Beginning epoch 0...
+ [2025-10-28 00:02:36] Generating EMA samples...
+ [2025-10-28 00:03:04] Generating EMA samples done.
+ [2025-10-28 00:04:25] (step=0000100) Train Loss: 1.6477, Train Steps/Sec: 0.87
+ [2025-10-28 00:05:47] (step=0000200) Train Loss: 1.1862, Train Steps/Sec: 1.21
+ [2025-10-28 00:07:10] (step=0000300) Train Loss: 1.0226, Train Steps/Sec: 1.21
+ [2025-10-28 00:08:32] (step=0000400) Train Loss: 0.9462, Train Steps/Sec: 1.21
+ [2025-10-28 00:09:55] (step=0000500) Train Loss: 0.9013, Train Steps/Sec: 1.21
+ [2025-10-28 00:11:17] (step=0000600) Train Loss: 0.8715, Train Steps/Sec: 1.21
+ [2025-10-28 00:12:40] (step=0000700) Train Loss: 0.8490, Train Steps/Sec: 1.21
+ [2025-10-28 00:14:02] (step=0000800) Train Loss: 0.8317, Train Steps/Sec: 1.21
+ [2025-10-28 00:15:25] (step=0000900) Train Loss: 0.8191, Train Steps/Sec: 1.21
+ [2025-10-28 00:16:47] (step=0001000) Train Loss: 0.8082, Train Steps/Sec: 1.21
+ [2025-10-28 00:18:10] (step=0001100) Train Loss: 0.7983, Train Steps/Sec: 1.21
+ [2025-10-28 00:19:32] (step=0001200) Train Loss: 0.7907, Train Steps/Sec: 1.21
+ [2025-10-28 00:20:15] Beginning epoch 1...
+ [2025-10-28 00:20:57] (step=0001300) Train Loss: 0.7838, Train Steps/Sec: 1.18
+ [2025-10-28 00:22:19] (step=0001400) Train Loss: 0.7766, Train Steps/Sec: 1.22
+ [2025-10-28 00:23:41] (step=0001500) Train Loss: 0.7712, Train Steps/Sec: 1.21
+ [2025-10-28 00:25:04] (step=0001600) Train Loss: 0.7650, Train Steps/Sec: 1.21
+ [2025-10-28 00:26:26] (step=0001700) Train Loss: 0.7623, Train Steps/Sec: 1.21
+ [2025-10-28 00:27:49] (step=0001800) Train Loss: 0.7565, Train Steps/Sec: 1.21
+ [2025-10-28 00:29:11] (step=0001900) Train Loss: 0.7531, Train Steps/Sec: 1.21
+ [2025-10-28 00:30:33] (step=0002000) Train Loss: 0.7491, Train Steps/Sec: 1.21
+ [2025-10-28 00:31:56] (step=0002100) Train Loss: 0.7464, Train Steps/Sec: 1.21
+ [2025-10-28 00:33:18] (step=0002200) Train Loss: 0.7427, Train Steps/Sec: 1.21
+ [2025-10-28 00:34:41] (step=0002300) Train Loss: 0.7393, Train Steps/Sec: 1.21
+ [2025-10-28 00:36:04] (step=0002400) Train Loss: 0.7360, Train Steps/Sec: 1.21
+ [2025-10-28 00:37:26] (step=0002500) Train Loss: 0.7317, Train Steps/Sec: 1.21
+ [2025-10-28 00:37:28] Beginning epoch 2...
+ [2025-10-28 00:38:51] (step=0002600) Train Loss: 0.7300, Train Steps/Sec: 1.18
+ [2025-10-28 00:40:14] (step=0002700) Train Loss: 0.7280, Train Steps/Sec: 1.21
+ [2025-10-28 00:41:36] (step=0002800) Train Loss: 0.7255, Train Steps/Sec: 1.21
+ [2025-10-28 00:42:58] (step=0002900) Train Loss: 0.7245, Train Steps/Sec: 1.21
+ [2025-10-28 00:44:21] (step=0003000) Train Loss: 0.7225, Train Steps/Sec: 1.21
+ [2025-10-28 00:45:43] (step=0003100) Train Loss: 0.7195, Train Steps/Sec: 1.21
+ [2025-10-28 00:47:06] (step=0003200) Train Loss: 0.7184, Train Steps/Sec: 1.21
+ [2025-10-28 00:48:28] (step=0003300) Train Loss: 0.7160, Train Steps/Sec: 1.21
+ [2025-10-28 00:49:51] (step=0003400) Train Loss: 0.7134, Train Steps/Sec: 1.21
+ [2025-10-28 00:51:13] (step=0003500) Train Loss: 0.7117, Train Steps/Sec: 1.21
+ [2025-10-28 00:52:36] (step=0003600) Train Loss: 0.7094, Train Steps/Sec: 1.21
+ [2025-10-28 00:53:58] (step=0003700) Train Loss: 0.7091, Train Steps/Sec: 1.21
+ [2025-10-28 00:54:42] Beginning epoch 3...
+ [2025-10-28 00:55:23] (step=0003800) Train Loss: 0.7077, Train Steps/Sec: 1.18
+ [2025-10-28 00:56:45] (step=0003900) Train Loss: 0.7064, Train Steps/Sec: 1.21
+ [2025-10-28 00:58:08] (step=0004000) Train Loss: 0.7052, Train Steps/Sec: 1.21
+ [2025-10-28 00:59:31] (step=0004100) Train Loss: 0.7036, Train Steps/Sec: 1.21
+ [2025-10-28 01:00:53] (step=0004200) Train Loss: 0.7007, Train Steps/Sec: 1.21
+ [2025-10-28 01:02:15] (step=0004300) Train Loss: 0.7017, Train Steps/Sec: 1.21
+ [2025-10-28 01:03:38] (step=0004400) Train Loss: 0.6988, Train Steps/Sec: 1.21
+ [2025-10-28 01:05:00] (step=0004500) Train Loss: 0.6971, Train Steps/Sec: 1.21
+ [2025-10-28 01:06:23] (step=0004600) Train Loss: 0.6953, Train Steps/Sec: 1.21
+ [2025-10-28 01:07:45] (step=0004700) Train Loss: 0.6965, Train Steps/Sec: 1.21
+ [2025-10-28 01:09:07] (step=0004800) Train Loss: 0.6939, Train Steps/Sec: 1.21
+ [2025-10-28 01:10:30] (step=0004900) Train Loss: 0.6927, Train Steps/Sec: 1.21
+ [2025-10-28 01:11:52] (step=0005000) Train Loss: 0.6926, Train Steps/Sec: 1.21
+ [2025-10-28 01:11:56] Beginning epoch 4...
+ [2025-10-28 01:13:17] (step=0005100) Train Loss: 0.6900, Train Steps/Sec: 1.18
+ [2025-10-28 01:14:40] (step=0005200) Train Loss: 0.6909, Train Steps/Sec: 1.21
+ [2025-10-28 01:16:02] (step=0005300) Train Loss: 0.6888, Train Steps/Sec: 1.21
+ [2025-10-28 01:17:24] (step=0005400) Train Loss: 0.6877, Train Steps/Sec: 1.21
+ [2025-10-28 01:18:47] (step=0005500) Train Loss: 0.6860, Train Steps/Sec: 1.21
+ [2025-10-28 01:20:09] (step=0005600) Train Loss: 0.6870, Train Steps/Sec: 1.21
+ [2025-10-28 01:21:31] (step=0005700) Train Loss: 0.6853, Train Steps/Sec: 1.21
+ [2025-10-28 01:22:55] (step=0005800) Train Loss: 0.6841, Train Steps/Sec: 1.20
+ [2025-10-28 01:24:17] (step=0005900) Train Loss: 0.6824, Train Steps/Sec: 1.21
+ [2025-10-28 01:25:40] (step=0006000) Train Loss: 0.6811, Train Steps/Sec: 1.21
+ [2025-10-28 01:27:02] (step=0006100) Train Loss: 0.6819, Train Steps/Sec: 1.21
+ [2025-10-28 01:28:24] (step=0006200) Train Loss: 0.6799, Train Steps/Sec: 1.21
+ [2025-10-28 01:29:10] Beginning epoch 5...
+ [2025-10-28 01:29:49] (step=0006300) Train Loss: 0.6801, Train Steps/Sec: 1.18
+ [2025-10-28 01:31:12] (step=0006400) Train Loss: 0.6806, Train Steps/Sec: 1.21
+ [2025-10-28 01:32:34] (step=0006500) Train Loss: 0.6772, Train Steps/Sec: 1.21
+ [2025-10-28 01:33:56] (step=0006600) Train Loss: 0.6774, Train Steps/Sec: 1.21
+ [2025-10-28 01:35:19] (step=0006700) Train Loss: 0.6777, Train Steps/Sec: 1.21
+ [2025-10-28 01:36:41] (step=0006800) Train Loss: 0.6771, Train Steps/Sec: 1.21
+ [2025-10-28 01:38:03] (step=0006900) Train Loss: 0.6762, Train Steps/Sec: 1.21
+ [2025-10-28 01:39:26] (step=0007000) Train Loss: 0.6764, Train Steps/Sec: 1.21
+ [2025-10-28 01:40:48] (step=0007100) Train Loss: 0.6753, Train Steps/Sec: 1.21
+ [2025-10-28 01:42:11] (step=0007200) Train Loss: 0.6746, Train Steps/Sec: 1.21
+ [2025-10-28 01:43:33] (step=0007300) Train Loss: 0.6733, Train Steps/Sec: 1.21
+ [2025-10-28 01:44:56] (step=0007400) Train Loss: 0.6743, Train Steps/Sec: 1.20
+ [2025-10-28 01:46:19] (step=0007500) Train Loss: 0.6714, Train Steps/Sec: 1.21
+ [2025-10-28 01:46:24] Beginning epoch 6...
+ [2025-10-28 01:47:43] (step=0007600) Train Loss: 0.6729, Train Steps/Sec: 1.18
+ [2025-10-28 01:49:06] (step=0007700) Train Loss: 0.6715, Train Steps/Sec: 1.21
+ [2025-10-28 01:50:28] (step=0007800) Train Loss: 0.6712, Train Steps/Sec: 1.21
+ [2025-10-28 01:51:51] (step=0007900) Train Loss: 0.6715, Train Steps/Sec: 1.21
+ [2025-10-28 01:53:13] (step=0008000) Train Loss: 0.6691, Train Steps/Sec: 1.21
+ [2025-10-28 01:54:36] (step=0008100) Train Loss: 0.6698, Train Steps/Sec: 1.21
+ [2025-10-28 01:55:58] (step=0008200) Train Loss: 0.6685, Train Steps/Sec: 1.21
+ [2025-10-28 01:57:20] (step=0008300) Train Loss: 0.6680, Train Steps/Sec: 1.21
+ [2025-10-28 01:58:43] (step=0008400) Train Loss: 0.6667, Train Steps/Sec: 1.21
+ [2025-10-28 02:00:05] (step=0008500) Train Loss: 0.6666, Train Steps/Sec: 1.21
+ [2025-10-28 02:01:28] (step=0008600) Train Loss: 0.6672, Train Steps/Sec: 1.21
+ [2025-10-28 02:02:50] (step=0008700) Train Loss: 0.6673, Train Steps/Sec: 1.21
+ [2025-10-28 02:03:38] Beginning epoch 7...
+ [2025-10-28 02:04:15] (step=0008800) Train Loss: 0.6661, Train Steps/Sec: 1.18
+ [2025-10-28 02:05:37] (step=0008900) Train Loss: 0.6647, Train Steps/Sec: 1.21
+ [2025-10-28 02:07:00] (step=0009000) Train Loss: 0.6649, Train Steps/Sec: 1.21
+ [2025-10-28 02:08:23] (step=0009100) Train Loss: 0.6648, Train Steps/Sec: 1.20
+ [2025-10-28 02:09:45] (step=0009200) Train Loss: 0.6644, Train Steps/Sec: 1.21
+ [2025-10-28 02:11:08] (step=0009300) Train Loss: 0.6636, Train Steps/Sec: 1.21
+ [2025-10-28 02:12:30] (step=0009400) Train Loss: 0.6628, Train Steps/Sec: 1.21
+ [2025-10-28 02:13:52] (step=0009500) Train Loss: 0.6628, Train Steps/Sec: 1.21
+ [2025-10-28 02:15:15] (step=0009600) Train Loss: 0.6627, Train Steps/Sec: 1.21
+ [2025-10-28 02:16:37] (step=0009700) Train Loss: 0.6624, Train Steps/Sec: 1.21
+ [2025-10-28 02:18:00] (step=0009800) Train Loss: 0.6624, Train Steps/Sec: 1.21
+ [2025-10-28 02:19:22] (step=0009900) Train Loss: 0.6598, Train Steps/Sec: 1.21
+ [2025-10-28 02:20:45] (step=0010000) Train Loss: 0.6615, Train Steps/Sec: 1.21
+ [2025-10-28 02:20:52] Beginning epoch 8...
+ [2025-10-28 02:22:09] (step=0010100) Train Loss: 0.6600, Train Steps/Sec: 1.18
+ [2025-10-28 02:23:32] (step=0010200) Train Loss: 0.6603, Train Steps/Sec: 1.21
+ [2025-10-28 02:24:54] (step=0010300) Train Loss: 0.6591, Train Steps/Sec: 1.21
+ [2025-10-28 02:26:17] (step=0010400) Train Loss: 0.6594, Train Steps/Sec: 1.21
+ [2025-10-28 02:27:39] (step=0010500) Train Loss: 0.6585, Train Steps/Sec: 1.21
+ [2025-10-28 02:29:02] (step=0010600) Train Loss: 0.6587, Train Steps/Sec: 1.21
+ [2025-10-28 02:30:25] (step=0010700) Train Loss: 0.6591, Train Steps/Sec: 1.20
+ [2025-10-28 02:31:47] (step=0010800) Train Loss: 0.6591, Train Steps/Sec: 1.21
+ [2025-10-28 02:33:10] (step=0010900) Train Loss: 0.6580, Train Steps/Sec: 1.21
+ [2025-10-28 02:34:32] (step=0011000) Train Loss: 0.6575, Train Steps/Sec: 1.21
+ [2025-10-28 02:35:54] (step=0011100) Train Loss: 0.6566, Train Steps/Sec: 1.22
+ [2025-10-28 02:37:17] (step=0011200) Train Loss: 0.6579, Train Steps/Sec: 1.21
+ [2025-10-28 02:38:06] Beginning epoch 9...
+ [2025-10-28 02:38:42] (step=0011300) Train Loss: 0.6569, Train Steps/Sec: 1.18
+ [2025-10-28 02:40:04] (step=0011400) Train Loss: 0.6560, Train Steps/Sec: 1.21
+ [2025-10-28 02:41:26] (step=0011500) Train Loss: 0.6552, Train Steps/Sec: 1.21
+ [2025-10-28 02:42:49] (step=0011600) Train Loss: 0.6547, Train Steps/Sec: 1.21
+ [2025-10-28 02:44:11] (step=0011700) Train Loss: 0.6552, Train Steps/Sec: 1.21
+ [2025-10-28 02:45:34] (step=0011800) Train Loss: 0.6534, Train Steps/Sec: 1.21
+ [2025-10-28 02:46:56] (step=0011900) Train Loss: 0.6541, Train Steps/Sec: 1.21
+ [2025-10-28 02:48:18] (step=0012000) Train Loss: 0.6547, Train Steps/Sec: 1.22
+ [2025-10-28 02:49:41] (step=0012100) Train Loss: 0.6544, Train Steps/Sec: 1.22
+ [2025-10-28 02:51:03] (step=0012200) Train Loss: 0.6546, Train Steps/Sec: 1.21
+ [2025-10-28 02:52:26] (step=0012300) Train Loss: 0.6538, Train Steps/Sec: 1.21
+ [2025-10-28 02:53:49] (step=0012400) Train Loss: 0.6529, Train Steps/Sec: 1.20
+ [2025-10-28 02:55:11] (step=0012500) Train Loss: 0.6525, Train Steps/Sec: 1.21
+ [2025-10-28 02:55:20] Beginning epoch 10...
+ [2025-10-28 02:56:36] (step=0012600) Train Loss: 0.6523, Train Steps/Sec: 1.18
+ [2025-10-28 02:57:59] (step=0012700) Train Loss: 0.6537, Train Steps/Sec: 1.21
+ [2025-10-28 02:59:21] (step=0012800) Train Loss: 0.6525, Train Steps/Sec: 1.21
+ [2025-10-28 03:00:43] (step=0012900) Train Loss: 0.6511, Train Steps/Sec: 1.21
+ [2025-10-28 03:02:06] (step=0013000) Train Loss: 0.6519, Train Steps/Sec: 1.21
+ [2025-10-28 03:03:28] (step=0013100) Train Loss: 0.6505, Train Steps/Sec: 1.21
+ [2025-10-28 03:04:51] (step=0013200) Train Loss: 0.6521, Train Steps/Sec: 1.21
+ [2025-10-28 03:06:13] (step=0013300) Train Loss: 0.6519, Train Steps/Sec: 1.21
+ [2025-10-28 03:07:36] (step=0013400) Train Loss: 0.6498, Train Steps/Sec: 1.21
+ [2025-10-28 03:08:58] (step=0013500) Train Loss: 0.6515, Train Steps/Sec: 1.21
+ [2025-10-28 03:10:20] (step=0013600) Train Loss: 0.6507, Train Steps/Sec: 1.21
+ [2025-10-28 03:11:43] (step=0013700) Train Loss: 0.6495, Train Steps/Sec: 1.21
+ [2025-10-28 03:12:34] Beginning epoch 11...
+ [2025-10-28 03:13:08] (step=0013800) Train Loss: 0.6501, Train Steps/Sec: 1.18
+ [2025-10-28 03:14:30] (step=0013900) Train Loss: 0.6481, Train Steps/Sec: 1.21
+ [2025-10-28 03:15:52] (step=0014000) Train Loss: 0.6499, Train Steps/Sec: 1.22
+ [2025-10-28 03:17:16] (step=0014100) Train Loss: 0.6481, Train Steps/Sec: 1.20
+ [2025-10-28 03:18:38] (step=0014200) Train Loss: 0.6505, Train Steps/Sec: 1.21
+ [2025-10-28 03:20:00] (step=0014300) Train Loss: 0.6497, Train Steps/Sec: 1.21
+ [2025-10-28 03:21:23] (step=0014400) Train Loss: 0.6493, Train Steps/Sec: 1.21
+ [2025-10-28 03:22:45] (step=0014500) Train Loss: 0.6484, Train Steps/Sec: 1.21
+ [2025-10-28 03:24:08] (step=0014600) Train Loss: 0.6484, Train Steps/Sec: 1.21
+ [2025-10-28 03:25:30] (step=0014700) Train Loss: 0.6493, Train Steps/Sec: 1.21
+ [2025-10-28 03:26:53] (step=0014800) Train Loss: 0.6481, Train Steps/Sec: 1.21
+ [2025-10-28 03:28:15] (step=0014900) Train Loss: 0.6477, Train Steps/Sec: 1.21
+ [2025-10-28 03:29:37] (step=0015000) Train Loss: 0.6479, Train Steps/Sec: 1.21
+ [2025-10-28 03:29:48] Beginning epoch 12...
+ [2025-10-28 03:31:02] (step=0015100) Train Loss: 0.6482, Train Steps/Sec: 1.18
+ [2025-10-28 03:32:25] (step=0015200) Train Loss: 0.6472, Train Steps/Sec: 1.21
+ [2025-10-28 03:33:47] (step=0015300) Train Loss: 0.6471, Train Steps/Sec: 1.21
+ [2025-10-28 03:35:10] (step=0015400) Train Loss: 0.6464, Train Steps/Sec: 1.21
+ [2025-10-28 03:36:32] (step=0015500) Train Loss: 0.6457, Train Steps/Sec: 1.21
+ [2025-10-28 03:37:55] (step=0015600) Train Loss: 0.6466, Train Steps/Sec: 1.21
+ [2025-10-28 03:39:18] (step=0015700) Train Loss: 0.6462, Train Steps/Sec: 1.21
+ [2025-10-28 03:40:40] (step=0015800) Train Loss: 0.6451, Train Steps/Sec: 1.21
+ [2025-10-28 03:42:03] (step=0015900) Train Loss: 0.6442, Train Steps/Sec: 1.21
+ [2025-10-28 03:43:25] (step=0016000) Train Loss: 0.6450, Train Steps/Sec: 1.21
+ [2025-10-28 03:44:48] (step=0016100) Train Loss: 0.6447, Train Steps/Sec: 1.21
+ [2025-10-28 03:46:10] (step=0016200) Train Loss: 0.6436, Train Steps/Sec: 1.22
+ [2025-10-28 03:47:02] Beginning epoch 13...
+ [2025-10-28 03:47:34] (step=0016300) Train Loss: 0.6449, Train Steps/Sec: 1.18
+ [2025-10-28 03:48:57] (step=0016400) Train Loss: 0.6450, Train Steps/Sec: 1.21
+ [2025-10-28 03:50:19] (step=0016500) Train Loss: 0.6434, Train Steps/Sec: 1.21
+ [2025-10-28 03:51:42] (step=0016600) Train Loss: 0.6440, Train Steps/Sec: 1.21
+ [2025-10-28 03:53:04] (step=0016700) Train Loss: 0.6442, Train Steps/Sec: 1.21
+ [2025-10-28 03:54:27] (step=0016800) Train Loss: 0.6429, Train Steps/Sec: 1.21
+ [2025-10-28 03:55:49] (step=0016900) Train Loss: 0.6435, Train Steps/Sec: 1.22
+ [2025-10-28 03:57:11] (step=0017000) Train Loss: 0.6432, Train Steps/Sec: 1.21
+ [2025-10-28 03:58:34] (step=0017100) Train Loss: 0.6433, Train Steps/Sec: 1.21
+ [2025-10-28 03:59:56] (step=0017200) Train Loss: 0.6413, Train Steps/Sec: 1.22
+ [2025-10-28 04:01:18] (step=0017300) Train Loss: 0.6424, Train Steps/Sec: 1.21
+ [2025-10-28 04:02:42] (step=0017400) Train Loss: 0.6428, Train Steps/Sec: 1.20
+ [2025-10-28 04:04:04] (step=0017500) Train Loss: 0.6432, Train Steps/Sec: 1.21
+ [2025-10-28 04:04:16] Beginning epoch 14...
+ [2025-10-28 04:05:29] (step=0017600) Train Loss: 0.6411, Train Steps/Sec: 1.18
+ [2025-10-28 04:06:51] (step=0017700) Train Loss: 0.6429, Train Steps/Sec: 1.21
+ [2025-10-28 04:08:14] (step=0017800) Train Loss: 0.6421, Train Steps/Sec: 1.21
+ [2025-10-28 04:09:36] (step=0017900) Train Loss: 0.6427, Train Steps/Sec: 1.21
+ [2025-10-28 04:10:59] (step=0018000) Train Loss: 0.6417, Train Steps/Sec: 1.21
+ [2025-10-28 04:12:21] (step=0018100) Train Loss: 0.6415, Train Steps/Sec: 1.21
+ [2025-10-28 04:13:43] (step=0018200) Train Loss: 0.6418, Train Steps/Sec: 1.21
+ [2025-10-28 04:15:06] (step=0018300) Train Loss: 0.6434, Train Steps/Sec: 1.21
+ [2025-10-28 04:16:28] (step=0018400) Train Loss: 0.6419, Train Steps/Sec: 1.21
+ [2025-10-28 04:17:51] (step=0018500) Train Loss: 0.6404, Train Steps/Sec: 1.21
+ [2025-10-28 04:19:13] (step=0018600) Train Loss: 0.6427, Train Steps/Sec: 1.21
+ [2025-10-28 04:20:35] (step=0018700) Train Loss: 0.6413, Train Steps/Sec: 1.21
+ [2025-10-28 04:21:29] Beginning epoch 15...
+ [2025-10-28 04:22:00] (step=0018800) Train Loss: 0.6405, Train Steps/Sec: 1.18
+ [2025-10-28 04:23:23] (step=0018900) Train Loss: 0.6394, Train Steps/Sec: 1.21
+ [2025-10-28 04:24:45] (step=0019000) Train Loss: 0.6402, Train Steps/Sec: 1.21
+ [2025-10-28 04:26:08] (step=0019100) Train Loss: 0.6398, Train Steps/Sec: 1.20
+ [2025-10-28 04:27:31] (step=0019200) Train Loss: 0.6405, Train Steps/Sec: 1.21
+ [2025-10-28 04:28:53] (step=0019300) Train Loss: 0.6399, Train Steps/Sec: 1.21
+ [2025-10-28 04:30:15] (step=0019400) Train Loss: 0.6395, Train Steps/Sec: 1.21
+ [2025-10-28 04:31:38] (step=0019500) Train Loss: 0.6410, Train Steps/Sec: 1.21
+ [2025-10-28 04:33:00] (step=0019600) Train Loss: 0.6385, Train Steps/Sec: 1.21
+ [2025-10-28 04:34:23] (step=0019700) Train Loss: 0.6382, Train Steps/Sec: 1.21
+ [2025-10-28 04:35:45] (step=0019800) Train Loss: 0.6388, Train Steps/Sec: 1.21
+ [2025-10-28 04:37:07] (step=0019900) Train Loss: 0.6393, Train Steps/Sec: 1.21
+ [2025-10-28 04:38:30] (step=0020000) Train Loss: 0.6384, Train Steps/Sec: 1.21
+ [2025-10-28 04:38:43] Beginning epoch 16...
+ [2025-10-28 04:39:55] (step=0020100) Train Loss: 0.6384, Train Steps/Sec: 1.18
+ [2025-10-28 04:41:17] (step=0020200) Train Loss: 0.6388, Train Steps/Sec: 1.21
+ [2025-10-28 04:42:40] (step=0020300) Train Loss: 0.6393, Train Steps/Sec: 1.21
+ [2025-10-28 04:44:02] (step=0020400) Train Loss: 0.6385, Train Steps/Sec: 1.21
+ [2025-10-28 04:45:24] (step=0020500) Train Loss: 0.6374, Train Steps/Sec: 1.21
+ [2025-10-28 04:46:47] (step=0020600) Train Loss: 0.6392, Train Steps/Sec: 1.21
+ [2025-10-28 04:48:10] (step=0020700) Train Loss: 0.6385, Train Steps/Sec: 1.20
+ [2025-10-28 04:49:32] (step=0020800) Train Loss: 0.6385, Train Steps/Sec: 1.21
+ [2025-10-28 04:50:55] (step=0020900) Train Loss: 0.6379, Train Steps/Sec: 1.21
+ [2025-10-28 04:52:17] (step=0021000) Train Loss: 0.6372, Train Steps/Sec: 1.21
+ [2025-10-28 04:53:40] (step=0021100) Train Loss: 0.6362, Train Steps/Sec: 1.21
+ [2025-10-28 04:55:02] (step=0021200) Train Loss: 0.6377, Train Steps/Sec: 1.21
+ [2025-10-28 04:55:58] Beginning epoch 17...
+ [2025-10-28 04:56:27] (step=0021300) Train Loss: 0.6371, Train Steps/Sec: 1.18
+ [2025-10-28 04:57:49] (step=0021400) Train Loss: 0.6371, Train Steps/Sec: 1.22
+ [2025-10-28 04:59:12] (step=0021500) Train Loss: 0.6372, Train Steps/Sec: 1.21
+ [2025-10-28 05:00:34] (step=0021600) Train Loss: 0.6369, Train Steps/Sec: 1.21
+ [2025-10-28 05:01:57] (step=0021700) Train Loss: 0.6373, Train Steps/Sec: 1.21
+ [2025-10-28 05:03:19] (step=0021800) Train Loss: 0.6369, Train Steps/Sec: 1.21
+ [2025-10-28 05:04:41] (step=0021900) Train Loss: 0.6354, Train Steps/Sec: 1.21
+ [2025-10-28 05:06:04] (step=0022000) Train Loss: 0.6353, Train Steps/Sec: 1.21
+ [2025-10-28 05:07:26] (step=0022100) Train Loss: 0.6356, Train Steps/Sec: 1.21
+ [2025-10-28 05:08:49] (step=0022200) Train Loss: 0.6360, Train Steps/Sec: 1.21
+ [2025-10-28 05:10:11] (step=0022300) Train Loss: 0.6360, Train Steps/Sec: 1.21
+ [2025-10-28 05:11:34] (step=0022400) Train Loss: 0.6350, Train Steps/Sec: 1.20
+ [2025-10-28 05:12:57] (step=0022500) Train Loss: 0.6355, Train Steps/Sec: 1.21
+ [2025-10-28 05:13:12] Beginning epoch 18...
+ [2025-10-28 05:14:22] (step=0022600) Train Loss: 0.6352, Train Steps/Sec: 1.18
+ [2025-10-28 05:15:44] (step=0022700) Train Loss: 0.6345, Train Steps/Sec: 1.21
+ [2025-10-28 05:17:06] (step=0022800) Train Loss: 0.6364, Train Steps/Sec: 1.21
+ [2025-10-28 05:18:29] (step=0022900) Train Loss: 0.6339, Train Steps/Sec: 1.21
+ [2025-10-28 05:19:51] (step=0023000) Train Loss: 0.6338, Train Steps/Sec: 1.21
+ [2025-10-28 05:21:14] (step=0023100) Train Loss: 0.6343, Train Steps/Sec: 1.21
+ [2025-10-28 05:22:36] (step=0023200) Train Loss: 0.6341, Train Steps/Sec: 1.21
+ [2025-10-28 05:23:58] (step=0023300) Train Loss: 0.6340, Train Steps/Sec: 1.21
+ [2025-10-28 05:25:21] (step=0023400) Train Loss: 0.6348, Train Steps/Sec: 1.21
+ [2025-10-28 05:26:43] (step=0023500) Train Loss: 0.6334, Train Steps/Sec: 1.21
+ [2025-10-28 05:28:06] (step=0023600) Train Loss: 0.6349, Train Steps/Sec: 1.21
+ [2025-10-28 05:29:28] (step=0023700) Train Loss: 0.6343, Train Steps/Sec: 1.21
+ [2025-10-28 05:30:25] Beginning epoch 19...
+ [2025-10-28 05:30:53] (step=0023800) Train Loss: 0.6340, Train Steps/Sec: 1.18
+ [2025-10-28 05:32:15] (step=0023900) Train Loss: 0.6349, Train Steps/Sec: 1.21
+ [2025-10-28 05:33:38] (step=0024000) Train Loss: 0.6342, Train Steps/Sec: 1.21
+ [2025-10-28 05:35:01] (step=0024100) Train Loss: 0.6345, Train Steps/Sec: 1.21
+ [2025-10-28 05:36:23] (step=0024200) Train Loss: 0.6348, Train Steps/Sec: 1.21
+ [2025-10-28 05:37:46] (step=0024300) Train Loss: 0.6329, Train Steps/Sec: 1.21
+ [2025-10-28 05:39:08] (step=0024400) Train Loss: 0.6338, Train Steps/Sec: 1.21
+ [2025-10-28 05:40:30] (step=0024500) Train Loss: 0.6349, Train Steps/Sec: 1.21
+ [2025-10-28 05:41:53] (step=0024600) Train Loss: 0.6345, Train Steps/Sec: 1.21
+ [2025-10-28 05:43:15] (step=0024700) Train Loss: 0.6331, Train Steps/Sec: 1.21
+ [2025-10-28 05:44:38] (step=0024800) Train Loss: 0.6338, Train Steps/Sec: 1.21
+ [2025-10-28 05:46:00] (step=0024900) Train Loss: 0.6337, Train Steps/Sec: 1.21
+ [2025-10-28 05:47:22] (step=0025000) Train Loss: 0.6328, Train Steps/Sec: 1.21
+ [2025-10-28 05:48:19] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-pe-vit-b-bf16/checkpoints/0025000.pt
+ [2025-10-28 05:48:19] Generating EMA samples...
+ [2025-10-28 05:48:47] Generating EMA samples done.
+ [2025-10-28 05:49:04] Beginning epoch 20...
+ [2025-10-28 05:50:12] (step=0025100) Train Loss: 0.6327, Train Steps/Sec: 0.59
+ [2025-10-28 05:51:34] (step=0025200) Train Loss: 0.6322, Train Steps/Sec: 1.21
+ [2025-10-28 05:52:57] (step=0025300) Train Loss: 0.6323, Train Steps/Sec: 1.21
+ [2025-10-28 05:54:19] (step=0025400) Train Loss: 0.6328, Train Steps/Sec: 1.21
+ [2025-10-28 05:55:41] (step=0025500) Train Loss: 0.6334, Train Steps/Sec: 1.21
+ [2025-10-28 05:57:04] (step=0025600) Train Loss: 0.6334, Train Steps/Sec: 1.21
+ [2025-10-28 05:58:27] (step=0025700) Train Loss: 0.6322, Train Steps/Sec: 1.20
+ [2025-10-28 05:59:50] (step=0025800) Train Loss: 0.6326, Train Steps/Sec: 1.21
+ [2025-10-28 06:01:12] (step=0025900) Train Loss: 0.6308, Train Steps/Sec: 1.21
+ [2025-10-28 06:02:35] (step=0026000) Train Loss: 0.6322, Train Steps/Sec: 1.21
+ [2025-10-28 06:03:57] (step=0026100) Train Loss: 0.6315, Train Steps/Sec: 1.21
+ [2025-10-28 06:05:19] (step=0026200) Train Loss: 0.6327, Train Steps/Sec: 1.21
+ [2025-10-28 06:06:18] Beginning epoch 21...
+ [2025-10-28 06:06:44] (step=0026300) Train Loss: 0.6314, Train Steps/Sec: 1.18
+ [2025-10-28 06:08:07] (step=0026400) Train Loss: 0.6328, Train Steps/Sec: 1.21
+ [2025-10-28 06:09:29] (step=0026500) Train Loss: 0.6299, Train Steps/Sec: 1.21
+ [2025-10-28 06:10:51] (step=0026600) Train Loss: 0.6303, Train Steps/Sec: 1.21
+ [2025-10-28 06:12:14] (step=0026700) Train Loss: 0.6315, Train Steps/Sec: 1.21
+ [2025-10-28 06:13:36] (step=0026800) Train Loss: 0.6316, Train Steps/Sec: 1.21
+ [2025-10-28 06:14:59] (step=0026900) Train Loss: 0.6318, Train Steps/Sec: 1.21
+ [2025-10-28 06:16:21] (step=0027000) Train Loss: 0.6314, Train Steps/Sec: 1.21
+ [2025-10-28 06:17:44] (step=0027100) Train Loss: 0.6306, Train Steps/Sec: 1.21
+ [2025-10-28 06:19:06] (step=0027200) Train Loss: 0.6313, Train Steps/Sec: 1.21
+ [2025-10-28 06:20:29] (step=0027300) Train Loss: 0.6308, Train Steps/Sec: 1.21
+ [2025-10-28 06:21:52] (step=0027400) Train Loss: 0.6302, Train Steps/Sec: 1.20
+ [2025-10-28 06:23:14] (step=0027500) Train Loss: 0.6295, Train Steps/Sec: 1.21
+ [2025-10-28 06:23:33] Beginning epoch 22...
+ [2025-10-28 06:24:39] (step=0027600) Train Loss: 0.6300, Train Steps/Sec: 1.18
+ [2025-10-28 06:26:01] (step=0027700) Train Loss: 0.6295, Train Steps/Sec: 1.21
+ [2025-10-28 06:27:24] (step=0027800) Train Loss: 0.6311, Train Steps/Sec: 1.21
+ [2025-10-28 06:28:46] (step=0027900) Train Loss: 0.6313, Train Steps/Sec: 1.21
+ [2025-10-28 06:30:08] (step=0028000) Train Loss: 0.6301, Train Steps/Sec: 1.21
+ [2025-10-28 06:31:31] (step=0028100) Train Loss: 0.6304, Train Steps/Sec: 1.21
+ [2025-10-28 06:32:53] (step=0028200) Train Loss: 0.6300, Train Steps/Sec: 1.21
355
+ [2025-10-28 06:34:16] (step=0028300) Train Loss: 0.6310, Train Steps/Sec: 1.21
356
+ [2025-10-28 06:35:38] (step=0028400) Train Loss: 0.6282, Train Steps/Sec: 1.21
357
+ [2025-10-28 06:37:00] (step=0028500) Train Loss: 0.6297, Train Steps/Sec: 1.21
358
+ [2025-10-28 06:38:23] (step=0028600) Train Loss: 0.6297, Train Steps/Sec: 1.21
359
+ [2025-10-28 06:39:45] (step=0028700) Train Loss: 0.6284, Train Steps/Sec: 1.21
360
+ [2025-10-28 06:40:46] Beginning epoch 23...
361
+ [2025-10-28 06:41:10] (step=0028800) Train Loss: 0.6292, Train Steps/Sec: 1.18
362
+ [2025-10-28 06:42:33] (step=0028900) Train Loss: 0.6311, Train Steps/Sec: 1.21
363
+ [2025-10-28 06:43:56] (step=0029000) Train Loss: 0.6288, Train Steps/Sec: 1.21
364
+ [2025-10-28 06:45:18] (step=0029100) Train Loss: 0.6291, Train Steps/Sec: 1.21
365
+ [2025-10-28 06:46:41] (step=0029200) Train Loss: 0.6283, Train Steps/Sec: 1.21
366
+ [2025-10-28 06:48:03] (step=0029300) Train Loss: 0.6283, Train Steps/Sec: 1.21
367
+ [2025-10-28 06:49:26] (step=0029400) Train Loss: 0.6271, Train Steps/Sec: 1.21
368
+ [2025-10-28 06:50:48] (step=0029500) Train Loss: 0.6297, Train Steps/Sec: 1.21
369
+ [2025-10-28 06:52:10] (step=0029600) Train Loss: 0.6289, Train Steps/Sec: 1.21
370
+ [2025-10-28 06:53:33] (step=0029700) Train Loss: 0.6300, Train Steps/Sec: 1.21
371
+ [2025-10-28 06:54:55] (step=0029800) Train Loss: 0.6293, Train Steps/Sec: 1.21
372
+ [2025-10-28 06:56:18] (step=0029900) Train Loss: 0.6299, Train Steps/Sec: 1.21
373
+ [2025-10-28 06:57:40] (step=0030000) Train Loss: 0.6288, Train Steps/Sec: 1.21
374
+ [2025-10-28 06:58:00] Beginning epoch 24...
375
+ [2025-10-28 06:59:05] (step=0030100) Train Loss: 0.6292, Train Steps/Sec: 1.17
376
+ [2025-10-28 07:00:28] (step=0030200) Train Loss: 0.6285, Train Steps/Sec: 1.21
377
+ [2025-10-28 07:01:50] (step=0030300) Train Loss: 0.6278, Train Steps/Sec: 1.21
378
+ [2025-10-28 07:03:12] (step=0030400) Train Loss: 0.6280, Train Steps/Sec: 1.21
379
+ [2025-10-28 07:04:35] (step=0030500) Train Loss: 0.6285, Train Steps/Sec: 1.21
380
+ [2025-10-28 07:05:57] (step=0030600) Train Loss: 0.6271, Train Steps/Sec: 1.21
381
+ [2025-10-28 07:07:20] (step=0030700) Train Loss: 0.6280, Train Steps/Sec: 1.20
382
+ [2025-10-28 07:08:43] (step=0030800) Train Loss: 0.6284, Train Steps/Sec: 1.21
383
+ [2025-10-28 07:10:05] (step=0030900) Train Loss: 0.6280, Train Steps/Sec: 1.22
384
+ [2025-10-28 07:11:27] (step=0031000) Train Loss: 0.6273, Train Steps/Sec: 1.22
385
+ [2025-10-28 07:12:50] (step=0031100) Train Loss: 0.6289, Train Steps/Sec: 1.21
386
+ [2025-10-28 07:14:12] (step=0031200) Train Loss: 0.6271, Train Steps/Sec: 1.21
387
+ [2025-10-28 07:15:14] Beginning epoch 25...
388
+ [2025-10-28 07:15:37] (step=0031300) Train Loss: 0.6275, Train Steps/Sec: 1.18
389
+ [2025-10-28 07:17:00] (step=0031400) Train Loss: 0.6274, Train Steps/Sec: 1.21
390
+ [2025-10-28 07:18:22] (step=0031500) Train Loss: 0.6270, Train Steps/Sec: 1.21
391
+ [2025-10-28 07:19:44] (step=0031600) Train Loss: 0.6268, Train Steps/Sec: 1.21
392
+ [2025-10-28 07:21:07] (step=0031700) Train Loss: 0.6258, Train Steps/Sec: 1.21
393
+ [2025-10-28 07:22:29] (step=0031800) Train Loss: 0.6275, Train Steps/Sec: 1.21
394
+ [2025-10-28 07:23:51] (step=0031900) Train Loss: 0.6259, Train Steps/Sec: 1.21
395
+ [2025-10-28 07:25:14] (step=0032000) Train Loss: 0.6269, Train Steps/Sec: 1.21
396
+ [2025-10-28 07:26:36] (step=0032100) Train Loss: 0.6261, Train Steps/Sec: 1.21
397
+ [2025-10-28 07:27:59] (step=0032200) Train Loss: 0.6265, Train Steps/Sec: 1.21
398
+ [2025-10-28 07:29:21] (step=0032300) Train Loss: 0.6273, Train Steps/Sec: 1.21
399
+ [2025-10-28 07:30:44] (step=0032400) Train Loss: 0.6262, Train Steps/Sec: 1.20
400
+ [2025-10-28 07:32:07] (step=0032500) Train Loss: 0.6270, Train Steps/Sec: 1.22
401
+ [2025-10-28 07:32:29] Beginning epoch 26...
402
+ [2025-10-28 07:33:31] (step=0032600) Train Loss: 0.6261, Train Steps/Sec: 1.18
403
+ [2025-10-28 07:34:54] (step=0032700) Train Loss: 0.6258, Train Steps/Sec: 1.21
404
+ [2025-10-28 07:36:16] (step=0032800) Train Loss: 0.6261, Train Steps/Sec: 1.21
405
+ [2025-10-28 07:37:39] (step=0032900) Train Loss: 0.6261, Train Steps/Sec: 1.21
406
+ [2025-10-28 07:39:01] (step=0033000) Train Loss: 0.6277, Train Steps/Sec: 1.21
407
+ [2025-10-28 07:40:24] (step=0033100) Train Loss: 0.6260, Train Steps/Sec: 1.21
408
+ [2025-10-28 07:41:46] (step=0033200) Train Loss: 0.6250, Train Steps/Sec: 1.21
409
+ [2025-10-28 07:43:08] (step=0033300) Train Loss: 0.6261, Train Steps/Sec: 1.21
410
+ [2025-10-28 07:44:31] (step=0033400) Train Loss: 0.6272, Train Steps/Sec: 1.21
411
+ [2025-10-28 07:45:53] (step=0033500) Train Loss: 0.6256, Train Steps/Sec: 1.21
412
+ [2025-10-28 07:47:15] (step=0033600) Train Loss: 0.6264, Train Steps/Sec: 1.21
413
+ [2025-10-28 07:48:38] (step=0033700) Train Loss: 0.6268, Train Steps/Sec: 1.21
414
+ [2025-10-28 07:49:42] Beginning epoch 27...
415
+ [2025-10-28 07:50:03] (step=0033800) Train Loss: 0.6250, Train Steps/Sec: 1.18
416
+ [2025-10-28 07:51:25] (step=0033900) Train Loss: 0.6241, Train Steps/Sec: 1.21
417
+ [2025-10-28 07:52:48] (step=0034000) Train Loss: 0.6246, Train Steps/Sec: 1.20
418
+ [2025-10-28 07:54:11] (step=0034100) Train Loss: 0.6249, Train Steps/Sec: 1.21
419
+ [2025-10-28 07:55:34] (step=0034200) Train Loss: 0.6244, Train Steps/Sec: 1.21
420
+ [2025-10-28 07:56:56] (step=0034300) Train Loss: 0.6250, Train Steps/Sec: 1.21
421
+ [2025-10-28 07:58:18] (step=0034400) Train Loss: 0.6258, Train Steps/Sec: 1.21
422
+ [2025-10-28 07:59:41] (step=0034500) Train Loss: 0.6240, Train Steps/Sec: 1.21
423
+ [2025-10-28 08:01:03] (step=0034600) Train Loss: 0.6254, Train Steps/Sec: 1.22
424
+ [2025-10-28 08:02:25] (step=0034700) Train Loss: 0.6247, Train Steps/Sec: 1.21
425
+ [2025-10-28 08:03:48] (step=0034800) Train Loss: 0.6245, Train Steps/Sec: 1.21
426
+ [2025-10-28 08:05:10] (step=0034900) Train Loss: 0.6244, Train Steps/Sec: 1.21
427
+ [2025-10-28 08:06:32] (step=0035000) Train Loss: 0.6252, Train Steps/Sec: 1.21
428
+ [2025-10-28 08:06:56] Beginning epoch 28...
429
+ [2025-10-28 08:07:57] (step=0035100) Train Loss: 0.6232, Train Steps/Sec: 1.18
430
+ [2025-10-28 08:09:19] (step=0035200) Train Loss: 0.6235, Train Steps/Sec: 1.21
431
+ [2025-10-28 08:10:42] (step=0035300) Train Loss: 0.6239, Train Steps/Sec: 1.21
432
+ [2025-10-28 08:12:04] (step=0035400) Train Loss: 0.6249, Train Steps/Sec: 1.21
433
+ [2025-10-28 08:13:27] (step=0035500) Train Loss: 0.6226, Train Steps/Sec: 1.21
434
+ [2025-10-28 08:14:49] (step=0035600) Train Loss: 0.6226, Train Steps/Sec: 1.21
435
+ [2025-10-28 08:16:12] (step=0035700) Train Loss: 0.6252, Train Steps/Sec: 1.21
436
+ [2025-10-28 08:17:35] (step=0035800) Train Loss: 0.6251, Train Steps/Sec: 1.21
437
+ [2025-10-28 08:18:57] (step=0035900) Train Loss: 0.6238, Train Steps/Sec: 1.21
438
+ [2025-10-28 08:20:19] (step=0036000) Train Loss: 0.6237, Train Steps/Sec: 1.21
439
+ [2025-10-28 08:21:42] (step=0036100) Train Loss: 0.6239, Train Steps/Sec: 1.21
440
+ [2025-10-28 08:23:04] (step=0036200) Train Loss: 0.6239, Train Steps/Sec: 1.21
441
+ [2025-10-28 08:24:10] Beginning epoch 29...
442
+ [2025-10-28 08:24:29] (step=0036300) Train Loss: 0.6241, Train Steps/Sec: 1.18
443
+ [2025-10-28 08:25:51] (step=0036400) Train Loss: 0.6250, Train Steps/Sec: 1.21
444
+ [2025-10-28 08:27:14] (step=0036500) Train Loss: 0.6238, Train Steps/Sec: 1.21
445
+ [2025-10-28 08:28:36] (step=0036600) Train Loss: 0.6237, Train Steps/Sec: 1.21
446
+ [2025-10-28 08:29:58] (step=0036700) Train Loss: 0.6243, Train Steps/Sec: 1.21
447
+ [2025-10-28 08:31:21] (step=0036800) Train Loss: 0.6229, Train Steps/Sec: 1.21
448
+ [2025-10-28 08:32:43] (step=0036900) Train Loss: 0.6244, Train Steps/Sec: 1.21
449
+ [2025-10-28 08:34:05] (step=0037000) Train Loss: 0.6225, Train Steps/Sec: 1.21
450
+ [2025-10-28 08:35:28] (step=0037100) Train Loss: 0.6230, Train Steps/Sec: 1.21
451
+ [2025-10-28 08:36:50] (step=0037200) Train Loss: 0.6239, Train Steps/Sec: 1.22
452
+ [2025-10-28 08:38:13] (step=0037300) Train Loss: 0.6235, Train Steps/Sec: 1.21
453
+ [2025-10-28 08:39:36] (step=0037400) Train Loss: 0.6227, Train Steps/Sec: 1.21
454
+ [2025-10-28 08:40:58] (step=0037500) Train Loss: 0.6235, Train Steps/Sec: 1.22
455
+ [2025-10-28 08:41:23] Beginning epoch 30...
456
+ [2025-10-28 08:42:23] (step=0037600) Train Loss: 0.6230, Train Steps/Sec: 1.18
457
+ [2025-10-28 08:43:45] (step=0037700) Train Loss: 0.6229, Train Steps/Sec: 1.21
458
+ [2025-10-28 08:45:08] (step=0037800) Train Loss: 0.6235, Train Steps/Sec: 1.21
459
+ [2025-10-28 08:46:30] (step=0037900) Train Loss: 0.6227, Train Steps/Sec: 1.21
460
+ [2025-10-28 08:47:52] (step=0038000) Train Loss: 0.6214, Train Steps/Sec: 1.21
461
+ [2025-10-28 08:49:15] (step=0038100) Train Loss: 0.6211, Train Steps/Sec: 1.21
462
+ [2025-10-28 08:50:37] (step=0038200) Train Loss: 0.6245, Train Steps/Sec: 1.21
463
+ [2025-10-28 08:51:59] (step=0038300) Train Loss: 0.6220, Train Steps/Sec: 1.21
464
+ [2025-10-28 08:53:22] (step=0038400) Train Loss: 0.6226, Train Steps/Sec: 1.21
465
+ [2025-10-28 08:54:44] (step=0038500) Train Loss: 0.6218, Train Steps/Sec: 1.21
466
+ [2025-10-28 08:56:07] (step=0038600) Train Loss: 0.6220, Train Steps/Sec: 1.21
467
+ [2025-10-28 08:57:29] (step=0038700) Train Loss: 0.6230, Train Steps/Sec: 1.21
468
+ [2025-10-28 08:58:36] Beginning epoch 31...
469
+ [2025-10-28 08:58:54] (step=0038800) Train Loss: 0.6215, Train Steps/Sec: 1.18
470
+ [2025-10-28 09:00:16] (step=0038900) Train Loss: 0.6223, Train Steps/Sec: 1.21
471
+ [2025-10-28 09:01:39] (step=0039000) Train Loss: 0.6210, Train Steps/Sec: 1.20
472
+ [2025-10-28 09:03:02] (step=0039100) Train Loss: 0.6221, Train Steps/Sec: 1.21
473
+ [2025-10-28 09:04:24] (step=0039200) Train Loss: 0.6213, Train Steps/Sec: 1.21
474
+ [2025-10-28 09:05:47] (step=0039300) Train Loss: 0.6229, Train Steps/Sec: 1.21
475
+ [2025-10-28 09:07:09] (step=0039400) Train Loss: 0.6219, Train Steps/Sec: 1.21
476
+ [2025-10-28 09:08:31] (step=0039500) Train Loss: 0.6222, Train Steps/Sec: 1.21
477
+ [2025-10-28 09:09:54] (step=0039600) Train Loss: 0.6225, Train Steps/Sec: 1.21
478
+ [2025-10-28 09:11:16] (step=0039700) Train Loss: 0.6218, Train Steps/Sec: 1.21
479
+ [2025-10-28 09:12:39] (step=0039800) Train Loss: 0.6211, Train Steps/Sec: 1.21
480
+ [2025-10-28 09:14:01] (step=0039900) Train Loss: 0.6219, Train Steps/Sec: 1.21
481
+ [2025-10-28 09:15:23] (step=0040000) Train Loss: 0.6215, Train Steps/Sec: 1.21
482
+ [2025-10-28 09:15:50] Beginning epoch 32...
483
+ [2025-10-28 09:16:48] (step=0040100) Train Loss: 0.6213, Train Steps/Sec: 1.18
484
+ [2025-10-28 09:18:11] (step=0040200) Train Loss: 0.6216, Train Steps/Sec: 1.21
485
+ [2025-10-28 09:19:33] (step=0040300) Train Loss: 0.6215, Train Steps/Sec: 1.22
486
+ [2025-10-28 09:20:55] (step=0040400) Train Loss: 0.6207, Train Steps/Sec: 1.22
487
+ [2025-10-28 09:22:17] (step=0040500) Train Loss: 0.6213, Train Steps/Sec: 1.21
488
+ [2025-10-28 09:23:40] (step=0040600) Train Loss: 0.6214, Train Steps/Sec: 1.21
489
+ [2025-10-28 09:25:04] (step=0040700) Train Loss: 0.6220, Train Steps/Sec: 1.20
490
+ [2025-10-28 09:26:26] (step=0040800) Train Loss: 0.6210, Train Steps/Sec: 1.21
491
+ [2025-10-28 09:27:48] (step=0040900) Train Loss: 0.6210, Train Steps/Sec: 1.22
492
+ [2025-10-28 09:29:10] (step=0041000) Train Loss: 0.6201, Train Steps/Sec: 1.21
493
+ [2025-10-28 09:30:33] (step=0041100) Train Loss: 0.6214, Train Steps/Sec: 1.21
494
+ [2025-10-28 09:31:55] (step=0041200) Train Loss: 0.6216, Train Steps/Sec: 1.21
495
+ [2025-10-28 09:33:04] Beginning epoch 33...
496
+ [2025-10-28 09:33:20] (step=0041300) Train Loss: 0.6222, Train Steps/Sec: 1.18
497
+ [2025-10-28 09:34:43] (step=0041400) Train Loss: 0.6208, Train Steps/Sec: 1.21
498
+ [2025-10-28 09:36:05] (step=0041500) Train Loss: 0.6200, Train Steps/Sec: 1.21
499
+ [2025-10-28 09:37:27] (step=0041600) Train Loss: 0.6208, Train Steps/Sec: 1.21
500
+ [2025-10-28 09:38:50] (step=0041700) Train Loss: 0.6198, Train Steps/Sec: 1.21
501
+ [2025-10-28 09:40:12] (step=0041800) Train Loss: 0.6202, Train Steps/Sec: 1.21
502
+ [2025-10-28 09:41:35] (step=0041900) Train Loss: 0.6212, Train Steps/Sec: 1.21
503
+ [2025-10-28 09:42:57] (step=0042000) Train Loss: 0.6211, Train Steps/Sec: 1.21
504
+ [2025-10-28 09:44:19] (step=0042100) Train Loss: 0.6202, Train Steps/Sec: 1.21
505
+ [2025-10-28 09:45:42] (step=0042200) Train Loss: 0.6217, Train Steps/Sec: 1.21
506
+ [2025-10-28 09:47:05] (step=0042300) Train Loss: 0.6190, Train Steps/Sec: 1.21
507
+ [2025-10-28 09:48:28] (step=0042400) Train Loss: 0.6194, Train Steps/Sec: 1.21
508
+ [2025-10-28 09:49:50] (step=0042500) Train Loss: 0.6210, Train Steps/Sec: 1.21
509
+ [2025-10-28 09:50:19] Beginning epoch 34...
510
+ [2025-10-28 09:51:15] (step=0042600) Train Loss: 0.6194, Train Steps/Sec: 1.17
511
+ [2025-10-28 09:52:38] (step=0042700) Train Loss: 0.6212, Train Steps/Sec: 1.21
512
+ [2025-10-28 09:54:00] (step=0042800) Train Loss: 0.6197, Train Steps/Sec: 1.21
513
+ [2025-10-28 09:55:22] (step=0042900) Train Loss: 0.6204, Train Steps/Sec: 1.21
514
+ [2025-10-28 09:56:45] (step=0043000) Train Loss: 0.6206, Train Steps/Sec: 1.21
515
+ [2025-10-28 09:58:07] (step=0043100) Train Loss: 0.6196, Train Steps/Sec: 1.21
516
+ [2025-10-28 09:59:30] (step=0043200) Train Loss: 0.6199, Train Steps/Sec: 1.21
517
+ [2025-10-28 10:00:52] (step=0043300) Train Loss: 0.6195, Train Steps/Sec: 1.21
518
+ [2025-10-28 10:02:14] (step=0043400) Train Loss: 0.6182, Train Steps/Sec: 1.21
519
+ [2025-10-28 10:03:37] (step=0043500) Train Loss: 0.6184, Train Steps/Sec: 1.21
520
+ [2025-10-28 10:04:59] (step=0043600) Train Loss: 0.6191, Train Steps/Sec: 1.21
521
+ [2025-10-28 10:06:22] (step=0043700) Train Loss: 0.6198, Train Steps/Sec: 1.21
522
+ [2025-10-28 10:07:32] Beginning epoch 35...
523
+ [2025-10-28 10:07:47] (step=0043800) Train Loss: 0.6197, Train Steps/Sec: 1.18
524
+ [2025-10-28 10:09:09] (step=0043900) Train Loss: 0.6188, Train Steps/Sec: 1.21
525
+ [2025-10-28 10:10:32] (step=0044000) Train Loss: 0.6202, Train Steps/Sec: 1.20
526
+ [2025-10-28 10:11:55] (step=0044100) Train Loss: 0.6196, Train Steps/Sec: 1.21
527
+ [2025-10-28 10:13:17] (step=0044200) Train Loss: 0.6192, Train Steps/Sec: 1.21
528
+ [2025-10-28 10:14:40] (step=0044300) Train Loss: 0.6194, Train Steps/Sec: 1.21
529
+ [2025-10-28 10:16:02] (step=0044400) Train Loss: 0.6183, Train Steps/Sec: 1.21
530
+ [2025-10-28 10:17:24] (step=0044500) Train Loss: 0.6198, Train Steps/Sec: 1.21
531
+ [2025-10-28 10:18:47] (step=0044600) Train Loss: 0.6188, Train Steps/Sec: 1.21
532
+ [2025-10-28 10:20:09] (step=0044700) Train Loss: 0.6202, Train Steps/Sec: 1.21
533
+ [2025-10-28 10:21:31] (step=0044800) Train Loss: 0.6190, Train Steps/Sec: 1.21
534
+ [2025-10-28 10:22:54] (step=0044900) Train Loss: 0.6183, Train Steps/Sec: 1.21
535
+ [2025-10-28 10:24:16] (step=0045000) Train Loss: 0.6187, Train Steps/Sec: 1.22
536
+ [2025-10-28 10:24:46] Beginning epoch 36...
537
+ [2025-10-28 10:25:41] (step=0045100) Train Loss: 0.6187, Train Steps/Sec: 1.17
538
+ [2025-10-28 10:27:04] (step=0045200) Train Loss: 0.6185, Train Steps/Sec: 1.21
539
+ [2025-10-28 10:28:26] (step=0045300) Train Loss: 0.6189, Train Steps/Sec: 1.21
540
+ [2025-10-28 10:29:48] (step=0045400) Train Loss: 0.6186, Train Steps/Sec: 1.21
541
+ [2025-10-28 10:31:11] (step=0045500) Train Loss: 0.6180, Train Steps/Sec: 1.21
542
+ [2025-10-28 10:32:34] (step=0045600) Train Loss: 0.6185, Train Steps/Sec: 1.21
543
+ [2025-10-28 10:33:57] (step=0045700) Train Loss: 0.6188, Train Steps/Sec: 1.21
544
+ [2025-10-28 10:35:19] (step=0045800) Train Loss: 0.6177, Train Steps/Sec: 1.21
545
+ [2025-10-28 10:36:41] (step=0045900) Train Loss: 0.6185, Train Steps/Sec: 1.21
546
+ [2025-10-28 10:38:04] (step=0046000) Train Loss: 0.6175, Train Steps/Sec: 1.21
547
+ [2025-10-28 10:39:26] (step=0046100) Train Loss: 0.6183, Train Steps/Sec: 1.21
548
+ [2025-10-28 10:40:48] (step=0046200) Train Loss: 0.6182, Train Steps/Sec: 1.21
549
+ [2025-10-28 10:42:00] Beginning epoch 37...
550
+ [2025-10-28 10:42:13] (step=0046300) Train Loss: 0.6180, Train Steps/Sec: 1.18
551
+ [2025-10-28 10:43:36] (step=0046400) Train Loss: 0.6183, Train Steps/Sec: 1.21
552
+ [2025-10-28 10:44:58] (step=0046500) Train Loss: 0.6183, Train Steps/Sec: 1.21
553
+ [2025-10-28 10:46:20] (step=0046600) Train Loss: 0.6181, Train Steps/Sec: 1.21
554
+ [2025-10-28 10:47:43] (step=0046700) Train Loss: 0.6180, Train Steps/Sec: 1.21
555
+ [2025-10-28 10:49:05] (step=0046800) Train Loss: 0.6179, Train Steps/Sec: 1.21
556
+ [2025-10-28 10:50:28] (step=0046900) Train Loss: 0.6175, Train Steps/Sec: 1.21
557
+ [2025-10-28 10:51:50] (step=0047000) Train Loss: 0.6184, Train Steps/Sec: 1.21
558
+ [2025-10-28 10:53:12] (step=0047100) Train Loss: 0.6180, Train Steps/Sec: 1.22
559
+ [2025-10-28 10:54:35] (step=0047200) Train Loss: 0.6178, Train Steps/Sec: 1.21
560
+ [2025-10-28 10:55:58] (step=0047300) Train Loss: 0.6179, Train Steps/Sec: 1.20
561
+ [2025-10-28 10:57:20] (step=0047400) Train Loss: 0.6184, Train Steps/Sec: 1.21
562
+ [2025-10-28 10:58:43] (step=0047500) Train Loss: 0.6162, Train Steps/Sec: 1.21
563
+ [2025-10-28 10:59:15] Beginning epoch 38...
564
+ [2025-10-28 11:00:08] (step=0047600) Train Loss: 0.6168, Train Steps/Sec: 1.18
565
+ [2025-10-28 11:01:30] (step=0047700) Train Loss: 0.6176, Train Steps/Sec: 1.21
566
+ [2025-10-28 11:02:53] (step=0047800) Train Loss: 0.6167, Train Steps/Sec: 1.21
567
+ [2025-10-28 11:04:15] (step=0047900) Train Loss: 0.6172, Train Steps/Sec: 1.21
568
+ [2025-10-28 11:05:37] (step=0048000) Train Loss: 0.6173, Train Steps/Sec: 1.21
569
+ [2025-10-28 11:07:00] (step=0048100) Train Loss: 0.6162, Train Steps/Sec: 1.21
570
+ [2025-10-28 11:08:22] (step=0048200) Train Loss: 0.6179, Train Steps/Sec: 1.21
571
+ [2025-10-28 11:09:45] (step=0048300) Train Loss: 0.6167, Train Steps/Sec: 1.21
572
+ [2025-10-28 11:11:07] (step=0048400) Train Loss: 0.6172, Train Steps/Sec: 1.21
573
+ [2025-10-28 11:12:30] (step=0048500) Train Loss: 0.6176, Train Steps/Sec: 1.21
574
+ [2025-10-28 11:13:52] (step=0048600) Train Loss: 0.6162, Train Steps/Sec: 1.21
575
+ [2025-10-28 11:15:15] (step=0048700) Train Loss: 0.6158, Train Steps/Sec: 1.21
576
+ [2025-10-28 11:16:29] Beginning epoch 39...
577
+ [2025-10-28 11:16:40] (step=0048800) Train Loss: 0.6158, Train Steps/Sec: 1.18
578
+ [2025-10-28 11:18:02] (step=0048900) Train Loss: 0.6174, Train Steps/Sec: 1.21
579
+ [2025-10-28 11:19:25] (step=0049000) Train Loss: 0.6169, Train Steps/Sec: 1.20
580
+ [2025-10-28 11:20:48] (step=0049100) Train Loss: 0.6174, Train Steps/Sec: 1.21
581
+ [2025-10-28 11:22:10] (step=0049200) Train Loss: 0.6161, Train Steps/Sec: 1.21
582
+ [2025-10-28 11:23:33] (step=0049300) Train Loss: 0.6156, Train Steps/Sec: 1.21
583
+ [2025-10-28 11:24:55] (step=0049400) Train Loss: 0.6178, Train Steps/Sec: 1.21
584
+ [2025-10-28 11:26:18] (step=0049500) Train Loss: 0.6164, Train Steps/Sec: 1.21
585
+ [2025-10-28 11:27:40] (step=0049600) Train Loss: 0.6165, Train Steps/Sec: 1.21
586
+ [2025-10-28 11:29:03] (step=0049700) Train Loss: 0.6176, Train Steps/Sec: 1.21
587
+ [2025-10-28 11:30:25] (step=0049800) Train Loss: 0.6164, Train Steps/Sec: 1.21
588
+ [2025-10-28 11:31:47] (step=0049900) Train Loss: 0.6153, Train Steps/Sec: 1.21
589
+ [2025-10-28 11:33:10] (step=0050000) Train Loss: 0.6175, Train Steps/Sec: 1.21
590
+ [2025-10-28 11:34:00] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-pe-vit-b-bf16/checkpoints/0050000.pt
591
+ [2025-10-28 11:34:00] Generating EMA samples...
592
+ [2025-10-28 11:34:28] Generating EMA samples done.
593
+ [2025-10-28 11:35:01] Beginning epoch 40...
594
+ [2025-10-28 11:35:53] (step=0050100) Train Loss: 0.6142, Train Steps/Sec: 0.61
595
+ [2025-10-28 11:37:15] (step=0050200) Train Loss: 0.6155, Train Steps/Sec: 1.21
596
+ [2025-10-28 11:38:38] (step=0050300) Train Loss: 0.6145, Train Steps/Sec: 1.21
597
+ [2025-10-28 11:40:00] (step=0050400) Train Loss: 0.6168, Train Steps/Sec: 1.21
598
+ [2025-10-28 11:41:22] (step=0050500) Train Loss: 0.6172, Train Steps/Sec: 1.21
599
+ [2025-10-28 11:42:45] (step=0050600) Train Loss: 0.6145, Train Steps/Sec: 1.21
600
+ [2025-10-28 11:44:08] (step=0050700) Train Loss: 0.6163, Train Steps/Sec: 1.20
601
+ [2025-10-28 11:45:31] (step=0050800) Train Loss: 0.6164, Train Steps/Sec: 1.22
602
+ [2025-10-28 11:46:53] (step=0050900) Train Loss: 0.6158, Train Steps/Sec: 1.21
603
+ [2025-10-28 11:48:15] (step=0051000) Train Loss: 0.6165, Train Steps/Sec: 1.21
604
+ [2025-10-28 11:49:38] (step=0051100) Train Loss: 0.6161, Train Steps/Sec: 1.21
605
+ [2025-10-28 11:51:00] (step=0051200) Train Loss: 0.6159, Train Steps/Sec: 1.22
606
+ [2025-10-28 11:52:15] Beginning epoch 41...
607
+ [2025-10-28 11:52:25] (step=0051300) Train Loss: 0.6159, Train Steps/Sec: 1.18
608
+ [2025-10-28 11:53:47] (step=0051400) Train Loss: 0.6165, Train Steps/Sec: 1.21
609
+ [2025-10-28 11:55:10] (step=0051500) Train Loss: 0.6150, Train Steps/Sec: 1.21
610
+ [2025-10-28 11:56:32] (step=0051600) Train Loss: 0.6150, Train Steps/Sec: 1.21
611
+ [2025-10-28 11:57:54] (step=0051700) Train Loss: 0.6151, Train Steps/Sec: 1.21
612
+ [2025-10-28 11:59:17] (step=0051800) Train Loss: 0.6146, Train Steps/Sec: 1.21
613
+ [2025-10-28 12:00:39] (step=0051900) Train Loss: 0.6158, Train Steps/Sec: 1.21
614
+ [2025-10-28 12:02:02] (step=0052000) Train Loss: 0.6156, Train Steps/Sec: 1.21
615
+ [2025-10-28 12:03:24] (step=0052100) Train Loss: 0.6145, Train Steps/Sec: 1.21
616
+ [2025-10-28 12:04:46] (step=0052200) Train Loss: 0.6156, Train Steps/Sec: 1.21
617
+ [2025-10-28 12:06:09] (step=0052300) Train Loss: 0.6164, Train Steps/Sec: 1.20
618
+ [2025-10-28 12:07:32] (step=0052400) Train Loss: 0.6163, Train Steps/Sec: 1.21
619
+ [2025-10-28 12:08:55] (step=0052500) Train Loss: 0.6150, Train Steps/Sec: 1.21
620
+ [2025-10-28 12:09:30] Beginning epoch 42...
621
+ [2025-10-28 12:10:20] (step=0052600) Train Loss: 0.6150, Train Steps/Sec: 1.18
622
+ [2025-10-28 12:11:42] (step=0052700) Train Loss: 0.6144, Train Steps/Sec: 1.21
623
+ [2025-10-28 12:13:04] (step=0052800) Train Loss: 0.6147, Train Steps/Sec: 1.21
624
+ [2025-10-28 12:14:27] (step=0052900) Train Loss: 0.6158, Train Steps/Sec: 1.21
625
+ [2025-10-28 12:15:49] (step=0053000) Train Loss: 0.6162, Train Steps/Sec: 1.21
626
+ [2025-10-28 12:17:11] (step=0053100) Train Loss: 0.6145, Train Steps/Sec: 1.21
627
+ [2025-10-28 12:18:34] (step=0053200) Train Loss: 0.6146, Train Steps/Sec: 1.21
628
+ [2025-10-28 12:19:56] (step=0053300) Train Loss: 0.6143, Train Steps/Sec: 1.21
629
+ [2025-10-28 12:21:19] (step=0053400) Train Loss: 0.6135, Train Steps/Sec: 1.21
630
+ [2025-10-28 12:22:41] (step=0053500) Train Loss: 0.6158, Train Steps/Sec: 1.21
631
+ [2025-10-28 12:24:03] (step=0053600) Train Loss: 0.6169, Train Steps/Sec: 1.21
632
+ [2025-10-28 12:25:26] (step=0053700) Train Loss: 0.6154, Train Steps/Sec: 1.21
633
+ [2025-10-28 12:26:43] Beginning epoch 43...
634
+ [2025-10-28 12:26:51] (step=0053800) Train Loss: 0.6158, Train Steps/Sec: 1.18
635
+ [2025-10-28 12:28:13] (step=0053900) Train Loss: 0.6136, Train Steps/Sec: 1.21
636
+ [2025-10-28 12:29:36] (step=0054000) Train Loss: 0.6153, Train Steps/Sec: 1.20
637
+ [2025-10-28 12:30:59] (step=0054100) Train Loss: 0.6149, Train Steps/Sec: 1.21
638
+ [2025-10-28 12:32:21] (step=0054200) Train Loss: 0.6145, Train Steps/Sec: 1.21
639
+ [2025-10-28 12:33:44] (step=0054300) Train Loss: 0.6164, Train Steps/Sec: 1.22
640
+ [2025-10-28 12:35:06] (step=0054400) Train Loss: 0.6165, Train Steps/Sec: 1.22
641
+ [2025-10-28 12:36:28] (step=0054500) Train Loss: 0.6120, Train Steps/Sec: 1.22
642
+ [2025-10-28 12:37:50] (step=0054600) Train Loss: 0.6159, Train Steps/Sec: 1.21
643
+ [2025-10-28 12:39:13] (step=0054700) Train Loss: 0.6144, Train Steps/Sec: 1.22
644
+ [2025-10-28 12:40:35] (step=0054800) Train Loss: 0.6151, Train Steps/Sec: 1.22
645
+ [2025-10-28 12:41:57] (step=0054900) Train Loss: 0.6150, Train Steps/Sec: 1.21
646
+ [2025-10-28 12:43:20] (step=0055000) Train Loss: 0.6145, Train Steps/Sec: 1.21
647
+ [2025-10-28 12:43:56] Beginning epoch 44...
648
+ [2025-10-28 12:44:45] (step=0055100) Train Loss: 0.6144, Train Steps/Sec: 1.18
649
+ [2025-10-28 12:46:07] (step=0055200) Train Loss: 0.6134, Train Steps/Sec: 1.21
650
+ [2025-10-28 12:47:29] (step=0055300) Train Loss: 0.6136, Train Steps/Sec: 1.21
651
+ [2025-10-28 12:48:52] (step=0055400) Train Loss: 0.6150, Train Steps/Sec: 1.21
652
+ [2025-10-28 12:50:14] (step=0055500) Train Loss: 0.6146, Train Steps/Sec: 1.22
653
+ [2025-10-28 12:51:37] (step=0055600) Train Loss: 0.6129, Train Steps/Sec: 1.21
654
+ [2025-10-28 12:53:00] (step=0055700) Train Loss: 0.6138, Train Steps/Sec: 1.21
655
+ [2025-10-28 12:54:22] (step=0055800) Train Loss: 0.6136, Train Steps/Sec: 1.21
656
+ [2025-10-28 12:55:45] (step=0055900) Train Loss: 0.6139, Train Steps/Sec: 1.21
657
+ [2025-10-28 12:57:07] (step=0056000) Train Loss: 0.6112, Train Steps/Sec: 1.21
658
+ [2025-10-28 12:58:29] (step=0056100) Train Loss: 0.6153, Train Steps/Sec: 1.21
659
+ [2025-10-28 12:59:52] (step=0056200) Train Loss: 0.6140, Train Steps/Sec: 1.22
660
+ [2025-10-28 13:01:10] Beginning epoch 45...
661
+ [2025-10-28 13:01:17] (step=0056300) Train Loss: 0.6139, Train Steps/Sec: 1.18
662
+ [2025-10-28 13:02:39] (step=0056400) Train Loss: 0.6130, Train Steps/Sec: 1.21
663
+ [2025-10-28 13:04:01] (step=0056500) Train Loss: 0.6143, Train Steps/Sec: 1.21
664
+ [2025-10-28 13:05:24] (step=0056600) Train Loss: 0.6151, Train Steps/Sec: 1.21
665
+ [2025-10-28 13:06:46] (step=0056700) Train Loss: 0.6136, Train Steps/Sec: 1.21
666
+ [2025-10-28 13:08:09] (step=0056800) Train Loss: 0.6132, Train Steps/Sec: 1.21
667
+ [2025-10-28 13:09:31] (step=0056900) Train Loss: 0.6118, Train Steps/Sec: 1.21
668
+ [2025-10-28 13:10:54] (step=0057000) Train Loss: 0.6147, Train Steps/Sec: 1.21
669
+ [2025-10-28 13:12:16] (step=0057100) Train Loss: 0.6131, Train Steps/Sec: 1.21
670
+ [2025-10-28 13:13:39] (step=0057200) Train Loss: 0.6141, Train Steps/Sec: 1.21
671
+ [2025-10-28 13:15:02] (step=0057300) Train Loss: 0.6149, Train Steps/Sec: 1.21
672
+ [2025-10-28 13:16:24] (step=0057400) Train Loss: 0.6145, Train Steps/Sec: 1.21
673
+ [2025-10-28 13:17:47] (step=0057500) Train Loss: 0.6139, Train Steps/Sec: 1.21
674
+ [2025-10-28 13:18:25] Beginning epoch 46...
675
+ [2025-10-28 13:19:11] (step=0057600) Train Loss: 0.6126, Train Steps/Sec: 1.18
676
+ [2025-10-28 13:20:34] (step=0057700) Train Loss: 0.6140, Train Steps/Sec: 1.21
677
+ [2025-10-28 13:21:56] (step=0057800) Train Loss: 0.6127, Train Steps/Sec: 1.21
678
+ [2025-10-28 13:23:19] (step=0057900) Train Loss: 0.6132, Train Steps/Sec: 1.21
679
+ [2025-10-28 13:24:41] (step=0058000) Train Loss: 0.6151, Train Steps/Sec: 1.21
680
+ [2025-10-28 13:26:04] (step=0058100) Train Loss: 0.6142, Train Steps/Sec: 1.21
681
+ [2025-10-28 13:27:26] (step=0058200) Train Loss: 0.6136, Train Steps/Sec: 1.21
682
+ [2025-10-28 13:28:48] (step=0058300) Train Loss: 0.6143, Train Steps/Sec: 1.21
683
+ [2025-10-28 13:30:11] (step=0058400) Train Loss: 0.6141, Train Steps/Sec: 1.21
684
+ [2025-10-28 13:31:33] (step=0058500) Train Loss: 0.6136, Train Steps/Sec: 1.22
685
+ [2025-10-28 13:32:55] (step=0058600) Train Loss: 0.6136, Train Steps/Sec: 1.21
686
+ [2025-10-28 13:34:18] (step=0058700) Train Loss: 0.6140, Train Steps/Sec: 1.21
687
+ [2025-10-28 13:35:38] Beginning epoch 47...
688
+ [2025-10-28 13:35:43] (step=0058800) Train Loss: 0.6139, Train Steps/Sec: 1.18
689
+ [2025-10-28 13:37:05] (step=0058900) Train Loss: 0.6128, Train Steps/Sec: 1.21
690
+ [2025-10-28 13:38:28] (step=0059000) Train Loss: 0.6140, Train Steps/Sec: 1.21
691
+ [2025-10-28 13:39:51] (step=0059100) Train Loss: 0.6125, Train Steps/Sec: 1.21
692
+ [2025-10-28 13:41:13] (step=0059200) Train Loss: 0.6143, Train Steps/Sec: 1.21
693
+ [2025-10-28 13:42:35] (step=0059300) Train Loss: 0.6113, Train Steps/Sec: 1.21
694
+ [2025-10-28 13:43:58] (step=0059400) Train Loss: 0.6131, Train Steps/Sec: 1.21
695
+ [2025-10-28 13:45:20] (step=0059500) Train Loss: 0.6138, Train Steps/Sec: 1.21
696
+ [2025-10-28 13:46:43] (step=0059600) Train Loss: 0.6129, Train Steps/Sec: 1.21
697
+ [2025-10-28 13:48:05] (step=0059700) Train Loss: 0.6129, Train Steps/Sec: 1.21
698
+ [2025-10-28 13:49:28] (step=0059800) Train Loss: 0.6120, Train Steps/Sec: 1.21
699
+ [2025-10-28 13:50:50] (step=0059900) Train Loss: 0.6121, Train Steps/Sec: 1.21
700
+ [2025-10-28 13:52:12] (step=0060000) Train Loss: 0.6131, Train Steps/Sec: 1.21
701
+ [2025-10-28 13:52:53] Beginning epoch 48...
702
+ [2025-10-28 13:53:37] (step=0060100) Train Loss: 0.6122, Train Steps/Sec: 1.18
703
+ [2025-10-28 13:55:00] (step=0060200) Train Loss: 0.6128, Train Steps/Sec: 1.21
704
+ [2025-10-28 13:56:22] (step=0060300) Train Loss: 0.6120, Train Steps/Sec: 1.21
705
+ [2025-10-28 13:57:45] (step=0060400) Train Loss: 0.6112, Train Steps/Sec: 1.21
706
+ [2025-10-28 13:59:07] (step=0060500) Train Loss: 0.6133, Train Steps/Sec: 1.21
707
+ [2025-10-28 14:00:30] (step=0060600) Train Loss: 0.6120, Train Steps/Sec: 1.20
708
+ [2025-10-28 14:01:53] (step=0060700) Train Loss: 0.6118, Train Steps/Sec: 1.21
709
+ [2025-10-28 14:03:15] (step=0060800) Train Loss: 0.6121, Train Steps/Sec: 1.21
710
+ [2025-10-28 14:04:38] (step=0060900) Train Loss: 0.6113, Train Steps/Sec: 1.21
711
+ [2025-10-28 14:06:00] (step=0061000) Train Loss: 0.6121, Train Steps/Sec: 1.21
712
+ [2025-10-28 14:07:23] (step=0061100) Train Loss: 0.6118, Train Steps/Sec: 1.21
713
+ [2025-10-28 14:08:45] (step=0061200) Train Loss: 0.6120, Train Steps/Sec: 1.21
714
+ [2025-10-28 14:10:07] Beginning epoch 49...
715
+ [2025-10-28 14:10:10] (step=0061300) Train Loss: 0.6123, Train Steps/Sec: 1.18
716
+ [2025-10-28 14:11:32] (step=0061400) Train Loss: 0.6107, Train Steps/Sec: 1.21
717
+ [2025-10-28 14:12:54] (step=0061500) Train Loss: 0.6125, Train Steps/Sec: 1.21
718
+ [2025-10-28 14:14:17] (step=0061600) Train Loss: 0.6126, Train Steps/Sec: 1.21
719
+ [2025-10-28 14:15:39] (step=0061700) Train Loss: 0.6120, Train Steps/Sec: 1.21
720
+ [2025-10-28 14:17:01] (step=0061800) Train Loss: 0.6122, Train Steps/Sec: 1.21
721
+ [2025-10-28 14:18:23] (step=0061900) Train Loss: 0.6126, Train Steps/Sec: 1.22
722
+ [2025-10-28 14:19:46] (step=0062000) Train Loss: 0.6120, Train Steps/Sec: 1.21
723
+ [2025-10-28 14:21:08] (step=0062100) Train Loss: 0.6119, Train Steps/Sec: 1.21
724
+ [2025-10-28 14:22:31] (step=0062200) Train Loss: 0.6123, Train Steps/Sec: 1.21
725
+ [2025-10-28 14:23:54] (step=0062300) Train Loss: 0.6116, Train Steps/Sec: 1.21
726
+ [2025-10-28 14:25:16] (step=0062400) Train Loss: 0.6124, Train Steps/Sec: 1.21
727
+ [2025-10-28 14:26:38] (step=0062500) Train Loss: 0.6108, Train Steps/Sec: 1.22
728
+ [2025-10-28 14:27:20] Beginning epoch 50...
729
+ [2025-10-28 14:28:03] (step=0062600) Train Loss: 0.6111, Train Steps/Sec: 1.18
730
+ [2025-10-28 14:29:26] (step=0062700) Train Loss: 0.6107, Train Steps/Sec: 1.21
731
+ [2025-10-28 14:30:48] (step=0062800) Train Loss: 0.6113, Train Steps/Sec: 1.21
732
+ [2025-10-28 14:32:11] (step=0062900) Train Loss: 0.6115, Train Steps/Sec: 1.21
733
+ [2025-10-28 14:33:33] (step=0063000) Train Loss: 0.6120, Train Steps/Sec: 1.21
734
+ [2025-10-28 14:34:55] (step=0063100) Train Loss: 0.6098, Train Steps/Sec: 1.21
735
+ [2025-10-28 14:36:18] (step=0063200) Train Loss: 0.6102, Train Steps/Sec: 1.21
736
+ [2025-10-28 14:37:40] (step=0063300) Train Loss: 0.6123, Train Steps/Sec: 1.21
737
+ [2025-10-28 14:39:03] (step=0063400) Train Loss: 0.6102, Train Steps/Sec: 1.21
738
+ [2025-10-28 14:40:25] (step=0063500) Train Loss: 0.6101, Train Steps/Sec: 1.21
739
+ [2025-10-28 14:41:48] (step=0063600) Train Loss: 0.6126, Train Steps/Sec: 1.21
740
+ [2025-10-28 14:43:10] (step=0063700) Train Loss: 0.6117, Train Steps/Sec: 1.21
741
+ [2025-10-28 14:44:33] (step=0063800) Train Loss: 0.6122, Train Steps/Sec: 1.21
742
+ [2025-10-28 14:44:34] Beginning epoch 51...
743
+ [2025-10-28 14:45:58] (step=0063900) Train Loss: 0.6100, Train Steps/Sec: 1.17
+ [2025-10-28 14:47:21] (step=0064000) Train Loss: 0.6112, Train Steps/Sec: 1.20
+ [2025-10-28 14:48:44] (step=0064100) Train Loss: 0.6109, Train Steps/Sec: 1.21
+ [2025-10-28 14:50:06] (step=0064200) Train Loss: 0.6116, Train Steps/Sec: 1.21
+ [2025-10-28 14:51:28] (step=0064300) Train Loss: 0.6112, Train Steps/Sec: 1.21
+ [2025-10-28 14:52:51] (step=0064400) Train Loss: 0.6116, Train Steps/Sec: 1.21
+ [2025-10-28 14:54:13] (step=0064500) Train Loss: 0.6116, Train Steps/Sec: 1.22
+ [2025-10-28 14:55:35] (step=0064600) Train Loss: 0.6117, Train Steps/Sec: 1.22
+ [2025-10-28 14:56:58] (step=0064700) Train Loss: 0.6103, Train Steps/Sec: 1.21
+ [2025-10-28 14:58:20] (step=0064800) Train Loss: 0.6107, Train Steps/Sec: 1.21
+ [2025-10-28 14:59:42] (step=0064900) Train Loss: 0.6104, Train Steps/Sec: 1.21
+ [2025-10-28 15:01:05] (step=0065000) Train Loss: 0.6120, Train Steps/Sec: 1.22
+ [2025-10-28 15:01:48] Beginning epoch 52...
+ [2025-10-28 15:02:30] (step=0065100) Train Loss: 0.6100, Train Steps/Sec: 1.18
+ [2025-10-28 15:03:52] (step=0065200) Train Loss: 0.6115, Train Steps/Sec: 1.21
+ [2025-10-28 15:05:14] (step=0065300) Train Loss: 0.6115, Train Steps/Sec: 1.21
+ [2025-10-28 15:06:37] (step=0065400) Train Loss: 0.6111, Train Steps/Sec: 1.21
+ [2025-10-28 15:07:59] (step=0065500) Train Loss: 0.6100, Train Steps/Sec: 1.21
+ [2025-10-28 15:09:22] (step=0065600) Train Loss: 0.6106, Train Steps/Sec: 1.21
+ [2025-10-28 15:10:45] (step=0065700) Train Loss: 0.6097, Train Steps/Sec: 1.21
+ [2025-10-28 15:12:07] (step=0065800) Train Loss: 0.6094, Train Steps/Sec: 1.21
+ [2025-10-28 15:13:30] (step=0065900) Train Loss: 0.6110, Train Steps/Sec: 1.21
+ [2025-10-28 15:14:52] (step=0066000) Train Loss: 0.6098, Train Steps/Sec: 1.21
+ [2025-10-28 15:16:14] (step=0066100) Train Loss: 0.6111, Train Steps/Sec: 1.22
+ [2025-10-28 15:17:37] (step=0066200) Train Loss: 0.6103, Train Steps/Sec: 1.21
+ [2025-10-28 15:18:59] (step=0066300) Train Loss: 0.6095, Train Steps/Sec: 1.22
+ [2025-10-28 15:19:02] Beginning epoch 53...
+ [2025-10-28 15:20:24] (step=0066400) Train Loss: 0.6095, Train Steps/Sec: 1.18
+ [2025-10-28 15:21:46] (step=0066500) Train Loss: 0.6096, Train Steps/Sec: 1.21
+ [2025-10-28 15:23:09] (step=0066600) Train Loss: 0.6098, Train Steps/Sec: 1.22
+ [2025-10-28 15:24:31] (step=0066700) Train Loss: 0.6100, Train Steps/Sec: 1.22
+ [2025-10-28 15:25:53] (step=0066800) Train Loss: 0.6100, Train Steps/Sec: 1.22
+ [2025-10-28 15:27:15] (step=0066900) Train Loss: 0.6098, Train Steps/Sec: 1.22
+ [2025-10-28 15:28:38] (step=0067000) Train Loss: 0.6104, Train Steps/Sec: 1.21
+ [2025-10-28 15:30:00] (step=0067100) Train Loss: 0.6104, Train Steps/Sec: 1.21
+ [2025-10-28 15:31:23] (step=0067200) Train Loss: 0.6098, Train Steps/Sec: 1.21
+ [2025-10-28 15:32:46] (step=0067300) Train Loss: 0.6107, Train Steps/Sec: 1.21
+ [2025-10-28 15:34:08] (step=0067400) Train Loss: 0.6098, Train Steps/Sec: 1.21
+ [2025-10-28 15:35:31] (step=0067500) Train Loss: 0.6107, Train Steps/Sec: 1.21
+ [2025-10-28 15:36:16] Beginning epoch 54...
+ [2025-10-28 15:36:56] (step=0067600) Train Loss: 0.6102, Train Steps/Sec: 1.18
+ [2025-10-28 15:38:18] (step=0067700) Train Loss: 0.6082, Train Steps/Sec: 1.21
+ [2025-10-28 15:39:40] (step=0067800) Train Loss: 0.6096, Train Steps/Sec: 1.21
+ [2025-10-28 15:41:03] (step=0067900) Train Loss: 0.6108, Train Steps/Sec: 1.21
+ [2025-10-28 15:42:25] (step=0068000) Train Loss: 0.6101, Train Steps/Sec: 1.21
+ [2025-10-28 15:43:48] (step=0068100) Train Loss: 0.6109, Train Steps/Sec: 1.21
+ [2025-10-28 15:45:10] (step=0068200) Train Loss: 0.6103, Train Steps/Sec: 1.21
+ [2025-10-28 15:46:32] (step=0068300) Train Loss: 0.6095, Train Steps/Sec: 1.21
+ [2025-10-28 15:47:55] (step=0068400) Train Loss: 0.6100, Train Steps/Sec: 1.21
+ [2025-10-28 15:49:17] (step=0068500) Train Loss: 0.6095, Train Steps/Sec: 1.21
+ [2025-10-28 15:50:40] (step=0068600) Train Loss: 0.6106, Train Steps/Sec: 1.21
+ [2025-10-28 15:52:02] (step=0068700) Train Loss: 0.6093, Train Steps/Sec: 1.21
+ [2025-10-28 15:53:25] (step=0068800) Train Loss: 0.6099, Train Steps/Sec: 1.21
+ [2025-10-28 15:53:29] Beginning epoch 55...
+ [2025-10-28 15:54:50] (step=0068900) Train Loss: 0.6083, Train Steps/Sec: 1.17
+ [2025-10-28 15:56:13] (step=0069000) Train Loss: 0.6090, Train Steps/Sec: 1.21
+ [2025-10-28 15:57:35] (step=0069100) Train Loss: 0.6105, Train Steps/Sec: 1.22
+ [2025-10-28 15:58:57] (step=0069200) Train Loss: 0.6084, Train Steps/Sec: 1.21
+ [2025-10-28 16:00:20] (step=0069300) Train Loss: 0.6094, Train Steps/Sec: 1.21
+ [2025-10-28 16:01:42] (step=0069400) Train Loss: 0.6094, Train Steps/Sec: 1.21
+ [2025-10-28 16:03:05] (step=0069500) Train Loss: 0.6087, Train Steps/Sec: 1.21
+ [2025-10-28 16:04:27] (step=0069600) Train Loss: 0.6098, Train Steps/Sec: 1.21
+ [2025-10-28 16:05:49] (step=0069700) Train Loss: 0.6097, Train Steps/Sec: 1.21
+ [2025-10-28 16:07:12] (step=0069800) Train Loss: 0.6079, Train Steps/Sec: 1.21
+ [2025-10-28 16:08:34] (step=0069900) Train Loss: 0.6083, Train Steps/Sec: 1.21
+ [2025-10-28 16:09:57] (step=0070000) Train Loss: 0.6086, Train Steps/Sec: 1.21
+ [2025-10-28 16:10:43] Beginning epoch 56...
+ [2025-10-28 16:11:21] (step=0070100) Train Loss: 0.6092, Train Steps/Sec: 1.18
+ [2025-10-28 16:12:44] (step=0070200) Train Loss: 0.6095, Train Steps/Sec: 1.21
+ [2025-10-28 16:14:06] (step=0070300) Train Loss: 0.6083, Train Steps/Sec: 1.21
+ [2025-10-28 16:15:29] (step=0070400) Train Loss: 0.6085, Train Steps/Sec: 1.21
+ [2025-10-28 16:16:51] (step=0070500) Train Loss: 0.6085, Train Steps/Sec: 1.21
+ [2025-10-28 16:18:14] (step=0070600) Train Loss: 0.6084, Train Steps/Sec: 1.21
+ [2025-10-28 16:19:37] (step=0070700) Train Loss: 0.6095, Train Steps/Sec: 1.21
+ [2025-10-28 16:20:59] (step=0070800) Train Loss: 0.6085, Train Steps/Sec: 1.21
+ [2025-10-28 16:22:22] (step=0070900) Train Loss: 0.6107, Train Steps/Sec: 1.21
+ [2025-10-28 16:23:44] (step=0071000) Train Loss: 0.6099, Train Steps/Sec: 1.21
+ [2025-10-28 16:25:06] (step=0071100) Train Loss: 0.6091, Train Steps/Sec: 1.21
+ [2025-10-28 16:26:29] (step=0071200) Train Loss: 0.6084, Train Steps/Sec: 1.21
+ [2025-10-28 16:27:51] (step=0071300) Train Loss: 0.6090, Train Steps/Sec: 1.21
+ [2025-10-28 16:27:58] Beginning epoch 57...
+ [2025-10-28 16:29:16] (step=0071400) Train Loss: 0.6091, Train Steps/Sec: 1.18
+ [2025-10-28 16:30:39] (step=0071500) Train Loss: 0.6085, Train Steps/Sec: 1.21
+ [2025-10-28 16:32:01] (step=0071600) Train Loss: 0.6100, Train Steps/Sec: 1.21
+ [2025-10-28 16:33:24] (step=0071700) Train Loss: 0.6095, Train Steps/Sec: 1.21
+ [2025-10-28 16:34:46] (step=0071800) Train Loss: 0.6093, Train Steps/Sec: 1.21
+ [2025-10-28 16:36:08] (step=0071900) Train Loss: 0.6084, Train Steps/Sec: 1.21
+ [2025-10-28 16:37:31] (step=0072000) Train Loss: 0.6098, Train Steps/Sec: 1.21
+ [2025-10-28 16:38:53] (step=0072100) Train Loss: 0.6086, Train Steps/Sec: 1.21
+ [2025-10-28 16:40:16] (step=0072200) Train Loss: 0.6085, Train Steps/Sec: 1.21
+ [2025-10-28 16:41:39] (step=0072300) Train Loss: 0.6081, Train Steps/Sec: 1.21
+ [2025-10-28 16:43:01] (step=0072400) Train Loss: 0.6100, Train Steps/Sec: 1.21
+ [2025-10-28 16:44:24] (step=0072500) Train Loss: 0.6094, Train Steps/Sec: 1.21
+ [2025-10-28 16:45:12] Beginning epoch 58...
+ [2025-10-28 16:45:49] (step=0072600) Train Loss: 0.6090, Train Steps/Sec: 1.18
+ [2025-10-28 16:47:11] (step=0072700) Train Loss: 0.6089, Train Steps/Sec: 1.21
+ [2025-10-28 16:48:33] (step=0072800) Train Loss: 0.6082, Train Steps/Sec: 1.21
+ [2025-10-28 16:49:56] (step=0072900) Train Loss: 0.6080, Train Steps/Sec: 1.21
+ [2025-10-28 16:51:18] (step=0073000) Train Loss: 0.6088, Train Steps/Sec: 1.21
+ [2025-10-28 16:52:41] (step=0073100) Train Loss: 0.6083, Train Steps/Sec: 1.21
+ [2025-10-28 16:54:03] (step=0073200) Train Loss: 0.6084, Train Steps/Sec: 1.21
+ [2025-10-28 16:55:26] (step=0073300) Train Loss: 0.6061, Train Steps/Sec: 1.21
+ [2025-10-28 16:56:48] (step=0073400) Train Loss: 0.6090, Train Steps/Sec: 1.21
+ [2025-10-28 16:58:10] (step=0073500) Train Loss: 0.6081, Train Steps/Sec: 1.21
+ [2025-10-28 16:59:33] (step=0073600) Train Loss: 0.6096, Train Steps/Sec: 1.21
+ [2025-10-28 17:00:55] (step=0073700) Train Loss: 0.6094, Train Steps/Sec: 1.21
+ [2025-10-28 17:02:18] (step=0073800) Train Loss: 0.6077, Train Steps/Sec: 1.21
+ [2025-10-28 17:02:26] Beginning epoch 59...
+ [2025-10-28 17:03:43] (step=0073900) Train Loss: 0.6075, Train Steps/Sec: 1.17
+ [2025-10-28 17:05:06] (step=0074000) Train Loss: 0.6078, Train Steps/Sec: 1.21
+ [2025-10-28 17:06:28] (step=0074100) Train Loss: 0.6098, Train Steps/Sec: 1.21
+ [2025-10-28 17:07:51] (step=0074200) Train Loss: 0.6081, Train Steps/Sec: 1.21
+ [2025-10-28 17:09:13] (step=0074300) Train Loss: 0.6091, Train Steps/Sec: 1.21
+ [2025-10-28 17:10:35] (step=0074400) Train Loss: 0.6068, Train Steps/Sec: 1.21
+ [2025-10-28 17:11:58] (step=0074500) Train Loss: 0.6083, Train Steps/Sec: 1.21
+ [2025-10-28 17:13:20] (step=0074600) Train Loss: 0.6077, Train Steps/Sec: 1.21
+ [2025-10-28 17:14:43] (step=0074700) Train Loss: 0.6083, Train Steps/Sec: 1.21
+ [2025-10-28 17:16:05] (step=0074800) Train Loss: 0.6091, Train Steps/Sec: 1.21
+ [2025-10-28 17:17:28] (step=0074900) Train Loss: 0.6068, Train Steps/Sec: 1.21
+ [2025-10-28 17:18:50] (step=0075000) Train Loss: 0.6084, Train Steps/Sec: 1.21
+ [2025-10-28 17:19:46] Saved checkpoint to results/stage2/hfdata/lightningdit-xl-pe-vit-b-bf16/checkpoints/0075000.pt
+ [2025-10-28 17:19:46] Generating EMA samples...
+ [2025-10-28 17:20:15] Generating EMA samples done.
+ [2025-10-28 17:21:04] Beginning epoch 60...
+ [2025-10-28 17:21:39] (step=0075100) Train Loss: 0.6078, Train Steps/Sec: 0.59
+ [2025-10-28 17:23:02] (step=0075200) Train Loss: 0.6084, Train Steps/Sec: 1.21
+ [2025-10-28 17:24:24] (step=0075300) Train Loss: 0.6084, Train Steps/Sec: 1.21
+ [2025-10-28 17:25:47] (step=0075400) Train Loss: 0.6077, Train Steps/Sec: 1.21
+ [2025-10-28 17:27:09] (step=0075500) Train Loss: 0.6087, Train Steps/Sec: 1.21
+ [2025-10-28 17:28:32] (step=0075600) Train Loss: 0.6071, Train Steps/Sec: 1.21
+ [2025-10-28 17:29:55] (step=0075700) Train Loss: 0.6082, Train Steps/Sec: 1.21
+ [2025-10-28 17:31:17] (step=0075800) Train Loss: 0.6081, Train Steps/Sec: 1.22
+ [2025-10-28 17:32:39] (step=0075900) Train Loss: 0.6084, Train Steps/Sec: 1.22
+ [2025-10-28 17:34:02] (step=0076000) Train Loss: 0.6077, Train Steps/Sec: 1.21
+ [2025-10-28 17:35:24] (step=0076100) Train Loss: 0.6083, Train Steps/Sec: 1.22
+ [2025-10-28 17:36:46] (step=0076200) Train Loss: 0.6075, Train Steps/Sec: 1.22
+ [2025-10-28 17:38:09] (step=0076300) Train Loss: 0.6083, Train Steps/Sec: 1.21
+ [2025-10-28 17:38:18] Beginning epoch 61...
+ [2025-10-28 17:39:33] (step=0076400) Train Loss: 0.6076, Train Steps/Sec: 1.18
+ [2025-10-28 17:40:56] (step=0076500) Train Loss: 0.6080, Train Steps/Sec: 1.21
+ [2025-10-28 17:42:18] (step=0076600) Train Loss: 0.6083, Train Steps/Sec: 1.21
+ [2025-10-28 17:43:41] (step=0076700) Train Loss: 0.6059, Train Steps/Sec: 1.21
+ [2025-10-28 17:45:03] (step=0076800) Train Loss: 0.6076, Train Steps/Sec: 1.21
+ [2025-10-28 17:46:25] (step=0076900) Train Loss: 0.6068, Train Steps/Sec: 1.21
+ [2025-10-28 17:47:48] (step=0077000) Train Loss: 0.6079, Train Steps/Sec: 1.21
+ [2025-10-28 17:49:10] (step=0077100) Train Loss: 0.6074, Train Steps/Sec: 1.21
+ [2025-10-28 17:50:33] (step=0077200) Train Loss: 0.6066, Train Steps/Sec: 1.21
+ [2025-10-28 17:51:56] (step=0077300) Train Loss: 0.6080, Train Steps/Sec: 1.20
+ [2025-10-28 17:53:18] (step=0077400) Train Loss: 0.6069, Train Steps/Sec: 1.21
+ [2025-10-28 17:54:41] (step=0077500) Train Loss: 0.6072, Train Steps/Sec: 1.22
+ [2025-10-28 17:55:32] Beginning epoch 62...
+ [2025-10-28 17:56:06] (step=0077600) Train Loss: 0.6077, Train Steps/Sec: 1.18
+ [2025-10-28 17:57:28] (step=0077700) Train Loss: 0.6072, Train Steps/Sec: 1.21
+ [2025-10-28 17:58:50] (step=0077800) Train Loss: 0.6079, Train Steps/Sec: 1.21
+ [2025-10-28 18:00:13] (step=0077900) Train Loss: 0.6074, Train Steps/Sec: 1.21
+ [2025-10-28 18:01:35] (step=0078000) Train Loss: 0.6079, Train Steps/Sec: 1.21
+ [2025-10-28 18:02:58] (step=0078100) Train Loss: 0.6066, Train Steps/Sec: 1.21
+ [2025-10-28 18:04:20] (step=0078200) Train Loss: 0.6086, Train Steps/Sec: 1.21
+ [2025-10-28 18:05:42] (step=0078300) Train Loss: 0.6064, Train Steps/Sec: 1.21
+ [2025-10-28 18:07:05] (step=0078400) Train Loss: 0.6064, Train Steps/Sec: 1.21
+ [2025-10-28 18:08:27] (step=0078500) Train Loss: 0.6069, Train Steps/Sec: 1.21
+ [2025-10-28 18:09:50] (step=0078600) Train Loss: 0.6057, Train Steps/Sec: 1.21
+ [2025-10-28 18:11:12] (step=0078700) Train Loss: 0.6056, Train Steps/Sec: 1.21
+ [2025-10-28 18:12:34] (step=0078800) Train Loss: 0.6067, Train Steps/Sec: 1.21
+ [2025-10-28 18:12:46] Beginning epoch 63...
+ [2025-10-28 18:14:00] (step=0078900) Train Loss: 0.6065, Train Steps/Sec: 1.17
+ [2025-10-28 18:15:22] (step=0079000) Train Loss: 0.6057, Train Steps/Sec: 1.21
+ [2025-10-28 18:16:45] (step=0079100) Train Loss: 0.6067, Train Steps/Sec: 1.21
+ [2025-10-28 18:18:07] (step=0079200) Train Loss: 0.6064, Train Steps/Sec: 1.21
+ [2025-10-28 18:19:30] (step=0079300) Train Loss: 0.6063, Train Steps/Sec: 1.21
+ [2025-10-28 18:20:52] (step=0079400) Train Loss: 0.6074, Train Steps/Sec: 1.21
+ [2025-10-28 18:22:14] (step=0079500) Train Loss: 0.6067, Train Steps/Sec: 1.21
+ [2025-10-28 18:23:36] (step=0079600) Train Loss: 0.6069, Train Steps/Sec: 1.21
+ [2025-10-28 18:24:59] (step=0079700) Train Loss: 0.6072, Train Steps/Sec: 1.22
+ [2025-10-28 18:26:21] (step=0079800) Train Loss: 0.6064, Train Steps/Sec: 1.22
+ [2025-10-28 18:27:43] (step=0079900) Train Loss: 0.6068, Train Steps/Sec: 1.21
+ [2025-10-28 18:29:06] (step=0080000) Train Loss: 0.6059, Train Steps/Sec: 1.22
+ [2025-10-28 18:29:59] Beginning epoch 64...
+ [2025-10-28 18:30:31] (step=0080100) Train Loss: 0.6076, Train Steps/Sec: 1.17
+ [2025-10-28 18:31:53] (step=0080200) Train Loss: 0.6070, Train Steps/Sec: 1.21
+ [2025-10-28 18:33:16] (step=0080300) Train Loss: 0.6058, Train Steps/Sec: 1.21
+ [2025-10-28 18:34:38] (step=0080400) Train Loss: 0.6066, Train Steps/Sec: 1.21
+ [2025-10-28 18:36:01] (step=0080500) Train Loss: 0.6077, Train Steps/Sec: 1.21
+ [2025-10-28 18:37:24] (step=0080600) Train Loss: 0.6058, Train Steps/Sec: 1.20
+ [2025-10-28 18:38:46] (step=0080700) Train Loss: 0.6069, Train Steps/Sec: 1.21
+ [2025-10-28 18:40:09] (step=0080800) Train Loss: 0.6061, Train Steps/Sec: 1.21
+ [2025-10-28 18:41:31] (step=0080900) Train Loss: 0.6061, Train Steps/Sec: 1.21
+ [2025-10-28 18:42:54] (step=0081000) Train Loss: 0.6060, Train Steps/Sec: 1.21
+ [2025-10-28 18:44:16] (step=0081100) Train Loss: 0.6064, Train Steps/Sec: 1.21
+ [2025-10-28 18:45:38] (step=0081200) Train Loss: 0.6058, Train Steps/Sec: 1.21
+ [2025-10-28 18:47:01] (step=0081300) Train Loss: 0.6062, Train Steps/Sec: 1.21
+ [2025-10-28 18:47:14] Beginning epoch 65...
+ [2025-10-28 18:48:26] (step=0081400) Train Loss: 0.6064, Train Steps/Sec: 1.18
+ [2025-10-28 18:49:48] (step=0081500) Train Loss: 0.6055, Train Steps/Sec: 1.21
+ [2025-10-28 18:51:11] (step=0081600) Train Loss: 0.6057, Train Steps/Sec: 1.21
+ [2025-10-28 18:52:33] (step=0081700) Train Loss: 0.6056, Train Steps/Sec: 1.21
+ [2025-10-28 18:53:55] (step=0081800) Train Loss: 0.6063, Train Steps/Sec: 1.21
+ [2025-10-28 18:55:18] (step=0081900) Train Loss: 0.6063, Train Steps/Sec: 1.21
+ [2025-10-28 18:56:40] (step=0082000) Train Loss: 0.6060, Train Steps/Sec: 1.21
+ [2025-10-28 18:58:02] (step=0082100) Train Loss: 0.6064, Train Steps/Sec: 1.21
+ [2025-10-28 18:59:25] (step=0082200) Train Loss: 0.6061, Train Steps/Sec: 1.21
+ [2025-10-28 19:00:48] (step=0082300) Train Loss: 0.6056, Train Steps/Sec: 1.20
+ [2025-10-28 19:02:10] (step=0082400) Train Loss: 0.6068, Train Steps/Sec: 1.21
+ [2025-10-28 19:03:33] (step=0082500) Train Loss: 0.6050, Train Steps/Sec: 1.21
+ [2025-10-28 19:04:28] Beginning epoch 66...
+ [2025-10-28 19:04:58] (step=0082600) Train Loss: 0.6061, Train Steps/Sec: 1.18
+ [2025-10-28 19:06:20] (step=0082700) Train Loss: 0.6057, Train Steps/Sec: 1.21
+ [2025-10-28 19:07:43] (step=0082800) Train Loss: 0.6071, Train Steps/Sec: 1.21
+ [2025-10-28 19:09:05] (step=0082900) Train Loss: 0.6059, Train Steps/Sec: 1.21
+ [2025-10-28 19:10:27] (step=0083000) Train Loss: 0.6051, Train Steps/Sec: 1.21
+ [2025-10-28 19:11:50] (step=0083100) Train Loss: 0.6057, Train Steps/Sec: 1.21
+ [2025-10-28 19:13:12] (step=0083200) Train Loss: 0.6050, Train Steps/Sec: 1.21
+ [2025-10-28 19:14:35] (step=0083300) Train Loss: 0.6062, Train Steps/Sec: 1.21
+ [2025-10-28 19:15:57] (step=0083400) Train Loss: 0.6073, Train Steps/Sec: 1.21
+ [2025-10-28 19:17:20] (step=0083500) Train Loss: 0.6045, Train Steps/Sec: 1.21
+ [2025-10-28 19:18:42] (step=0083600) Train Loss: 0.6055, Train Steps/Sec: 1.21
+ [2025-10-28 19:20:04] (step=0083700) Train Loss: 0.6058, Train Steps/Sec: 1.21
+ [2025-10-28 19:21:27] (step=0083800) Train Loss: 0.6056, Train Steps/Sec: 1.21
+ [2025-10-28 19:21:41] Beginning epoch 67...
+ [2025-10-28 19:22:52] (step=0083900) Train Loss: 0.6058, Train Steps/Sec: 1.17
+ [2025-10-28 19:24:15] (step=0084000) Train Loss: 0.6065, Train Steps/Sec: 1.21
+ [2025-10-28 19:25:37] (step=0084100) Train Loss: 0.6035, Train Steps/Sec: 1.21
+ [2025-10-28 19:27:00] (step=0084200) Train Loss: 0.6055, Train Steps/Sec: 1.21
+ [2025-10-28 19:28:22] (step=0084300) Train Loss: 0.6057, Train Steps/Sec: 1.21
+ [2025-10-28 19:29:45] (step=0084400) Train Loss: 0.6046, Train Steps/Sec: 1.21
+ [2025-10-28 19:31:07] (step=0084500) Train Loss: 0.6051, Train Steps/Sec: 1.21
+ [2025-10-28 19:32:29] (step=0084600) Train Loss: 0.6055, Train Steps/Sec: 1.21
+ [2025-10-28 19:33:52] (step=0084700) Train Loss: 0.6046, Train Steps/Sec: 1.21
+ [2025-10-28 19:35:15] (step=0084800) Train Loss: 0.6044, Train Steps/Sec: 1.21
+ [2025-10-28 19:36:37] (step=0084900) Train Loss: 0.6061, Train Steps/Sec: 1.21
+ [2025-10-28 19:37:59] (step=0085000) Train Loss: 0.6055, Train Steps/Sec: 1.21
+ [2025-10-28 19:38:56] Beginning epoch 68...
+ [2025-10-28 19:39:24] (step=0085100) Train Loss: 0.6062, Train Steps/Sec: 1.18
+ [2025-10-28 19:40:47] (step=0085200) Train Loss: 0.6058, Train Steps/Sec: 1.21
+ [2025-10-28 19:42:09] (step=0085300) Train Loss: 0.6043, Train Steps/Sec: 1.21
+ [2025-10-28 19:43:32] (step=0085400) Train Loss: 0.6036, Train Steps/Sec: 1.21
+ [2025-10-28 19:44:54] (step=0085500) Train Loss: 0.6059, Train Steps/Sec: 1.21
+ [2025-10-28 19:46:17] (step=0085600) Train Loss: 0.6038, Train Steps/Sec: 1.20
+ [2025-10-28 19:47:40] (step=0085700) Train Loss: 0.6058, Train Steps/Sec: 1.21
+ [2025-10-28 19:49:02] (step=0085800) Train Loss: 0.6043, Train Steps/Sec: 1.21
+ [2025-10-28 19:50:24] (step=0085900) Train Loss: 0.6040, Train Steps/Sec: 1.21
+ [2025-10-28 19:51:47] (step=0086000) Train Loss: 0.6060, Train Steps/Sec: 1.21
+ [2025-10-28 19:53:09] (step=0086100) Train Loss: 0.6057, Train Steps/Sec: 1.21
+ [2025-10-28 19:54:31] (step=0086200) Train Loss: 0.6067, Train Steps/Sec: 1.21
+ [2025-10-28 19:55:54] (step=0086300) Train Loss: 0.6045, Train Steps/Sec: 1.21
+ [2025-10-28 19:56:10] Beginning epoch 69...
+ [2025-10-28 19:57:19] (step=0086400) Train Loss: 0.6041, Train Steps/Sec: 1.18
+ [2025-10-28 19:58:41] (step=0086500) Train Loss: 0.6041, Train Steps/Sec: 1.21
+ [2025-10-28 20:00:03] (step=0086600) Train Loss: 0.6042, Train Steps/Sec: 1.21
+ [2025-10-28 20:01:26] (step=0086700) Train Loss: 0.6051, Train Steps/Sec: 1.21
+ [2025-10-28 20:02:48] (step=0086800) Train Loss: 0.6049, Train Steps/Sec: 1.21
+ [2025-10-28 20:04:11] (step=0086900) Train Loss: 0.6049, Train Steps/Sec: 1.21
+ [2025-10-28 20:05:33] (step=0087000) Train Loss: 0.6051, Train Steps/Sec: 1.21
+ [2025-10-28 20:06:55] (step=0087100) Train Loss: 0.6048, Train Steps/Sec: 1.21
+ [2025-10-28 20:08:18] (step=0087200) Train Loss: 0.6048, Train Steps/Sec: 1.21
+ [2025-10-28 20:09:41] (step=0087300) Train Loss: 0.6050, Train Steps/Sec: 1.21
+ [2025-10-28 20:11:03] (step=0087400) Train Loss: 0.6041, Train Steps/Sec: 1.22
+ [2025-10-28 20:12:26] (step=0087500) Train Loss: 0.6052, Train Steps/Sec: 1.22
+ [2025-10-28 20:13:24] Beginning epoch 70...
+ [2025-10-28 20:13:51] (step=0087600) Train Loss: 0.6043, Train Steps/Sec: 1.18
+ [2025-10-28 20:15:13] (step=0087700) Train Loss: 0.6040, Train Steps/Sec: 1.21
+ [2025-10-28 20:16:35] (step=0087800) Train Loss: 0.6047, Train Steps/Sec: 1.21
+ [2025-10-28 20:17:58] (step=0087900) Train Loss: 0.6050, Train Steps/Sec: 1.21
+ [2025-10-28 20:19:20] (step=0088000) Train Loss: 0.6056, Train Steps/Sec: 1.21
+ [2025-10-28 20:20:43] (step=0088100) Train Loss: 0.6041, Train Steps/Sec: 1.21
+ [2025-10-28 20:22:05] (step=0088200) Train Loss: 0.6028, Train Steps/Sec: 1.21
+ [2025-10-28 20:23:27] (step=0088300) Train Loss: 0.6055, Train Steps/Sec: 1.21
+ [2025-10-28 20:24:50] (step=0088400) Train Loss: 0.6056, Train Steps/Sec: 1.21
+ [2025-10-28 20:26:12] (step=0088500) Train Loss: 0.6052, Train Steps/Sec: 1.21
+ [2025-10-28 20:27:35] (step=0088600) Train Loss: 0.6060, Train Steps/Sec: 1.21
+ [2025-10-28 20:28:57] (step=0088700) Train Loss: 0.6047, Train Steps/Sec: 1.21
+ [2025-10-28 20:30:20] (step=0088800) Train Loss: 0.6041, Train Steps/Sec: 1.21
+ [2025-10-28 20:30:37] Beginning epoch 71...
+ [2025-10-28 20:31:45] (step=0088900) Train Loss: 0.6052, Train Steps/Sec: 1.17
+ [2025-10-28 20:33:08] (step=0089000) Train Loss: 0.6051, Train Steps/Sec: 1.21
+ [2025-10-28 20:34:30] (step=0089100) Train Loss: 0.6050, Train Steps/Sec: 1.21
+ [2025-10-28 20:35:52] (step=0089200) Train Loss: 0.6039, Train Steps/Sec: 1.21
+ [2025-10-28 20:37:15] (step=0089300) Train Loss: 0.6048, Train Steps/Sec: 1.21
+ [2025-10-28 20:38:37] (step=0089400) Train Loss: 0.6039, Train Steps/Sec: 1.21
+ [2025-10-28 20:39:59] (step=0089500) Train Loss: 0.6039, Train Steps/Sec: 1.21
+ [2025-10-28 20:41:22] (step=0089600) Train Loss: 0.6028, Train Steps/Sec: 1.21
+ [2025-10-28 20:42:44] (step=0089700) Train Loss: 0.6043, Train Steps/Sec: 1.21
+ [2025-10-28 20:44:07] (step=0089800) Train Loss: 0.6061, Train Steps/Sec: 1.21
+ [2025-10-28 20:45:29] (step=0089900) Train Loss: 0.6019, Train Steps/Sec: 1.21
+ [2025-10-28 20:46:51] (step=0090000) Train Loss: 0.6035, Train Steps/Sec: 1.21
+ [2025-10-28 20:47:51] Beginning epoch 72...
+ [2025-10-28 20:48:16] (step=0090100) Train Loss: 0.6032, Train Steps/Sec: 1.18
+ [2025-10-28 20:49:39] (step=0090200) Train Loss: 0.6047, Train Steps/Sec: 1.21
+ [2025-10-28 20:51:01] (step=0090300) Train Loss: 0.6027, Train Steps/Sec: 1.21
+ [2025-10-28 20:52:24] (step=0090400) Train Loss: 0.6056, Train Steps/Sec: 1.21
+ [2025-10-28 20:53:46] (step=0090500) Train Loss: 0.6045, Train Steps/Sec: 1.21
+ [2025-10-28 20:55:09] (step=0090600) Train Loss: 0.6045, Train Steps/Sec: 1.20
+ [2025-10-28 20:56:32] (step=0090700) Train Loss: 0.6047, Train Steps/Sec: 1.21
+ [2025-10-28 20:57:54] (step=0090800) Train Loss: 0.6037, Train Steps/Sec: 1.21
+ [2025-10-28 20:59:16] (step=0090900) Train Loss: 0.6054, Train Steps/Sec: 1.21
+ [2025-10-28 21:00:38] (step=0091000) Train Loss: 0.6043, Train Steps/Sec: 1.22
+ [2025-10-28 21:02:01] (step=0091100) Train Loss: 0.6042, Train Steps/Sec: 1.22
+ [2025-10-28 21:03:23] (step=0091200) Train Loss: 0.6042, Train Steps/Sec: 1.22
+ [2025-10-28 21:04:45] (step=0091300) Train Loss: 0.6051, Train Steps/Sec: 1.22
+ [2025-10-28 21:05:05] Beginning epoch 73...
+ [2025-10-28 21:06:10] (step=0091400) Train Loss: 0.6041, Train Steps/Sec: 1.18
+ [2025-10-28 21:07:33] (step=0091500) Train Loss: 0.6042, Train Steps/Sec: 1.21
+ [2025-10-28 21:08:55] (step=0091600) Train Loss: 0.6034, Train Steps/Sec: 1.21
+ [2025-10-28 21:10:18] (step=0091700) Train Loss: 0.6041, Train Steps/Sec: 1.21
+ [2025-10-28 21:11:40] (step=0091800) Train Loss: 0.6046, Train Steps/Sec: 1.21
+ [2025-10-28 21:13:02] (step=0091900) Train Loss: 0.6040, Train Steps/Sec: 1.21
+ [2025-10-28 21:14:25] (step=0092000) Train Loss: 0.6046, Train Steps/Sec: 1.21
+ [2025-10-28 21:15:47] (step=0092100) Train Loss: 0.6038, Train Steps/Sec: 1.21
+ [2025-10-28 21:17:10] (step=0092200) Train Loss: 0.6029, Train Steps/Sec: 1.21
+ [2025-10-28 21:18:33] (step=0092300) Train Loss: 0.6034, Train Steps/Sec: 1.21
+ [2025-10-28 21:19:55] (step=0092400) Train Loss: 0.6039, Train Steps/Sec: 1.21
+ [2025-10-28 21:21:17] (step=0092500) Train Loss: 0.6033, Train Steps/Sec: 1.21
+ [2025-10-28 21:22:19] Beginning epoch 74...
+ [2025-10-28 21:22:42] (step=0092600) Train Loss: 0.6031, Train Steps/Sec: 1.18
+ [2025-10-28 21:24:05] (step=0092700) Train Loss: 0.6028, Train Steps/Sec: 1.21
+ [2025-10-28 21:25:27] (step=0092800) Train Loss: 0.6040, Train Steps/Sec: 1.21
+ [2025-10-28 21:26:50] (step=0092900) Train Loss: 0.6033, Train Steps/Sec: 1.21
+ [2025-10-28 21:28:12] (step=0093000) Train Loss: 0.6022, Train Steps/Sec: 1.21
+ [2025-10-28 21:29:35] (step=0093100) Train Loss: 0.6023, Train Steps/Sec: 1.21
+ [2025-10-28 21:30:57] (step=0093200) Train Loss: 0.6036, Train Steps/Sec: 1.21
+ [2025-10-28 21:32:19] (step=0093300) Train Loss: 0.6028, Train Steps/Sec: 1.21
+ [2025-10-28 21:33:42] (step=0093400) Train Loss: 0.6027, Train Steps/Sec: 1.21
+ [2025-10-28 21:35:04] (step=0093500) Train Loss: 0.6032, Train Steps/Sec: 1.21
+ [2025-10-28 21:36:27] (step=0093600) Train Loss: 0.6033, Train Steps/Sec: 1.21
+ [2025-10-28 21:37:49] (step=0093700) Train Loss: 0.6050, Train Steps/Sec: 1.21
+ [2025-10-28 21:39:11] (step=0093800) Train Loss: 0.6031, Train Steps/Sec: 1.21
+ [2025-10-28 21:39:33] Beginning epoch 75...
+ [2025-10-28 21:40:37] (step=0093900) Train Loss: 0.6038, Train Steps/Sec: 1.17
+ [2025-10-28 21:42:00] (step=0094000) Train Loss: 0.6031, Train Steps/Sec: 1.21
+ [2025-10-28 21:43:22] (step=0094100) Train Loss: 0.6031, Train Steps/Sec: 1.21
+ [2025-10-28 21:44:44] (step=0094200) Train Loss: 0.6030, Train Steps/Sec: 1.21
+ [2025-10-28 21:46:07] (step=0094300) Train Loss: 0.6033, Train Steps/Sec: 1.21
+ [2025-10-28 21:47:29] (step=0094400) Train Loss: 0.6032, Train Steps/Sec: 1.21
+ [2025-10-28 21:48:52] (step=0094500) Train Loss: 0.6027, Train Steps/Sec: 1.21
+ [2025-10-28 21:50:14] (step=0094600) Train Loss: 0.6039, Train Steps/Sec: 1.21
+ [2025-10-28 21:51:37] (step=0094700) Train Loss: 0.6037, Train Steps/Sec: 1.21
+ [2025-10-28 21:52:59] (step=0094800) Train Loss: 0.6034, Train Steps/Sec: 1.21
+ [2025-10-28 21:54:21] (step=0094900) Train Loss: 0.6046, Train Steps/Sec: 1.21
+ [2025-10-28 21:55:44] (step=0095000) Train Loss: 0.6017, Train Steps/Sec: 1.21
+ [2025-10-28 21:56:47] Beginning epoch 76...
+ [2025-10-28 21:57:09] (step=0095100) Train Loss: 0.6027, Train Steps/Sec: 1.18
+ [2025-10-28 21:58:31] (step=0095200) Train Loss: 0.6028, Train Steps/Sec: 1.21
+ [2025-10-28 21:59:53] (step=0095300) Train Loss: 0.6024, Train Steps/Sec: 1.21
+ [2025-10-28 22:01:16] (step=0095400) Train Loss: 0.6033, Train Steps/Sec: 1.21
+ [2025-10-28 22:02:38] (step=0095500) Train Loss: 0.6030, Train Steps/Sec: 1.22
+ [2025-10-28 22:04:01] (step=0095600) Train Loss: 0.6013, Train Steps/Sec: 1.20
+ [2025-10-28 22:05:24] (step=0095700) Train Loss: 0.6041, Train Steps/Sec: 1.21
+ [2025-10-28 22:06:46] (step=0095800) Train Loss: 0.6020, Train Steps/Sec: 1.21
+ [2025-10-28 22:08:09] (step=0095900) Train Loss: 0.6038, Train Steps/Sec: 1.21
+ [2025-10-28 22:09:31] (step=0096000) Train Loss: 0.6043, Train Steps/Sec: 1.21
+ [2025-10-28 22:10:54] (step=0096100) Train Loss: 0.6040, Train Steps/Sec: 1.21
+ [2025-10-28 22:12:16] (step=0096200) Train Loss: 0.6036, Train Steps/Sec: 1.21
+ [2025-10-28 22:13:38] (step=0096300) Train Loss: 0.6026, Train Steps/Sec: 1.21
+ [2025-10-28 22:14:01] Beginning epoch 77...
+ [2025-10-28 22:15:03] (step=0096400) Train Loss: 0.6027, Train Steps/Sec: 1.18
+ [2025-10-28 22:16:26] (step=0096500) Train Loss: 0.6010, Train Steps/Sec: 1.21
+ [2025-10-28 22:17:48] (step=0096600) Train Loss: 0.6014, Train Steps/Sec: 1.21
+ [2025-10-28 22:19:11] (step=0096700) Train Loss: 0.6017, Train Steps/Sec: 1.21
+ [2025-10-28 22:20:33] (step=0096800) Train Loss: 0.6028, Train Steps/Sec: 1.21
+ [2025-10-28 22:21:55] (step=0096900) Train Loss: 0.6029, Train Steps/Sec: 1.21
+ [2025-10-28 22:23:18] (step=0097000) Train Loss: 0.6023, Train Steps/Sec: 1.21
+ [2025-10-28 22:24:40] (step=0097100) Train Loss: 0.6019, Train Steps/Sec: 1.21
+ [2025-10-28 22:26:03] (step=0097200) Train Loss: 0.6038, Train Steps/Sec: 1.20
+ [2025-10-28 22:27:26] (step=0097300) Train Loss: 0.6031, Train Steps/Sec: 1.21
+ [2025-10-28 22:28:49] (step=0097400) Train Loss: 0.6031, Train Steps/Sec: 1.21
+ [2025-10-28 22:30:11] (step=0097500) Train Loss: 0.6017, Train Steps/Sec: 1.21
+ [2025-10-28 22:31:16] Beginning epoch 78...
+ [2025-10-28 22:31:36] (step=0097600) Train Loss: 0.6025, Train Steps/Sec: 1.18
+ [2025-10-28 22:32:58] (step=0097700) Train Loss: 0.6026, Train Steps/Sec: 1.21
+ [2025-10-28 22:34:21] (step=0097800) Train Loss: 0.6030, Train Steps/Sec: 1.21
+ [2025-10-28 22:35:43] (step=0097900) Train Loss: 0.6020, Train Steps/Sec: 1.21
+ [2025-10-28 22:37:06] (step=0098000) Train Loss: 0.6021, Train Steps/Sec: 1.21
+ [2025-10-28 22:38:28] (step=0098100) Train Loss: 0.6034, Train Steps/Sec: 1.21
+ [2025-10-28 22:39:50] (step=0098200) Train Loss: 0.6023, Train Steps/Sec: 1.21