KublaiKhan1 commited on
Commit
244afc6
Β·
verified Β·
1 Parent(s): b68014e

Upload folder using huggingface_hub

Browse files
2e-5_kl_naive_globalscale_channelmean_sampling/log.txt CHANGED
@@ -67,7 +67,7 @@ Disc shape (1, 16, 16, 512)
67
  Disc shape (1, 8, 8, 512)
68
  Disc shape (1, 4, 4, 512)
69
  Total num of Discriminator parameters: 23998017
70
- Loaded checkpoint from 13763719 seconds ago.
71
  Loaded model with step 511001
72
  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
73
  β”‚ TPU 0 β”‚
@@ -768,7 +768,7 @@ DiT: Input of shape (4, 32, 32, 4) dtype float32
768
  DiT: After patch embed, shape is (4, 256, 768) dtype bfloat16
769
  DiT: Patch Embed of shape (4, 256, 768) dtype bfloat16
770
  DiT: Conditioning of shape (1, 768) dtype float32
771
- Loaded checkpoint from 21940 seconds ago.
772
 
773
  parameter shapes:
774
  ('PatchEmbed_0', 'Conv_0', 'kernel'): (2, 2, 4, 768)
@@ -1938,501 +1938,60 @@ Decoder layer (128, 64, 64, 512)
1938
  Decoder layer (128, 128, 128, 512)
1939
  Decoder layer (128, 256, 256, 256)
1940
  Decoder layer (128, 256, 256, 128)
1941
- FID is 36.831787109375
1942
- (512, 256, 256, 3)
1943
- Calc FID for CFG 1.0 and denoise_timesteps 64
1944
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1945
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1946
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1947
- DiT: Conditioning of shape (512, 768) dtype float32
1948
- FID is 37.423683166503906
1949
- (512, 256, 256, 3)
1950
- Calc FID for CFG 1.0 and denoise_timesteps 32
1951
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1952
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1953
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1954
- DiT: Conditioning of shape (512, 768) dtype float32
1955
- FID is 39.304908752441406
1956
- (512, 256, 256, 3)
1957
- Calc FID for CFG 1.0 and denoise_timesteps 16
1958
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1959
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1960
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1961
- DiT: Conditioning of shape (512, 768) dtype float32
1962
- FID is 45.72352600097656
1963
- (512, 256, 256, 3)
1964
- Calc FID for CFG 1.0 and denoise_timesteps 8
1965
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1966
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1967
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1968
- DiT: Conditioning of shape (512, 768) dtype float32
1969
- FID is 67.60384368896484
1970
- (512, 256, 256, 3)
1971
- Calc FID for CFG 1.0 and denoise_timesteps 4
1972
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1973
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1974
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1975
- DiT: Conditioning of shape (512, 768) dtype float32
1976
- FID is 152.43223571777344
1977
- (512, 256, 256, 3)
1978
- Calc FID for CFG 1.0 and denoise_timesteps 2
1979
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1980
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1981
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1982
- DiT: Conditioning of shape (512, 768) dtype float32
1983
- FID is 325.9117431640625
1984
- (512, 256, 256, 3)
1985
- Calc FID for CFG 1.0 and denoise_timesteps 1
1986
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1987
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1988
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1989
- DiT: Conditioning of shape (512, 768) dtype float32
1990
- FID is 265.09783935546875
1991
  (512, 256, 256, 3)
1992
  Calc FID for CFG 1.25 and denoise_timesteps 128
1993
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1994
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1995
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1996
  DiT: Conditioning of shape (512, 768) dtype float32
1997
- FID is 22.617610931396484
1998
- (512, 256, 256, 3)
1999
- Calc FID for CFG 1.25 and denoise_timesteps 64
2000
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2001
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2002
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2003
- DiT: Conditioning of shape (512, 768) dtype float32
2004
- FID is 23.02716827392578
2005
- (512, 256, 256, 3)
2006
- Calc FID for CFG 1.25 and denoise_timesteps 32
2007
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2008
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2009
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2010
- DiT: Conditioning of shape (512, 768) dtype float32
2011
- FID is 24.476266860961914
2012
- (512, 256, 256, 3)
2013
- Calc FID for CFG 1.25 and denoise_timesteps 16
2014
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2015
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2016
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2017
- DiT: Conditioning of shape (512, 768) dtype float32
2018
- FID is 29.616016387939453
2019
- (512, 256, 256, 3)
2020
- Calc FID for CFG 1.25 and denoise_timesteps 8
2021
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2022
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2023
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2024
- DiT: Conditioning of shape (512, 768) dtype float32
2025
- FID is 49.262271881103516
2026
- (512, 256, 256, 3)
2027
- Calc FID for CFG 1.25 and denoise_timesteps 4
2028
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2029
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2030
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2031
- DiT: Conditioning of shape (512, 768) dtype float32
2032
- FID is 128.205322265625
2033
- (512, 256, 256, 3)
2034
- Calc FID for CFG 1.25 and denoise_timesteps 2
2035
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2036
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2037
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2038
- DiT: Conditioning of shape (512, 768) dtype float32
2039
- FID is 313.4750671386719
2040
- (512, 256, 256, 3)
2041
- Calc FID for CFG 1.25 and denoise_timesteps 1
2042
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2043
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2044
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2045
- DiT: Conditioning of shape (512, 768) dtype float32
2046
- FID is 256.4454345703125
2047
  (512, 256, 256, 3)
2048
  Calc FID for CFG 1.5 and denoise_timesteps 128
2049
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2050
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2051
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2052
  DiT: Conditioning of shape (512, 768) dtype float32
2053
- FID is 14.177565574645996
2054
- (512, 256, 256, 3)
2055
- Calc FID for CFG 1.5 and denoise_timesteps 64
2056
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2057
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2058
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2059
- DiT: Conditioning of shape (512, 768) dtype float32
2060
- FID is 14.451891899108887
2061
- (512, 256, 256, 3)
2062
- Calc FID for CFG 1.5 and denoise_timesteps 32
2063
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2064
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2065
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2066
- DiT: Conditioning of shape (512, 768) dtype float32
2067
- FID is 15.438105583190918
2068
- (512, 256, 256, 3)
2069
- Calc FID for CFG 1.5 and denoise_timesteps 16
2070
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2071
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2072
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2073
- DiT: Conditioning of shape (512, 768) dtype float32
2074
- FID is 19.319408416748047
2075
- (512, 256, 256, 3)
2076
- Calc FID for CFG 1.5 and denoise_timesteps 8
2077
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2078
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2079
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2080
- DiT: Conditioning of shape (512, 768) dtype float32
2081
- FID is 35.212379455566406
2082
- (512, 256, 256, 3)
2083
- Calc FID for CFG 1.5 and denoise_timesteps 4
2084
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2085
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2086
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2087
- DiT: Conditioning of shape (512, 768) dtype float32
2088
- FID is 106.40361785888672
2089
- (512, 256, 256, 3)
2090
- Calc FID for CFG 1.5 and denoise_timesteps 2
2091
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2092
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2093
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2094
- DiT: Conditioning of shape (512, 768) dtype float32
2095
- FID is 302.155517578125
2096
- (512, 256, 256, 3)
2097
- Calc FID for CFG 1.5 and denoise_timesteps 1
2098
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2099
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2100
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2101
- DiT: Conditioning of shape (512, 768) dtype float32
2102
- FID is 249.88137817382812
2103
  (512, 256, 256, 3)
2104
  Calc FID for CFG 1.75 and denoise_timesteps 128
2105
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2106
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2107
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2108
  DiT: Conditioning of shape (512, 768) dtype float32
2109
- FID is 9.794225692749023
2110
- (512, 256, 256, 3)
2111
- Calc FID for CFG 1.75 and denoise_timesteps 64
2112
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2113
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2114
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2115
- DiT: Conditioning of shape (512, 768) dtype float32
2116
- FID is 9.958918571472168
2117
- (512, 256, 256, 3)
2118
- Calc FID for CFG 1.75 and denoise_timesteps 32
2119
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2120
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2121
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2122
- DiT: Conditioning of shape (512, 768) dtype float32
2123
- FID is 10.63227367401123
2124
- (512, 256, 256, 3)
2125
- Calc FID for CFG 1.75 and denoise_timesteps 16
2126
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2127
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2128
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2129
- DiT: Conditioning of shape (512, 768) dtype float32
2130
- FID is 13.343984603881836
2131
- (512, 256, 256, 3)
2132
- Calc FID for CFG 1.75 and denoise_timesteps 8
2133
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2134
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2135
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2136
- DiT: Conditioning of shape (512, 768) dtype float32
2137
- FID is 25.57964324951172
2138
- (512, 256, 256, 3)
2139
- Calc FID for CFG 1.75 and denoise_timesteps 4
2140
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2141
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2142
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2143
- DiT: Conditioning of shape (512, 768) dtype float32
2144
- FID is 88.0321044921875
2145
- (512, 256, 256, 3)
2146
- Calc FID for CFG 1.75 and denoise_timesteps 2
2147
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2148
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2149
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2150
- DiT: Conditioning of shape (512, 768) dtype float32
2151
- FID is 291.9643249511719
2152
- (512, 256, 256, 3)
2153
- Calc FID for CFG 1.75 and denoise_timesteps 1
2154
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2155
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2156
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2157
- DiT: Conditioning of shape (512, 768) dtype float32
2158
- FID is 245.0141143798828
2159
  (512, 256, 256, 3)
2160
  Calc FID for CFG 2.0 and denoise_timesteps 128
2161
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2162
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2163
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2164
  DiT: Conditioning of shape (512, 768) dtype float32
2165
- FID is 7.921169281005859
2166
- (512, 256, 256, 3)
2167
- Calc FID for CFG 2.0 and denoise_timesteps 64
2168
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2169
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2170
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2171
- DiT: Conditioning of shape (512, 768) dtype float32
2172
- FID is 8.00857162475586
2173
- (512, 256, 256, 3)
2174
- Calc FID for CFG 2.0 and denoise_timesteps 32
2175
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2176
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2177
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2178
- DiT: Conditioning of shape (512, 768) dtype float32
2179
- FID is 8.395737648010254
2180
- (512, 256, 256, 3)
2181
- Calc FID for CFG 2.0 and denoise_timesteps 16
2182
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2183
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2184
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2185
- DiT: Conditioning of shape (512, 768) dtype float32
2186
- FID is 10.18696403503418
2187
- (512, 256, 256, 3)
2188
- Calc FID for CFG 2.0 and denoise_timesteps 8
2189
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2190
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2191
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2192
- DiT: Conditioning of shape (512, 768) dtype float32
2193
- FID is 19.315452575683594
2194
- (512, 256, 256, 3)
2195
- Calc FID for CFG 2.0 and denoise_timesteps 4
2196
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2197
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2198
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2199
- DiT: Conditioning of shape (512, 768) dtype float32
2200
- FID is 73.04430389404297
2201
- (512, 256, 256, 3)
2202
- Calc FID for CFG 2.0 and denoise_timesteps 2
2203
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2204
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2205
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2206
- DiT: Conditioning of shape (512, 768) dtype float32
2207
- FID is 282.81884765625
2208
- (512, 256, 256, 3)
2209
- Calc FID for CFG 2.0 and denoise_timesteps 1
2210
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2211
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2212
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2213
- DiT: Conditioning of shape (512, 768) dtype float32
2214
- FID is 241.4156036376953
2215
  (512, 256, 256, 3)
2216
  Calc FID for CFG 2.25 and denoise_timesteps 128
2217
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2218
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2219
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2220
  DiT: Conditioning of shape (512, 768) dtype float32
2221
- FID is 7.42802619934082
2222
- (512, 256, 256, 3)
2223
- Calc FID for CFG 2.25 and denoise_timesteps 64
2224
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2225
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2226
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2227
- DiT: Conditioning of shape (512, 768) dtype float32
2228
- FID is 7.460850715637207
2229
- (512, 256, 256, 3)
2230
- Calc FID for CFG 2.25 and denoise_timesteps 32
2231
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2232
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2233
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2234
- DiT: Conditioning of shape (512, 768) dtype float32
2235
- FID is 7.677089691162109
2236
- (512, 256, 256, 3)
2237
- Calc FID for CFG 2.25 and denoise_timesteps 16
2238
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2239
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2240
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2241
- DiT: Conditioning of shape (512, 768) dtype float32
2242
- FID is 8.823616027832031
2243
- (512, 256, 256, 3)
2244
- Calc FID for CFG 2.25 and denoise_timesteps 8
2245
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2246
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2247
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2248
- DiT: Conditioning of shape (512, 768) dtype float32
2249
- FID is 15.475367546081543
2250
- (512, 256, 256, 3)
2251
- Calc FID for CFG 2.25 and denoise_timesteps 4
2252
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2253
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2254
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2255
- DiT: Conditioning of shape (512, 768) dtype float32
2256
- FID is 61.213165283203125
2257
- (512, 256, 256, 3)
2258
- Calc FID for CFG 2.25 and denoise_timesteps 2
2259
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2260
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2261
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2262
- DiT: Conditioning of shape (512, 768) dtype float32
2263
- FID is 274.59478759765625
2264
- (512, 256, 256, 3)
2265
- Calc FID for CFG 2.25 and denoise_timesteps 1
2266
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2267
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2268
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2269
- DiT: Conditioning of shape (512, 768) dtype float32
2270
- FID is 238.5747833251953
2271
  (512, 256, 256, 3)
2272
  Calc FID for CFG 2.5 and denoise_timesteps 128
2273
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2274
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2275
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2276
  DiT: Conditioning of shape (512, 768) dtype float32
2277
- FID is 7.72978401184082
2278
- (512, 256, 256, 3)
2279
- Calc FID for CFG 2.5 and denoise_timesteps 64
2280
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2281
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2282
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2283
- DiT: Conditioning of shape (512, 768) dtype float32
2284
- FID is 7.7087931632995605
2285
- (512, 256, 256, 3)
2286
- Calc FID for CFG 2.5 and denoise_timesteps 32
2287
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2288
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2289
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2290
- DiT: Conditioning of shape (512, 768) dtype float32
2291
- FID is 7.7968902587890625
2292
- (512, 256, 256, 3)
2293
- Calc FID for CFG 2.5 and denoise_timesteps 16
2294
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2295
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2296
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2297
- DiT: Conditioning of shape (512, 768) dtype float32
2298
- FID is 8.493557929992676
2299
- (512, 256, 256, 3)
2300
- Calc FID for CFG 2.5 and denoise_timesteps 8
2301
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2302
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2303
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2304
- DiT: Conditioning of shape (512, 768) dtype float32
2305
- FID is 13.25935173034668
2306
- (512, 256, 256, 3)
2307
- Calc FID for CFG 2.5 and denoise_timesteps 4
2308
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2309
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2310
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2311
- DiT: Conditioning of shape (512, 768) dtype float32
2312
- FID is 51.91324996948242
2313
- (512, 256, 256, 3)
2314
- Calc FID for CFG 2.5 and denoise_timesteps 2
2315
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2316
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2317
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2318
- DiT: Conditioning of shape (512, 768) dtype float32
2319
- FID is 267.32489013671875
2320
- (512, 256, 256, 3)
2321
- Calc FID for CFG 2.5 and denoise_timesteps 1
2322
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2323
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2324
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2325
- DiT: Conditioning of shape (512, 768) dtype float32
2326
- FID is 236.34414672851562
2327
  (512, 256, 256, 3)
2328
  Calc FID for CFG 2.75 and denoise_timesteps 128
2329
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2330
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2331
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2332
  DiT: Conditioning of shape (512, 768) dtype float32
2333
- FID is 8.462559700012207
2334
- (512, 256, 256, 3)
2335
- Calc FID for CFG 2.75 and denoise_timesteps 64
2336
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2337
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2338
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2339
- DiT: Conditioning of shape (512, 768) dtype float32
2340
- FID is 8.41374683380127
2341
- (512, 256, 256, 3)
2342
- Calc FID for CFG 2.75 and denoise_timesteps 32
2343
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2344
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2345
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2346
- DiT: Conditioning of shape (512, 768) dtype float32
2347
- FID is 8.378111839294434
2348
- (512, 256, 256, 3)
2349
- Calc FID for CFG 2.75 and denoise_timesteps 16
2350
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2351
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2352
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2353
- DiT: Conditioning of shape (512, 768) dtype float32
2354
- FID is 8.747322082519531
2355
- (512, 256, 256, 3)
2356
- Calc FID for CFG 2.75 and denoise_timesteps 8
2357
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2358
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2359
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2360
- DiT: Conditioning of shape (512, 768) dtype float32
2361
- FID is 12.060856819152832
2362
- (512, 256, 256, 3)
2363
- Calc FID for CFG 2.75 and denoise_timesteps 4
2364
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2365
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2366
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2367
- DiT: Conditioning of shape (512, 768) dtype float32
2368
- FID is 44.55529022216797
2369
- (512, 256, 256, 3)
2370
- Calc FID for CFG 2.75 and denoise_timesteps 2
2371
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2372
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2373
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2374
- DiT: Conditioning of shape (512, 768) dtype float32
2375
- FID is 260.9886474609375
2376
- (512, 256, 256, 3)
2377
- Calc FID for CFG 2.75 and denoise_timesteps 1
2378
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2379
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2380
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2381
- DiT: Conditioning of shape (512, 768) dtype float32
2382
- FID is 234.50088500976562
2383
  (512, 256, 256, 3)
2384
  Calc FID for CFG 3.0 and denoise_timesteps 128
2385
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2386
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2387
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2388
  DiT: Conditioning of shape (512, 768) dtype float32
2389
- FID is 9.388092041015625
2390
- (512, 256, 256, 3)
2391
- Calc FID for CFG 3.0 and denoise_timesteps 64
2392
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2393
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2394
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2395
- DiT: Conditioning of shape (512, 768) dtype float32
2396
- FID is 9.323236465454102
2397
- (512, 256, 256, 3)
2398
- Calc FID for CFG 3.0 and denoise_timesteps 32
2399
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2400
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2401
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2402
- DiT: Conditioning of shape (512, 768) dtype float32
2403
- FID is 9.24086856842041
2404
- (512, 256, 256, 3)
2405
- Calc FID for CFG 3.0 and denoise_timesteps 16
2406
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2407
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2408
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2409
- DiT: Conditioning of shape (512, 768) dtype float32
2410
- FID is 9.349201202392578
2411
- (512, 256, 256, 3)
2412
- Calc FID for CFG 3.0 and denoise_timesteps 8
2413
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2414
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2415
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2416
- DiT: Conditioning of shape (512, 768) dtype float32
2417
- FID is 11.579026222229004
2418
- (512, 256, 256, 3)
2419
- Calc FID for CFG 3.0 and denoise_timesteps 4
2420
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2421
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2422
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2423
- DiT: Conditioning of shape (512, 768) dtype float32
2424
- FID is 38.7520751953125
2425
- (512, 256, 256, 3)
2426
- Calc FID for CFG 3.0 and denoise_timesteps 2
2427
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2428
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2429
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2430
- DiT: Conditioning of shape (512, 768) dtype float32
2431
- FID is 255.22378540039062
2432
- (512, 256, 256, 3)
2433
- Calc FID for CFG 3.0 and denoise_timesteps 1
2434
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2435
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2436
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2437
- DiT: Conditioning of shape (512, 768) dtype float32
2438
- FID is 232.8525390625
 
67
  Disc shape (1, 8, 8, 512)
68
  Disc shape (1, 4, 4, 512)
69
  Total num of Discriminator parameters: 23998017
70
+ Loaded checkpoint from 13891605 seconds ago.
71
  Loaded model with step 511001
72
  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
73
  β”‚ TPU 0 β”‚
 
768
  DiT: After patch embed, shape is (4, 256, 768) dtype bfloat16
769
  DiT: Patch Embed of shape (4, 256, 768) dtype bfloat16
770
  DiT: Conditioning of shape (1, 768) dtype float32
771
+ Loaded checkpoint from 149826 seconds ago.
772
 
773
  parameter shapes:
774
  ('PatchEmbed_0', 'Conv_0', 'kernel'): (2, 2, 4, 768)
 
1938
  Decoder layer (128, 128, 128, 512)
1939
  Decoder layer (128, 256, 256, 256)
1940
  Decoder layer (128, 256, 256, 128)
1941
+ FID is 36.43708801269531
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1942
  (512, 256, 256, 3)
1943
  Calc FID for CFG 1.25 and denoise_timesteps 128
1944
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1945
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1946
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1947
  DiT: Conditioning of shape (512, 768) dtype float32
1948
+ FID is 22.433515548706055
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1949
  (512, 256, 256, 3)
1950
  Calc FID for CFG 1.5 and denoise_timesteps 128
1951
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1952
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1953
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1954
  DiT: Conditioning of shape (512, 768) dtype float32
1955
+ FID is 14.05518913269043
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1956
  (512, 256, 256, 3)
1957
  Calc FID for CFG 1.75 and denoise_timesteps 128
1958
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1959
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1960
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1961
  DiT: Conditioning of shape (512, 768) dtype float32
1962
+ FID is 9.713033676147461
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1963
  (512, 256, 256, 3)
1964
  Calc FID for CFG 2.0 and denoise_timesteps 128
1965
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1966
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1967
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1968
  DiT: Conditioning of shape (512, 768) dtype float32
1969
+ FID is 7.850697994232178
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1970
  (512, 256, 256, 3)
1971
  Calc FID for CFG 2.25 and denoise_timesteps 128
1972
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1973
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1974
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1975
  DiT: Conditioning of shape (512, 768) dtype float32
1976
+ FID is 7.359264373779297
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1977
  (512, 256, 256, 3)
1978
  Calc FID for CFG 2.5 and denoise_timesteps 128
1979
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1980
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1981
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1982
  DiT: Conditioning of shape (512, 768) dtype float32
1983
+ FID is 7.6529860496521
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1984
  (512, 256, 256, 3)
1985
  Calc FID for CFG 2.75 and denoise_timesteps 128
1986
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1987
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1988
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1989
  DiT: Conditioning of shape (512, 768) dtype float32
1990
+ FID is 8.407513618469238
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1991
  (512, 256, 256, 3)
1992
  Calc FID for CFG 3.0 and denoise_timesteps 128
1993
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1994
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1995
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1996
  DiT: Conditioning of shape (512, 768) dtype float32
1997
+ FID is 9.344593048095703