KublaiKhan1 commited on
Commit
b68014e
Β·
verified Β·
1 Parent(s): f5a5d00

Upload folder using huggingface_hub

Browse files
1e-5_kl_naive_globalscale_channelmean_sampling/log.txt CHANGED
@@ -67,7 +67,7 @@ Disc shape (1, 16, 16, 512)
67
  Disc shape (1, 8, 8, 512)
68
  Disc shape (1, 4, 4, 512)
69
  Total num of Discriminator parameters: 23998017
70
- Loaded checkpoint from 13345880 seconds ago.
71
  Loaded model with step 474001
72
  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
73
  β”‚ TPU 0 β”‚
@@ -768,7 +768,7 @@ DiT: Input of shape (4, 32, 32, 4) dtype float32
768
  DiT: After patch embed, shape is (4, 256, 768) dtype bfloat16
769
  DiT: Patch Embed of shape (4, 256, 768) dtype bfloat16
770
  DiT: Conditioning of shape (1, 768) dtype float32
771
- Loaded checkpoint from 5202 seconds ago.
772
 
773
  parameter shapes:
774
  ('PatchEmbed_0', 'Conv_0', 'kernel'): (2, 2, 4, 768)
@@ -1938,501 +1938,60 @@ Decoder layer (128, 64, 64, 512)
1938
  Decoder layer (128, 128, 128, 512)
1939
  Decoder layer (128, 256, 256, 256)
1940
  Decoder layer (128, 256, 256, 128)
1941
- FID is 36.60750961303711
1942
- (512, 256, 256, 3)
1943
- Calc FID for CFG 1.0 and denoise_timesteps 64
1944
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1945
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1946
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1947
- DiT: Conditioning of shape (512, 768) dtype float32
1948
- FID is 37.717193603515625
1949
- (512, 256, 256, 3)
1950
- Calc FID for CFG 1.0 and denoise_timesteps 32
1951
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1952
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1953
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1954
- DiT: Conditioning of shape (512, 768) dtype float32
1955
- FID is 40.62632369995117
1956
- (512, 256, 256, 3)
1957
- Calc FID for CFG 1.0 and denoise_timesteps 16
1958
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1959
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1960
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1961
- DiT: Conditioning of shape (512, 768) dtype float32
1962
- FID is 48.93124771118164
1963
- (512, 256, 256, 3)
1964
- Calc FID for CFG 1.0 and denoise_timesteps 8
1965
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1966
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1967
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1968
- DiT: Conditioning of shape (512, 768) dtype float32
1969
- FID is 73.71484375
1970
- (512, 256, 256, 3)
1971
- Calc FID for CFG 1.0 and denoise_timesteps 4
1972
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1973
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1974
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1975
- DiT: Conditioning of shape (512, 768) dtype float32
1976
- FID is 158.0609130859375
1977
- (512, 256, 256, 3)
1978
- Calc FID for CFG 1.0 and denoise_timesteps 2
1979
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1980
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1981
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1982
- DiT: Conditioning of shape (512, 768) dtype float32
1983
- FID is 313.73358154296875
1984
- (512, 256, 256, 3)
1985
- Calc FID for CFG 1.0 and denoise_timesteps 1
1986
- DiT: Input of shape (512, 32, 32, 4) dtype float32
1987
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1988
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1989
- DiT: Conditioning of shape (512, 768) dtype float32
1990
- FID is 286.5404357910156
1991
  (512, 256, 256, 3)
1992
  Calc FID for CFG 1.25 and denoise_timesteps 128
1993
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1994
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1995
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1996
  DiT: Conditioning of shape (512, 768) dtype float32
1997
- FID is 22.644939422607422
1998
- (512, 256, 256, 3)
1999
- Calc FID for CFG 1.25 and denoise_timesteps 64
2000
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2001
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2002
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2003
- DiT: Conditioning of shape (512, 768) dtype float32
2004
- FID is 23.565139770507812
2005
- (512, 256, 256, 3)
2006
- Calc FID for CFG 1.25 and denoise_timesteps 32
2007
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2008
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2009
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2010
- DiT: Conditioning of shape (512, 768) dtype float32
2011
- FID is 25.94041633605957
2012
- (512, 256, 256, 3)
2013
- Calc FID for CFG 1.25 and denoise_timesteps 16
2014
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2015
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2016
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2017
- DiT: Conditioning of shape (512, 768) dtype float32
2018
- FID is 32.80158996582031
2019
- (512, 256, 256, 3)
2020
- Calc FID for CFG 1.25 and denoise_timesteps 8
2021
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2022
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2023
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2024
- DiT: Conditioning of shape (512, 768) dtype float32
2025
- FID is 55.220787048339844
2026
- (512, 256, 256, 3)
2027
- Calc FID for CFG 1.25 and denoise_timesteps 4
2028
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2029
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2030
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2031
- DiT: Conditioning of shape (512, 768) dtype float32
2032
- FID is 133.41140747070312
2033
- (512, 256, 256, 3)
2034
- Calc FID for CFG 1.25 and denoise_timesteps 2
2035
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2036
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2037
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2038
- DiT: Conditioning of shape (512, 768) dtype float32
2039
- FID is 302.35345458984375
2040
- (512, 256, 256, 3)
2041
- Calc FID for CFG 1.25 and denoise_timesteps 1
2042
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2043
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2044
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2045
- DiT: Conditioning of shape (512, 768) dtype float32
2046
- FID is 275.65960693359375
2047
  (512, 256, 256, 3)
2048
  Calc FID for CFG 1.5 and denoise_timesteps 128
2049
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2050
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2051
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2052
  DiT: Conditioning of shape (512, 768) dtype float32
2053
- FID is 14.374984741210938
2054
- (512, 256, 256, 3)
2055
- Calc FID for CFG 1.5 and denoise_timesteps 64
2056
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2057
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2058
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2059
- DiT: Conditioning of shape (512, 768) dtype float32
2060
- FID is 15.053552627563477
2061
- (512, 256, 256, 3)
2062
- Calc FID for CFG 1.5 and denoise_timesteps 32
2063
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2064
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2065
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2066
- DiT: Conditioning of shape (512, 768) dtype float32
2067
- FID is 16.86142921447754
2068
- (512, 256, 256, 3)
2069
- Calc FID for CFG 1.5 and denoise_timesteps 16
2070
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2071
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2072
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2073
- DiT: Conditioning of shape (512, 768) dtype float32
2074
- FID is 22.118249893188477
2075
- (512, 256, 256, 3)
2076
- Calc FID for CFG 1.5 and denoise_timesteps 8
2077
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2078
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2079
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2080
- DiT: Conditioning of shape (512, 768) dtype float32
2081
- FID is 40.5958137512207
2082
- (512, 256, 256, 3)
2083
- Calc FID for CFG 1.5 and denoise_timesteps 4
2084
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2085
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2086
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2087
- DiT: Conditioning of shape (512, 768) dtype float32
2088
- FID is 111.90969848632812
2089
- (512, 256, 256, 3)
2090
- Calc FID for CFG 1.5 and denoise_timesteps 2
2091
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2092
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2093
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2094
- DiT: Conditioning of shape (512, 768) dtype float32
2095
- FID is 291.7471923828125
2096
- (512, 256, 256, 3)
2097
- Calc FID for CFG 1.5 and denoise_timesteps 1
2098
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2099
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2100
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2101
- DiT: Conditioning of shape (512, 768) dtype float32
2102
- FID is 268.52325439453125
2103
  (512, 256, 256, 3)
2104
  Calc FID for CFG 1.75 and denoise_timesteps 128
2105
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2106
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2107
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2108
  DiT: Conditioning of shape (512, 768) dtype float32
2109
- FID is 9.992362976074219
2110
- (512, 256, 256, 3)
2111
- Calc FID for CFG 1.75 and denoise_timesteps 64
2112
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2113
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2114
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2115
- DiT: Conditioning of shape (512, 768) dtype float32
2116
- FID is 10.469978332519531
2117
- (512, 256, 256, 3)
2118
- Calc FID for CFG 1.75 and denoise_timesteps 32
2119
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2120
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2121
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2122
- DiT: Conditioning of shape (512, 768) dtype float32
2123
- FID is 11.733261108398438
2124
- (512, 256, 256, 3)
2125
- Calc FID for CFG 1.75 and denoise_timesteps 16
2126
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2127
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2128
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2129
- DiT: Conditioning of shape (512, 768) dtype float32
2130
- FID is 15.620132446289062
2131
- (512, 256, 256, 3)
2132
- Calc FID for CFG 1.75 and denoise_timesteps 8
2133
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2134
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2135
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2136
- DiT: Conditioning of shape (512, 768) dtype float32
2137
- FID is 30.157955169677734
2138
- (512, 256, 256, 3)
2139
- Calc FID for CFG 1.75 and denoise_timesteps 4
2140
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2141
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2142
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2143
- DiT: Conditioning of shape (512, 768) dtype float32
2144
- FID is 93.65843200683594
2145
- (512, 256, 256, 3)
2146
- Calc FID for CFG 1.75 and denoise_timesteps 2
2147
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2148
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2149
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2150
- DiT: Conditioning of shape (512, 768) dtype float32
2151
- FID is 282.39471435546875
2152
- (512, 256, 256, 3)
2153
- Calc FID for CFG 1.75 and denoise_timesteps 1
2154
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2155
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2156
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2157
- DiT: Conditioning of shape (512, 768) dtype float32
2158
- FID is 263.8368835449219
2159
  (512, 256, 256, 3)
2160
  Calc FID for CFG 2.0 and denoise_timesteps 128
2161
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2162
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2163
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2164
  DiT: Conditioning of shape (512, 768) dtype float32
2165
- FID is 8.087844848632812
2166
- (512, 256, 256, 3)
2167
- Calc FID for CFG 2.0 and denoise_timesteps 64
2168
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2169
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2170
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2171
- DiT: Conditioning of shape (512, 768) dtype float32
2172
- FID is 8.410569190979004
2173
- (512, 256, 256, 3)
2174
- Calc FID for CFG 2.0 and denoise_timesteps 32
2175
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2176
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2177
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2178
- DiT: Conditioning of shape (512, 768) dtype float32
2179
- FID is 9.256352424621582
2180
- (512, 256, 256, 3)
2181
- Calc FID for CFG 2.0 and denoise_timesteps 16
2182
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2183
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2184
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2185
- DiT: Conditioning of shape (512, 768) dtype float32
2186
- FID is 11.975582122802734
2187
- (512, 256, 256, 3)
2188
- Calc FID for CFG 2.0 and denoise_timesteps 8
2189
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2190
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2191
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2192
- DiT: Conditioning of shape (512, 768) dtype float32
2193
- FID is 23.16156005859375
2194
- (512, 256, 256, 3)
2195
- Calc FID for CFG 2.0 and denoise_timesteps 4
2196
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2197
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2198
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2199
- DiT: Conditioning of shape (512, 768) dtype float32
2200
- FID is 78.73834228515625
2201
- (512, 256, 256, 3)
2202
- Calc FID for CFG 2.0 and denoise_timesteps 2
2203
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2204
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2205
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2206
- DiT: Conditioning of shape (512, 768) dtype float32
2207
- FID is 274.115966796875
2208
- (512, 256, 256, 3)
2209
- Calc FID for CFG 2.0 and denoise_timesteps 1
2210
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2211
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2212
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2213
- DiT: Conditioning of shape (512, 768) dtype float32
2214
- FID is 260.2587585449219
2215
  (512, 256, 256, 3)
2216
  Calc FID for CFG 2.25 and denoise_timesteps 128
2217
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2218
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2219
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2220
  DiT: Conditioning of shape (512, 768) dtype float32
2221
- FID is 7.667808532714844
2222
- (512, 256, 256, 3)
2223
- Calc FID for CFG 2.25 and denoise_timesteps 64
2224
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2225
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2226
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2227
- DiT: Conditioning of shape (512, 768) dtype float32
2228
- FID is 7.854188919067383
2229
- (512, 256, 256, 3)
2230
- Calc FID for CFG 2.25 and denoise_timesteps 32
2231
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2232
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2233
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2234
- DiT: Conditioning of shape (512, 768) dtype float32
2235
- FID is 8.360774993896484
2236
- (512, 256, 256, 3)
2237
- Calc FID for CFG 2.25 and denoise_timesteps 16
2238
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2239
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2240
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2241
- DiT: Conditioning of shape (512, 768) dtype float32
2242
- FID is 10.235084533691406
2243
- (512, 256, 256, 3)
2244
- Calc FID for CFG 2.25 and denoise_timesteps 8
2245
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2246
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2247
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2248
- DiT: Conditioning of shape (512, 768) dtype float32
2249
- FID is 18.630203247070312
2250
- (512, 256, 256, 3)
2251
- Calc FID for CFG 2.25 and denoise_timesteps 4
2252
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2253
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2254
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2255
- DiT: Conditioning of shape (512, 768) dtype float32
2256
- FID is 66.66618347167969
2257
- (512, 256, 256, 3)
2258
- Calc FID for CFG 2.25 and denoise_timesteps 2
2259
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2260
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2261
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2262
- DiT: Conditioning of shape (512, 768) dtype float32
2263
- FID is 266.68902587890625
2264
- (512, 256, 256, 3)
2265
- Calc FID for CFG 2.25 and denoise_timesteps 1
2266
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2267
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2268
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2269
- DiT: Conditioning of shape (512, 768) dtype float32
2270
- FID is 257.45001220703125
2271
  (512, 256, 256, 3)
2272
  Calc FID for CFG 2.5 and denoise_timesteps 128
2273
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2274
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2275
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2276
  DiT: Conditioning of shape (512, 768) dtype float32
2277
- FID is 8.030121803283691
2278
- (512, 256, 256, 3)
2279
- Calc FID for CFG 2.5 and denoise_timesteps 64
2280
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2281
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2282
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2283
- DiT: Conditioning of shape (512, 768) dtype float32
2284
- FID is 8.140653610229492
2285
- (512, 256, 256, 3)
2286
- Calc FID for CFG 2.5 and denoise_timesteps 32
2287
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2288
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2289
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2290
- DiT: Conditioning of shape (512, 768) dtype float32
2291
- FID is 8.436915397644043
2292
- (512, 256, 256, 3)
2293
- Calc FID for CFG 2.5 and denoise_timesteps 16
2294
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2295
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2296
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2297
- DiT: Conditioning of shape (512, 768) dtype float32
2298
- FID is 9.632453918457031
2299
- (512, 256, 256, 3)
2300
- Calc FID for CFG 2.5 and denoise_timesteps 8
2301
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2302
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2303
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2304
- DiT: Conditioning of shape (512, 768) dtype float32
2305
- FID is 15.819036483764648
2306
- (512, 256, 256, 3)
2307
- Calc FID for CFG 2.5 and denoise_timesteps 4
2308
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2309
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2310
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2311
- DiT: Conditioning of shape (512, 768) dtype float32
2312
- FID is 57.212120056152344
2313
- (512, 256, 256, 3)
2314
- Calc FID for CFG 2.5 and denoise_timesteps 2
2315
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2316
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2317
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2318
- DiT: Conditioning of shape (512, 768) dtype float32
2319
- FID is 260.07421875
2320
- (512, 256, 256, 3)
2321
- Calc FID for CFG 2.5 and denoise_timesteps 1
2322
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2323
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2324
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2325
- DiT: Conditioning of shape (512, 768) dtype float32
2326
- FID is 255.27415466308594
2327
  (512, 256, 256, 3)
2328
  Calc FID for CFG 2.75 and denoise_timesteps 128
2329
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2330
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2331
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2332
  DiT: Conditioning of shape (512, 768) dtype float32
2333
- FID is 8.7904691696167
2334
- (512, 256, 256, 3)
2335
- Calc FID for CFG 2.75 and denoise_timesteps 64
2336
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2337
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2338
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2339
- DiT: Conditioning of shape (512, 768) dtype float32
2340
- FID is 8.839826583862305
2341
- (512, 256, 256, 3)
2342
- Calc FID for CFG 2.75 and denoise_timesteps 32
2343
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2344
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2345
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2346
- DiT: Conditioning of shape (512, 768) dtype float32
2347
- FID is 8.988865852355957
2348
- (512, 256, 256, 3)
2349
- Calc FID for CFG 2.75 and denoise_timesteps 16
2350
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2351
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2352
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2353
- DiT: Conditioning of shape (512, 768) dtype float32
2354
- FID is 9.728584289550781
2355
- (512, 256, 256, 3)
2356
- Calc FID for CFG 2.75 and denoise_timesteps 8
2357
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2358
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2359
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2360
- DiT: Conditioning of shape (512, 768) dtype float32
2361
- FID is 14.18990707397461
2362
- (512, 256, 256, 3)
2363
- Calc FID for CFG 2.75 and denoise_timesteps 4
2364
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2365
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2366
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2367
- DiT: Conditioning of shape (512, 768) dtype float32
2368
- FID is 49.663368225097656
2369
- (512, 256, 256, 3)
2370
- Calc FID for CFG 2.75 and denoise_timesteps 2
2371
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2372
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2373
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2374
- DiT: Conditioning of shape (512, 768) dtype float32
2375
- FID is 253.92054748535156
2376
- (512, 256, 256, 3)
2377
- Calc FID for CFG 2.75 and denoise_timesteps 1
2378
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2379
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2380
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2381
- DiT: Conditioning of shape (512, 768) dtype float32
2382
- FID is 253.68734741210938
2383
  (512, 256, 256, 3)
2384
  Calc FID for CFG 3.0 and denoise_timesteps 128
2385
  DiT: Input of shape (512, 32, 32, 4) dtype float32
2386
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2387
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2388
  DiT: Conditioning of shape (512, 768) dtype float32
2389
- FID is 9.772045135498047
2390
- (512, 256, 256, 3)
2391
- Calc FID for CFG 3.0 and denoise_timesteps 64
2392
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2393
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2394
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2395
- DiT: Conditioning of shape (512, 768) dtype float32
2396
- FID is 9.767073631286621
2397
- (512, 256, 256, 3)
2398
- Calc FID for CFG 3.0 and denoise_timesteps 32
2399
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2400
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2401
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2402
- DiT: Conditioning of shape (512, 768) dtype float32
2403
- FID is 9.813874244689941
2404
- (512, 256, 256, 3)
2405
- Calc FID for CFG 3.0 and denoise_timesteps 16
2406
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2407
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2408
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2409
- DiT: Conditioning of shape (512, 768) dtype float32
2410
- FID is 10.203750610351562
2411
- (512, 256, 256, 3)
2412
- Calc FID for CFG 3.0 and denoise_timesteps 8
2413
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2414
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2415
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2416
- DiT: Conditioning of shape (512, 768) dtype float32
2417
- FID is 13.351149559020996
2418
- (512, 256, 256, 3)
2419
- Calc FID for CFG 3.0 and denoise_timesteps 4
2420
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2421
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2422
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2423
- DiT: Conditioning of shape (512, 768) dtype float32
2424
- FID is 43.611480712890625
2425
- (512, 256, 256, 3)
2426
- Calc FID for CFG 3.0 and denoise_timesteps 2
2427
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2428
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2429
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2430
- DiT: Conditioning of shape (512, 768) dtype float32
2431
- FID is 248.34854125976562
2432
- (512, 256, 256, 3)
2433
- Calc FID for CFG 3.0 and denoise_timesteps 1
2434
- DiT: Input of shape (512, 32, 32, 4) dtype float32
2435
- DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
2436
- DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
2437
- DiT: Conditioning of shape (512, 768) dtype float32
2438
- FID is 252.22958374023438
 
67
  Disc shape (1, 8, 8, 512)
68
  Disc shape (1, 4, 4, 512)
69
  Total num of Discriminator parameters: 23998017
70
+ Loaded checkpoint from 14090812 seconds ago.
71
  Loaded model with step 474001
72
  β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
73
  β”‚ TPU 0 β”‚
 
768
  DiT: After patch embed, shape is (4, 256, 768) dtype bfloat16
769
  DiT: Patch Embed of shape (4, 256, 768) dtype bfloat16
770
  DiT: Conditioning of shape (1, 768) dtype float32
771
+ Loaded checkpoint from 750135 seconds ago.
772
 
773
  parameter shapes:
774
  ('PatchEmbed_0', 'Conv_0', 'kernel'): (2, 2, 4, 768)
 
1938
  Decoder layer (128, 128, 128, 512)
1939
  Decoder layer (128, 256, 256, 256)
1940
  Decoder layer (128, 256, 256, 128)
1941
+ FID is 36.649375915527344
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1942
  (512, 256, 256, 3)
1943
  Calc FID for CFG 1.25 and denoise_timesteps 128
1944
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1945
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1946
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1947
  DiT: Conditioning of shape (512, 768) dtype float32
1948
+ FID is 22.582571029663086
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1949
  (512, 256, 256, 3)
1950
  Calc FID for CFG 1.5 and denoise_timesteps 128
1951
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1952
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1953
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1954
  DiT: Conditioning of shape (512, 768) dtype float32
1955
+ FID is 14.302921295166016
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1956
  (512, 256, 256, 3)
1957
  Calc FID for CFG 1.75 and denoise_timesteps 128
1958
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1959
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1960
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1961
  DiT: Conditioning of shape (512, 768) dtype float32
1962
+ FID is 9.92587661743164
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1963
  (512, 256, 256, 3)
1964
  Calc FID for CFG 2.0 and denoise_timesteps 128
1965
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1966
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1967
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1968
  DiT: Conditioning of shape (512, 768) dtype float32
1969
+ FID is 8.037654876708984
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1970
  (512, 256, 256, 3)
1971
  Calc FID for CFG 2.25 and denoise_timesteps 128
1972
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1973
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1974
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1975
  DiT: Conditioning of shape (512, 768) dtype float32
1976
+ FID is 7.633633613586426
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1977
  (512, 256, 256, 3)
1978
  Calc FID for CFG 2.5 and denoise_timesteps 128
1979
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1980
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1981
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1982
  DiT: Conditioning of shape (512, 768) dtype float32
1983
+ FID is 8.01058292388916
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1984
  (512, 256, 256, 3)
1985
  Calc FID for CFG 2.75 and denoise_timesteps 128
1986
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1987
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1988
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1989
  DiT: Conditioning of shape (512, 768) dtype float32
1990
+ FID is 8.780667304992676
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1991
  (512, 256, 256, 3)
1992
  Calc FID for CFG 3.0 and denoise_timesteps 128
1993
  DiT: Input of shape (512, 32, 32, 4) dtype float32
1994
  DiT: After patch embed, shape is (512, 256, 768) dtype bfloat16
1995
  DiT: Patch Embed of shape (512, 256, 768) dtype bfloat16
1996
  DiT: Conditioning of shape (512, 768) dtype float32
1997
+ FID is 9.77159309387207