AbstractPhil commited on
Commit
d917852
Β·
verified Β·
1 Parent(s): c674ac6

Create deep_analysis_output.txt

Browse files
Files changed (1) hide show
  1. deep_analysis_output.txt +252 -0
deep_analysis_output.txt ADDED
@@ -0,0 +1,252 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ================================================================================
2
+ FLOW MATCH RELAY β€” FULL ANALYSIS TOOLKIT
3
+ Device: cuda
4
+ ================================================================================
5
+ Loading weights: 100%
6
+  169/169 [00:00<00:00, 4115.48it/s, Materializing param=unet.time_emb.3.weight]
7
+ Params: 6,746,403 (relay: 76,384, 1.1%)
8
+ Relay modules: 2
9
+
10
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
11
+ TEST 1: Relay Diagnostics β€” Drift, Gates, Anchor Geometry
12
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
13
+
14
+ unet.mid_block1.relay:
15
+ Patches: 16, Anchors/patch: 16, Patch dim: 16
16
+ Drift (rad): mean=0.096787 std=0.031425 min=0.025191 max=0.204950
17
+ Drift (deg): mean=5.55Β° max=11.74Β°
18
+ Gates: mean=0.0594 std=0.0017 min=0.0560 max=0.0619
19
+ Patch 0: anchor_cos mean=0.0443 max=0.6491 min=-0.5109
20
+ Patch 1: anchor_cos mean=0.0329 max=0.6991 min=-0.6696
21
+ Patch 2: anchor_cos mean=-0.0137 max=0.5337 min=-0.6256
22
+ Patch 3: anchor_cos mean=0.0025 max=0.5802 min=-0.5198
23
+ Near 0.29154: 0.0% of anchors within Β±0.05
24
+ Per-patch mean drift:
25
+ Patch 0: 0.083389 rad (4.78Β°)
26
+ Patch 1: 0.105304 rad (6.03Β°)
27
+ Patch 2: 0.084447 rad (4.84Β°)
28
+ Patch 3: 0.115432 rad (6.61Β°)
29
+ Patch 4: 0.103072 rad (5.91Β°)
30
+ Patch 5: 0.053514 rad (3.07Β°)
31
+ Patch 6: 0.116086 rad (6.65Β°)
32
+ Patch 7: 0.074704 rad (4.28Β°)
33
+ Patch 8: 0.119158 rad (6.83Β°)
34
+ Patch 9: 0.134659 rad (7.72Β°)
35
+ Patch 10: 0.112943 rad (6.47Β°)
36
+ Patch 11: 0.076762 rad (4.40Β°)
37
+ Patch 12: 0.114448 rad (6.56Β°)
38
+ Patch 13: 0.109745 rad (6.29Β°)
39
+ Patch 14: 0.056048 rad (3.21Β°)
40
+ Patch 15: 0.088878 rad (5.09Β°)
41
+
42
+ unet.mid_block2.relay:
43
+ Patches: 16, Anchors/patch: 16, Patch dim: 16
44
+ Drift (rad): mean=0.116423 std=0.031319 min=0.045233 max=0.246318
45
+ Drift (deg): mean=6.67Β° max=14.11Β°
46
+ Gates: mean=0.0658 std=0.0019 min=0.0627 max=0.0702
47
+ Patch 0: anchor_cos mean=-0.0104 max=0.5409 min=-0.5740
48
+ Patch 1: anchor_cos mean=0.0241 max=0.5879 min=-0.5389
49
+ Patch 2: anchor_cos mean=0.0198 max=0.6510 min=-0.5917
50
+ Patch 3: anchor_cos mean=0.0060 max=0.5366 min=-0.7307
51
+ Near 0.29154: 0.4% of anchors within Β±0.05
52
+ Per-patch mean drift:
53
+ Patch 0: 0.119493 rad (6.85Β°)
54
+ Patch 1: 0.139048 rad (7.97Β°)
55
+ Patch 2: 0.096759 rad (5.54Β°)
56
+ Patch 3: 0.159487 rad (9.14Β°)
57
+ Patch 4: 0.123855 rad (7.10Β°)
58
+ Patch 5: 0.094995 rad (5.44Β°)
59
+ Patch 6: 0.129639 rad (7.43Β°)
60
+ Patch 7: 0.077257 rad (4.43Β°)
61
+ Patch 8: 0.127187 rad (7.29Β°)
62
+ Patch 9: 0.114844 rad (6.58Β°)
63
+ Patch 10: 0.118358 rad (6.78Β°)
64
+ Patch 11: 0.097721 rad (5.60Β°)
65
+ Patch 12: 0.128776 rad (7.38Β°)
66
+ Patch 13: 0.127976 rad (7.33Β°)
67
+ Patch 14: 0.086075 rad (4.93Β°)
68
+ Patch 15: 0.121296 rad (6.95Β°)
69
+
70
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
71
+ TEST 2: Bottleneck Feature Geometry β€” CV at the relay point
72
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
73
+
74
+ CV of bottleneck features at different timesteps:
75
+ t module CV eff_d norm
76
+ 0.00 unet.mid_block1.relay 0.5503 34.5 51.61
77
+ 0.00 unet.mid_block1 0.5316 34.5 51.61
78
+ 0.00 unet.mid_attn 0.4850 38.8 51.91
79
+ 0.00 unet.mid_block2.relay 0.5030 40.8 63.07
80
+ 0.00 unet.mid_block2 0.4623 40.8 63.07
81
+ 0.25 unet.mid_block1.relay 0.4621 43.1 52.02
82
+ 0.25 unet.mid_block1 0.4761 43.1 52.02
83
+ 0.25 unet.mid_attn 0.4429 47.2 52.35
84
+ 0.25 unet.mid_block2.relay 0.4353 48.7 63.47
85
+ 0.25 unet.mid_block2 0.4243 48.7 63.47
86
+ 0.50 unet.mid_block1.relay 0.4567 41.3 51.88
87
+ 0.50 unet.mid_block1 0.4775 41.3 51.88
88
+ 0.50 unet.mid_attn 0.4195 45.0 52.22
89
+ 0.50 unet.mid_block2.relay 0.4303 46.3 63.28
90
+ 0.50 unet.mid_block2 0.4266 46.3 63.28
91
+ 0.75 unet.mid_block1.relay 0.4972 35.2 51.85
92
+ 0.75 unet.mid_block1 0.5265 35.2 51.85
93
+ 0.75 unet.mid_attn 0.4821 39.5 52.07
94
+ 0.75 unet.mid_block2.relay 0.4743 40.7 63.12
95
+ 0.75 unet.mid_block2 0.4939 40.7 63.12
96
+ 1.00 unet.mid_block1.relay 0.5709 24.0 58.44
97
+ 1.00 unet.mid_block1 0.5983 24.0 58.44
98
+ 1.00 unet.mid_attn 0.5804 24.8 58.74
99
+ 1.00 unet.mid_block2.relay 0.6099 25.0 70.53
100
+ 1.00 unet.mid_block2 0.6065 25.0 70.53
101
+
102
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
103
+ TEST 3: Per-Class Anchor Utilization
104
+ Which anchors activate for each class?
105
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
106
+
107
+ Nearest anchor distribution per class (Patch 0):
108
+ class 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
109
+ plane 0% 0% 0% 0% 0% 0% 0% 22%β–ˆ 0% 0% 0% 0% 0% 0% 77%β–ˆ 0%
110
+ auto 0% 0% 0% 0% 0% 0% 0% 20%β–ˆ 0% 0% 0% 0% 0% 0% 80%β–ˆ 0%
111
+ bird 0% 0% 1% 0% 0% 0% 0% 42%β–ˆ 0% 0% 0% 0% 0% 0% 57%β–ˆ 0%
112
+ cat 0% 0% 0% 0% 0% 0% 0% 40%β–ˆ 0% 0% 0% 0% 0% 0% 60%β–ˆ 0%
113
+ deer 0% 0% 1% 0% 0% 0% 0% 58%β–ˆ 0% 0% 0% 0% 0% 0% 41%β–ˆ 0%
114
+ dog 0% 0% 0% 0% 0% 0% 0% 39%β–ˆ 0% 0% 0% 0% 0% 0% 61%β–ˆ 0%
115
+ frog 0% 0% 0% 0% 0% 0% 0% 58%β–ˆ 0% 0% 0% 0% 0% 0% 42%β–ˆ 0%
116
+ horse 0% 0% 0% 0% 0% 0% 0% 57%β–ˆ 0% 0% 0% 0% 0% 0% 43%β–ˆ 0%
117
+ ship 0% 0% 0% 0% 0% 0% 0% 2% 0% 0% 0% 0% 0% 0% 98%β–ˆ 0%
118
+ truck 0% 0% 0% 0% 0% 0% 0% 54%β–ˆ 0% 0% 0% 0% 0% 0% 46%β–ˆ 0%
119
+
120
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
121
+ TEST 4: Gate Dynamics β€” do relay gates respond to timestep?
122
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
123
+ Note: gates are learned parameters, not t-dependent.
124
+ Measuring relay output magnitude at different t instead.
125
+
126
+ t relay_Ξ”_norm relay_Ξ”_cos input_norm output_norm
127
+ 0.00 116.3409 0.01136649 321.42 423.60
128
+ 0.10 116.6277 0.01213849 323.16 424.56
129
+ 0.25 116.7582 0.01237500 329.56 430.23
130
+ 0.50 117.6399 0.01344323 331.23 431.26
131
+ 0.75 118.0994 0.01383245 334.28 433.97
132
+ 0.90 118.1683 0.01333457 350.58 449.37
133
+ 1.00 119.9183 0.01262599 384.39 482.87
134
+
135
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
136
+ TEST 5: Generation Quality β€” Per-Class Diversity
137
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
138
+ class intra_cos intra_std CV norm
139
+ plane 0.8748 0.0821 0.6407 33.17
140
+ auto 0.8073 0.0672 0.3739 27.07
141
+ bird 0.8779 0.0694 0.6898 27.62
142
+ cat 0.7880 0.0993 0.5233 26.13
143
+ deer 0.8695 0.0630 0.5286 26.79
144
+ dog 0.8332 0.0703 0.4365 27.46
145
+ frog 0.8332 0.0711 0.5007 25.48
146
+ horse 0.8313 0.0810 0.4687 29.01
147
+ ship 0.8920 0.0519 0.5069 32.44
148
+ truck 0.8311 0.0585 0.3973 29.58
149
+
150
+ βœ“ Saved per-class grids to analysis/
151
+
152
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
153
+ TEST 6: Velocity Field β€” how does v_pred behave across t?
154
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
155
+ t v_norm v_std vΒ·target v_cos_t
156
+ 0.05 53.05 0.9593 0.8501 0.3426
157
+ 0.10 55.90 1.0099 0.8958 0.2447
158
+ 0.25 58.23 1.0513 0.9352 0.1561
159
+ 0.50 58.96 1.0641 0.9486 0.1258
160
+ 0.75 58.85 1.0615 0.9411 0.1444
161
+ 0.90 57.49 1.0360 0.9244 0.1876
162
+ 0.95 56.50 1.0187 0.9126 0.2168
163
+
164
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
165
+ TEST 7: Ablation β€” Relay ON vs OFF during generation
166
+ Disable relay gates, measure generation difference
167
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
168
+ Relay ON β€” mean pixel: 0.4528 std: 0.2227
169
+ Relay OFF β€” mean pixel: 0.4524 std: 0.2232
170
+ Pixel diff: 0.002449
171
+ Cosine sim: 0.999977
172
+ Max pixel Ξ”: 0.074433
173
+ βœ“ Saved analysis/relay_ablation.png (top=ON, bottom=OFF)
174
+
175
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
176
+ TEST 8: Anchor Constellation Structure
177
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
178
+
179
+ unet.mid_block1.relay:
180
+ Home↔Current cos: mean=0.994831 min=0.979071
181
+ Patch 0 anchor spread: mean_cos=0.0443 max_cos=0.6491 min_cos=-0.5109
182
+ Patch 1 anchor spread: mean_cos=0.0329 max_cos=0.6991 min_cos=-0.6696
183
+ Patch 2 anchor spread: mean_cos=-0.0137 max_cos=0.5337 min_cos=-0.6256
184
+ Patch 3 anchor spread: mean_cos=0.0025 max_cos=0.5802 min_cos=-0.5198
185
+ Patch 0 anchor eff_dim: 11.3 / 16
186
+ Patch 1 anchor eff_dim: 11.9 / 16
187
+ Patch 2 anchor eff_dim: 11.7 / 16
188
+ Patch 3 anchor eff_dim: 11.5 / 16
189
+
190
+ unet.mid_block2.relay:
191
+ Home↔Current cos: mean=0.992746 min=0.969817
192
+ Patch 0 anchor spread: mean_cos=-0.0104 max_cos=0.5409 min_cos=-0.5740
193
+ Patch 1 anchor spread: mean_cos=0.0241 max_cos=0.5879 min_cos=-0.5389
194
+ Patch 2 anchor spread: mean_cos=0.0198 max_cos=0.6510 min_cos=-0.5917
195
+ Patch 3 anchor spread: mean_cos=0.0060 max_cos=0.5366 min_cos=-0.7307
196
+ Patch 0 anchor eff_dim: 11.8 / 16
197
+ Patch 1 anchor eff_dim: 11.9 / 16
198
+ Patch 2 anchor eff_dim: 12.0 / 16
199
+ Patch 3 anchor eff_dim: 11.4 / 16
200
+
201
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
202
+ TEST 9: Sampling Trajectory β€” CV through ODE steps
203
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
204
+ step t x_norm x_std CV_pixel
205
+ 0 1.00 54.32 0.9801 0.0110
206
+ 1 0.98 53.21 0.9602 0.0111
207
+ 5 0.90 48.88 0.8819 0.0123
208
+ 10 0.80 43.61 0.7869 0.0112
209
+ 20 0.60 33.95 0.6123 0.0241
210
+ 30 0.40 26.37 0.4764 0.0707
211
+ 40 0.20 22.88 0.4184 0.1773
212
+ 49 0.02 24.85 0.4587 0.2497
213
+
214
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
215
+ TEST 10: Inter-Class vs Intra-Class Separation
216
+ ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
217
+ Inter-class cosine similarity matrix:
218
+ plan auto bird cat deer dog frog hors ship truc
219
+ plane 1.0 0.99 0.99 0.98 0.98 0.97 0.98 0.99 1.00 0.98
220
+ auto 0.99 1.0 0.99 0.98 0.98 0.97 0.98 0.99 0.99 0.99
221
+ bird 0.99 0.99 1.0 0.99 0.99 0.99 0.99 1.00 0.99 0.98
222
+ cat 0.98 0.98 0.99 1.0 0.99 1.00 0.99 0.99 0.97 0.96
223
+ deer 0.98 0.98 0.99 0.99 1.0 0.99 1.00 0.99 0.97 0.96
224
+ dog 0.97 0.97 0.99 1.00 0.99 1.0 0.99 0.98 0.96 0.95
225
+ frog 0.98 0.98 0.99 0.99 1.00 0.99 1.0 0.99 0.97 0.97
226
+ horse 0.99 0.99 1.00 0.99 0.99 0.98 0.99 1.0 0.98 0.98
227
+ ship 1.00 0.99 0.99 0.97 0.97 0.96 0.97 0.98 1.0 0.99
228
+ truck 0.98 0.99 0.98 0.96 0.96 0.95 0.97 0.98 0.99 1.0
229
+
230
+ Intra-class cos: 0.8438 Β± 0.0318
231
+ Inter-class cos: 0.8305 Β± 0.0230
232
+ Separation ratio: 1.02Γ—
233
+
234
+ ================================================================================
235
+ ANALYSIS COMPLETE
236
+ ================================================================================
237
+
238
+ Files saved to analysis/:
239
+ - class_*.png: per-class generated samples
240
+ - all_classes.png: 4 samples per class, 10 columns
241
+ - relay_ablation.png: relay ON (top) vs OFF (bottom)
242
+
243
+ Key metrics to look for:
244
+ 1. Anchor drift β†’ did any converge near 0.29154?
245
+ 2. Gate values β†’ did they learn to open from init (0.047)?
246
+ 3. Per-class anchor utilization β†’ class-specific routing?
247
+ 4. Relay ablation β†’ does turning off the relay change generation?
248
+ 5. Intra/inter-class ratio β†’ > 1.0 means classes are separable
249
+ 6. Velocity cosine β†’ higher = better flow matching
250
+ 7. CV through ODE β†’ how does geometry evolve during generation?
251
+
252
+ ================================================================================