AbstractPhil commited on
Commit
78219ce
Β·
verified Β·
1 Parent(s): 2370068

Create quad_vae_liminal_results.txt

Browse files
Files changed (1) hide show
  1. quad_vae_liminal_results.txt +164 -0
quad_vae_liminal_results.txt ADDED
@@ -0,0 +1,164 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ======================================================================
2
+ Multi-VAE Geometric Comparison Pipeline
3
+ ======================================================================
4
+ Downloading liminal.zip from AbstractPhil/grid-geometric-classifier-sliding-proto...
5
+ Found 957 images
6
+
7
+ ======================================================================
8
+ Step 1: Classifier
9
+ ======================================================================
10
+ Loaded 638,387 params
11
+
12
+ ======================================================================
13
+ Processing: SD 1.5
14
+ Model: stable-diffusion-v1-5/stable-diffusion-v1-5
15
+ ======================================================================
16
+ Latent: (4, 64, 64) mean=0.014 std=3.719 [-33.188, 40.219]
17
+ Groups: [[0], [2], [3], [1]]
18
+ Scales: [(4, 64, 64), (4, 32, 32), (4, 16, 16)]
19
+ SD 1.5 extract: 100%
20
+  30/30 [00:07<00:00,  5.30Γ—32/s]
21
+ SD 1.5: 105,895 annotations, conf=0.880
22
+ saddle 59,460 ( 56.1%)
23
+ pentachoron 37,281 ( 35.2%)
24
+ triangular_prism 3,009 ( 2.8%)
25
+ line_diag 1,913 ( 1.8%)
26
+ cone 1,689 ( 1.6%)
27
+
28
+ ======================================================================
29
+ Processing: SDXL
30
+ Model: madebyollin/sdxl-vae-fp16-fix
31
+ ======================================================================
32
+ config.json: 100%
33
+  631/631 [00:00<00:00, 142kB/s]
34
+ diffusion_pytorch_model.safetensors: 100%
35
+  335M/335M [00:01<00:00, 329MB/s]
36
+ Encoding: 100%
37
+  240/240 [01:12<00:00,  1.65batch/s]
38
+ Latent: (4, 128, 128) mean=0.605 std=5.059 [-18.938, 21.297]
39
+ Groups: [[0], [3], [2], [1]]
40
+ Scales: [(4, 64, 64), (4, 32, 32), (4, 16, 16)]
41
+ SDXL extract: 100%
42
+  30/30 [00:17<00:00,  2.13Γ—32/s]
43
+ SDXL: 483,789 annotations, conf=0.872
44
+ saddle 263,065 ( 54.4%)
45
+ pentachoron 137,450 ( 28.4%)
46
+ triangular_prism 26,717 ( 5.5%)
47
+ line_diag 21,664 ( 4.5%)
48
+ plane 14,704 ( 3.0%)
49
+
50
+ ======================================================================
51
+ Processing: Flux.1
52
+ Model: black-forest-labs/FLUX.1-dev
53
+ ======================================================================
54
+ Latent: (16, 128, 128) mean=-0.317 std=1.997 [-10.000, 7.500]
55
+ Groups: [[0, 6], [3, 2], [4, 14], [8, 5], [11, 13], [10, 7], [9, 15], [12, 1]]
56
+ Scales: [(16, 64, 64), (8, 32, 32), (8, 16, 16)]
57
+ Flux.1 extract: 100%
58
+  30/30 [00:44<00:00,  1.29s/Γ—32]
59
+ Flux.1: 1,456,910 annotations, conf=0.891
60
+ plane 527,329 ( 36.2%)
61
+ pentachoron 387,406 ( 26.6%)
62
+ square_xy 205,738 ( 14.1%)
63
+ saddle 190,141 ( 13.1%)
64
+ triangular_prism 90,982 ( 6.2%)
65
+
66
+ ======================================================================
67
+ Processing: Flux.2
68
+ Model: black-forest-labs/FLUX.2-dev
69
+ ======================================================================
70
+ Some weights of the model checkpoint at black-forest-labs/FLUX.2-dev were not used when initializing AutoencoderKL:
71
+ ['bn.num_batches_tracked, bn.running_mean, bn.running_var']
72
+ Latent: (32, 128, 128) mean=0.023 std=1.719 [-9.312, 9.812]
73
+ Groups: [[0, 20, 4, 26], [29, 15, 19, 2], [27, 3, 8, 9], [13, 14, 30, 24], [6, 22, 11, 18], [28, 12, 7, 1], [23, 17, 5, 10], [16, 31, 25, 21]]
74
+ Scales: [(16, 64, 64), (8, 32, 32), (8, 16, 16)]
75
+ Flux.2 extract: 100%
76
+  30/30 [00:58<00:00,  2.06s/Γ—32]
77
+ Flux.2: 1,932,126 annotations, conf=0.871
78
+ saddle 1,295,564 ( 67.1%)
79
+ pentachoron 453,701 ( 23.5%)
80
+ tetrahedron 71,594 ( 3.7%)
81
+ line_diag 49,691 ( 2.6%)
82
+ triangular_prism 38,569 ( 2.0%)
83
+
84
+ ======================================================================
85
+ COMPARATIVE ANALYSIS: VAE Geometric Structures
86
+ ======================================================================
87
+
88
+ ──────────────────────────────────────────────────────────────────────
89
+ VAE Latent Ann Ann/img Conf Classes
90
+ ──────────────────────────────────────────────────────────────────────
91
+ SD 1.5 4Γ—64Γ—64 105,895 111 0.880 18/38
92
+ SDXL 4Γ—128Γ—128 483,789 506 0.872 16/38
93
+ Flux.1 16Γ—128Γ—128 1,456,910 1522 0.891 24/38
94
+ Flux.2 32Γ—128Γ—128 1,932,126 2019 0.871 18/38
95
+
96
+ ──────────────────────────────────────────────────────────────────────
97
+ TOP-5 CLASSES PER VAE
98
+ ──────────────────────��───────────────────────────────────────────────
99
+ SD 1.5 saddle:56% pentachoron:35% triangular_prism:3% line_diag:2% cone:2%
100
+ SDXL saddle:54% pentachoron:28% triangular_prism:6% line_diag:4% plane:3%
101
+ Flux.1 plane:36% pentachoron:27% square_xy:14% saddle:13% triangular_prism:6%
102
+ Flux.2 saddle:67% pentachoron:23% tetrahedron:4% line_diag:3% triangular_prism:2%
103
+
104
+ ──────────────────────────────────────────────────────────────────────
105
+ CLASS PRESENCE ACROSS VAEs (>0.5% of annotations)
106
+ ──────────────────────────────────────────────────────────────────────
107
+ Class SD 1.5 SDXL Flux.1 Flux.2
108
+ saddle 56.1% 54.4% 13.1% 67.1%
109
+ pentachoron 35.2% 28.4% 26.6% 23.5%
110
+ plane trace 3.0% 36.2% 0.8%
111
+ square_xy trace 1.5% 14.1% trace
112
+ triangular_prism 2.8% 5.5% 6.2% 2.0%
113
+ tetrahedron 1.1% 1.7% 1.6% 3.7%
114
+ line_diag 1.8% 4.5% trace 2.6%
115
+ tube 0.6% trace 1.2% trace
116
+ cone 1.6% 0.7% trace trace
117
+
118
+ ──────────────────────────────────────────────────────────────────────
119
+ GEOMETRIC FINGERPRINT SIMILARITY (cosine between class distributions)
120
+ ──────────────────────────────────────────────────────────────────────
121
+ SD 1.5 SDXL Flux.1 Flux.2
122
+ SD 1.5 1.000 0.994 0.521 0.974
123
+ SDXL 0.994 1.000 0.536 0.986
124
+ Flux.1 0.521 0.536 1.000 0.442
125
+ Flux.2 0.974 0.986 0.442 1.000
126
+
127
+ ──────────────────────────────────────────────────────────────────────
128
+ DIMENSIONAL DISTRIBUTION
129
+ ──────────────────────────────────────────────────────────────────────
130
+ VAE 0D 1D 2D 3D Curved
131
+ SD 1.5 0.0% 0.8% 0.5% 98.7% 42.3%
132
+ SDXL 0.0% 1.8% 2.5% 95.7% 42.0%
133
+ Flux.1 0.0% 0.8% 45.7% 53.5% 44.7%
134
+ Flux.2 0.0% 0.8% 1.0% 98.3% 63.9%
135
+
136
+ ──────────────────────────────────────────────────────────────────────
137
+ CHANNEL GROUPS
138
+ ──────────────────────────────────────────────────────────────────────
139
+ SD 1.5 (4ch): [[0], [2], [3], [1]]
140
+ SDXL (4ch): [[0], [3], [2], [1]]
141
+ Flux.1 (16ch): [[0, 6], [3, 2], [4, 14], [8, 5], [11, 13], [10, 7], [9, 15], [12, 1]]
142
+ Flux.2 (32ch): [[0, 20, 4, 26], [29, 15, 19, 2], [27, 3, 8, 9], [13, 14, 30, 24], [6, 22, 11, 18], [28, 12, 7, 1], [23, 17, 5, 10], [16, 31, 25, 21]]
143
+
144
+ ──────────────────────────────────────────────────────────────────────
145
+ PER-IMAGE CROSS-VAE CONSISTENCY
146
+ ──────────────────────────────────────────────────────────────────────
147
+ Do images that are geometrically distinct in one VAE stay distinct in another?
148
+ SD 1.5 ↔ SDXL : mean=0.940 std=0.084 [0.550, 1.000]
149
+ β†’ Same geometric structure
150
+ SD 1.5 ↔ Flux.1 : mean=0.504 std=0.241 [0.007, 0.977]
151
+ β†’ Different structures
152
+ SD 1.5 ↔ Flux.2 : mean=0.943 std=0.081 [0.583, 1.000]
153
+ β†’ Same geometric structure
154
+ SDXL ↔ Flux.1 : mean=0.527 std=0.241 [0.012, 0.979]
155
+ β†’ Different structures
156
+ SDXL ↔ Flux.2 : mean=0.956 std=0.076 [0.489, 1.000]
157
+ β†’ Same geometric structure
158
+ Flux.1 ↔ Flux.2 : mean=0.461 std=0.251 [0.012, 0.973]
159
+ β†’ Different structures
160
+
161
+ Saved to /content/multi_vae_comparison.json
162
+ ======================================================================
163
+ βœ“ Multi-VAE comparison complete!
164
+ ======================================================================