Create quad_vae_liminal_results.txt
quad_vae_liminal_results.txt (ADDED, +164 -0)
@@ -0,0 +1,164 @@
======================================================================
Multi-VAE Geometric Comparison Pipeline
======================================================================
Downloading liminal.zip from AbstractPhil/grid-geometric-classifier-sliding-proto...
Found 957 images

======================================================================
Step 1: Classifier
======================================================================
Loaded 638,387 params

======================================================================
Processing: SD 1.5
Model: stable-diffusion-v1-5/stable-diffusion-v1-5
======================================================================
Latent: (4, 64, 64) mean=0.014 std=3.719 [-33.188, 40.219]
Groups: [[0], [2], [3], [1]]
Scales: [(4, 64, 64), (4, 32, 32), (4, 16, 16)]
SD 1.5 extract: 100%
 30/30 [00:07<00:00,  5.30×32/s]
SD 1.5: 105,895 annotations, conf=0.880
  saddle               59,460 ( 56.1%)
  pentachoron          37,281 ( 35.2%)
  triangular_prism      3,009 (  2.8%)
  line_diag             1,913 (  1.8%)
  cone                  1,689 (  1.6%)

======================================================================
Processing: SDXL
Model: madebyollin/sdxl-vae-fp16-fix
======================================================================
config.json: 100%
 631/631 [00:00<00:00, 142kB/s]
diffusion_pytorch_model.safetensors: 100%
 335M/335M [00:01<00:00, 329MB/s]
Encoding: 100%
 240/240 [01:12<00:00,  1.65batch/s]
Latent: (4, 128, 128) mean=0.605 std=5.059 [-18.938, 21.297]
Groups: [[0], [3], [2], [1]]
Scales: [(4, 64, 64), (4, 32, 32), (4, 16, 16)]
SDXL extract: 100%
 30/30 [00:17<00:00,  2.13×32/s]
SDXL: 483,789 annotations, conf=0.872
  saddle              263,065 ( 54.4%)
  pentachoron         137,450 ( 28.4%)
  triangular_prism     26,717 (  5.5%)
  line_diag            21,664 (  4.5%)
  plane                14,704 (  3.0%)

======================================================================
Processing: Flux.1
Model: black-forest-labs/FLUX.1-dev
======================================================================
Latent: (16, 128, 128) mean=-0.317 std=1.997 [-10.000, 7.500]
Groups: [[0, 6], [3, 2], [4, 14], [8, 5], [11, 13], [10, 7], [9, 15], [12, 1]]
Scales: [(16, 64, 64), (8, 32, 32), (8, 16, 16)]
Flux.1 extract: 100%
 30/30 [00:44<00:00,  1.29s/×32]
Flux.1: 1,456,910 annotations, conf=0.891
  plane               527,329 ( 36.2%)
  pentachoron         387,406 ( 26.6%)
  square_xy           205,738 ( 14.1%)
  saddle              190,141 ( 13.1%)
  triangular_prism     90,982 (  6.2%)

======================================================================
Processing: Flux.2
Model: black-forest-labs/FLUX.2-dev
======================================================================
Some weights of the model checkpoint at black-forest-labs/FLUX.2-dev were not used when initializing AutoencoderKL:
['bn.num_batches_tracked, bn.running_mean, bn.running_var']
Latent: (32, 128, 128) mean=0.023 std=1.719 [-9.312, 9.812]
Groups: [[0, 20, 4, 26], [29, 15, 19, 2], [27, 3, 8, 9], [13, 14, 30, 24], [6, 22, 11, 18], [28, 12, 7, 1], [23, 17, 5, 10], [16, 31, 25, 21]]
Scales: [(16, 64, 64), (8, 32, 32), (8, 16, 16)]
Flux.2 extract: 100%
 30/30 [00:58<00:00,  2.06s/×32]
Flux.2: 1,932,126 annotations, conf=0.871
  saddle            1,295,564 ( 67.1%)
  pentachoron         453,701 ( 23.5%)
  tetrahedron          71,594 (  3.7%)
  line_diag            49,691 (  2.6%)
  triangular_prism     38,569 (  2.0%)

======================================================================
COMPARATIVE ANALYSIS: VAE Geometric Structures
======================================================================

──────────────────────────────────────────────────────────────────────
VAE      Latent             Ann  Ann/img   Conf  Classes
──────────────────────────────────────────────────────────────────────
SD 1.5   4×64×64        105,895      111  0.880    18/38
SDXL     4×128×128      483,789      506  0.872    16/38
Flux.1   16×128×128   1,456,910     1522  0.891    24/38
Flux.2   32×128×128   1,932,126     2019  0.871    18/38

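The Ann/img column appears to be the total annotation count divided by the 957 images found at the start of the run, rounded to the nearest integer. The sketch below checks that reading against the table; the names `NUM_IMAGES`, `ANNOTATIONS` and `annotations_per_image` are illustrative and do not come from the pipeline itself.

```python
# Assumption: Ann/img = total annotations / number of images, rounded.
NUM_IMAGES = 957  # "Found 957 images" in the log

# Annotation totals copied from the summary table.
ANNOTATIONS = {
    "SD 1.5": 105_895,
    "SDXL": 483_789,
    "Flux.1": 1_456_910,
    "Flux.2": 1_932_126,
}


def annotations_per_image(total, num_images=NUM_IMAGES):
    """Return the rounded mean number of annotations per image."""
    return round(total / num_images)


if __name__ == "__main__":
    for name, total in ANNOTATIONS.items():
        print(f"{name:8s} {annotations_per_image(total):5d}")
```

Under this assumption the rounded values agree with the four Ann/img entries reported in the table.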
──────────────────────────────────────────────────────────────────────
TOP-5 CLASSES PER VAE
──────────────────────────────────────────────────────────────────────
SD 1.5   saddle:56%  pentachoron:35%  triangular_prism:3%  line_diag:2%  cone:2%
SDXL     saddle:54%  pentachoron:28%  triangular_prism:6%  line_diag:4%  plane:3%
Flux.1   plane:36%  pentachoron:27%  square_xy:14%  saddle:13%  triangular_prism:6%
Flux.2   saddle:67%  pentachoron:23%  tetrahedron:4%  line_diag:3%  triangular_prism:2%

──────────────────────────────────────────────────────────────────────
CLASS PRESENCE ACROSS VAEs (>0.5% of annotations)
──────────────────────────────────────────────────────────────────────
Class              SD 1.5    SDXL   Flux.1   Flux.2
saddle              56.1%   54.4%    13.1%    67.1%
pentachoron         35.2%   28.4%    26.6%    23.5%
plane               trace    3.0%    36.2%     0.8%
square_xy           trace    1.5%    14.1%    trace
triangular_prism     2.8%    5.5%     6.2%     2.0%
tetrahedron          1.1%    1.7%     1.6%     3.7%
line_diag            1.8%    4.5%    trace     2.6%
tube                 0.6%   trace     1.2%    trace
cone                 1.6%    0.7%    trace    trace

──────────────────────────────────────────────────────────────────────
GEOMETRIC FINGERPRINT SIMILARITY (cosine between class distributions)
──────────────────────────────────────────────────────────────────────
          SD 1.5    SDXL  Flux.1  Flux.2
SD 1.5     1.000   0.994   0.521   0.974
SDXL       0.994   1.000   0.536   0.986
Flux.1     0.521   0.536   1.000   0.442
Flux.2     0.974   0.986   0.442   1.000

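As a rough illustration of how a cosine similarity between class distributions can be computed, the sketch below rebuilds approximate fingerprints from the class presence table and compares them. It is not the pipeline's own code: the pipeline presumably uses all 38 classes, whereas this uses only the nine listed classes and treats every "trace" entry as 0.0, so the values only approximate the matrix. The names `FINGERPRINTS` and `cosine` are illustrative.

```python
import math

# Order: saddle, pentachoron, plane, square_xy, triangular_prism,
#        tetrahedron, line_diag, tube, cone.
# Percentages copied from the class presence table.
# Assumption: "trace" entries are replaced by 0.0.
FINGERPRINTS = {
    "SD 1.5": [56.1, 35.2, 0.0, 0.0, 2.8, 1.1, 1.8, 0.6, 1.6],
    "SDXL":   [54.4, 28.4, 3.0, 1.5, 5.5, 1.7, 4.5, 0.0, 0.7],
    "Flux.1": [13.1, 26.6, 36.2, 14.1, 6.2, 1.6, 0.0, 1.2, 0.0],
    "Flux.2": [67.1, 23.5, 0.8, 0.0, 2.0, 3.7, 2.6, 0.0, 0.0],
}


def cosine(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    names = list(FINGERPRINTS)
    for p in names:
        row = "  ".join(
            f"{cosine(FINGERPRINTS[p], FINGERPRINTS[q]):.3f}" for q in names
        )
        print(f"{p:8s} {row}")
```

Even with the truncated nine-class vectors, the pairwise values land within about 0.01 of the reported matrix, which suggests that the two dominant classes (saddle and pentachoron) and the plane share in Flux.1 drive most of the similarity structure.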
──────────────────────────────────────────────────────────────────────
DIMENSIONAL DISTRIBUTION
──────────────────────────────────────────────────────────────────────
VAE          0D      1D      2D      3D   Curved
SD 1.5     0.0%    0.8%    0.5%   98.7%    42.3%
SDXL       0.0%    1.8%    2.5%   95.7%    42.0%
Flux.1     0.0%    0.8%   45.7%   53.5%    44.7%
Flux.2     0.0%    0.8%    1.0%   98.3%    63.9%

──────────────────────────────────────────────────────────────────────
CHANNEL GROUPS
──────────────────────────────────────────────────────────────────────
SD 1.5 (4ch): [[0], [2], [3], [1]]
SDXL (4ch): [[0], [3], [2], [1]]
Flux.1 (16ch): [[0, 6], [3, 2], [4, 14], [8, 5], [11, 13], [10, 7], [9, 15], [12, 1]]
Flux.2 (32ch): [[0, 20, 4, 26], [29, 15, 19, 2], [27, 3, 8, 9], [13, 14, 30, 24], [6, 22, 11, 18], [28, 12, 7, 1], [23, 17, 5, 10], [16, 31, 25, 21]]

──────────────────────────────────────────────────────────────────────
PER-IMAGE CROSS-VAE CONSISTENCY
──────────────────────────────────────────────────────────────────────
Do images that are geometrically distinct in one VAE stay distinct in another?
SD 1.5 ↔ SDXL   : mean=0.940 std=0.084 [0.550, 1.000]
  → Same geometric structure
SD 1.5 ↔ Flux.1 : mean=0.504 std=0.241 [0.007, 0.977]
  → Different structures
SD 1.5 ↔ Flux.2 : mean=0.943 std=0.081 [0.583, 1.000]
  → Same geometric structure
SDXL ↔ Flux.1   : mean=0.527 std=0.241 [0.012, 0.979]
  → Different structures
SDXL ↔ Flux.2   : mean=0.956 std=0.076 [0.489, 1.000]
  → Same geometric structure
Flux.1 ↔ Flux.2 : mean=0.461 std=0.251 [0.012, 0.973]
  → Different structures

Saved to /content/multi_vae_comparison.json
======================================================================
✓ Multi-VAE comparison complete!
======================================================================