File size: 5,699 Bytes
65038b2 66a1c55 65038b2 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 |
---
license: mit
---
This version essentially imploded when learning cutmix cifar100, and yet the geometry held the entire time through a cutmix gauntlet that could not be overcome.
I plan to run a few experiments on it with the geometric and simplex blocks frozen. As it stands, the geometric side is essentially a scaffold of cutmix potential,
however this scaffold is limited. It must be further trained, but this is a good basline test.
I will run 4 full trains to test the potential;
1. Frozen geometry, no augmentation. It's working, but overfitted by epoch 30 - which, took nearly 60 before. So that's interesting and yet overfitted.
2. Full augmentation no cutmix/mixup using cifar10 augs.
3. Unfrozen cross-attention only, full augs.
4. Unfrozen cross-attention only, no augs.
```
======================================================================
CHAOS-NATIVE DUAL-STREAM PROBE
======================================================================
Loading model...
Model loaded from: ./checkpoints_dualstream/20251008_163456_chaos_native/model_epoch_149.safetensors
Parameters: 41,132,926
Loading data...
Test samples: 10000
Probe samples: 5000
======================================================================
PROBE 1: Dual-Stream Analysis
======================================================================
[1.1] Extracting stream features...
Extracting: 0%| | 0/40 [00:00<?, ?it/s]
Model output keys: ['logits', 'visual_features', 'geometric_features', 'pe_features', 'cantor_measure', 'visual_stream', 'geometric_stream']
logits: torch.Size([128, 100])
visual_features: torch.Size([128, 512])
geometric_features: torch.Size([128, 256])
pe_features: torch.Size([65, 24])
cantor_measure: torch.Size([65])
visual_stream: torch.Size([128, 65, 512])
geometric_stream: torch.Size([128, 8, 256])
Extracting: 100%|ββββββββββ| 40/40 [00:02<00:00, 18.99it/s]
Visual features shape: (5000, 100)
Geometric features shape: (5000, 256)
[1.2] Analyzing stream dimensionality...
Visual stream: 1/100 dims for 90% variance
Geometric stream: 1/256 dims for 90% variance
Visual intrinsic dim: 1.04
Geometric intrinsic dim: 1.01
[1.3] Measuring stream independence...
Mean abs correlation: 0.2077
Max abs correlation: 0.9985
Interpretation: Independent
[1.4] Testing class separability per stream...
Visual stream probe: 0.0227
Geometric stream probe: 0.0227
Visual advantage: +0.0000
======================================================================
PROBE 2: Geometric Stream Health
======================================================================
[2.1] Collecting geometric health metrics...
Health metrics: 100%|ββββββββββ| 40/40 [00:02<00:00, 18.90it/s]
Token Diversity: 0.2820 Β± 0.0025
Feature Diversity: 1.3960 Β± 0.0029
Mean Norm: 22.2937 Β± 0.0464
Status: β Healthy
======================================================================
PROBE 3: Chaos Tolerance (CutMix Robustness)
======================================================================
[3.0] Testing alpha=0.0...
Accuracy: 0.0221
Geometric mean norm: 22.2951
[3.2] Testing alpha=0.2...
Accuracy: 0.0173
Geometric mean norm: 22.3495
[3.5] Testing alpha=0.5...
Accuracy: 0.0165
Geometric mean norm: 22.3517
[3.10] Testing alpha=1.0...
Accuracy: 0.0174
Geometric mean norm: 22.3580
[3.15] Testing alpha=1.5...
Accuracy: 0.0161
Geometric mean norm: 22.3592
[3.20] Testing alpha=2.0...
Accuracy: 0.0164
Geometric mean norm: 22.3624
Clean accuracy (Ξ±=0.0): 0.0221
Extreme chaos (Ξ±=2.0): 0.0164
Chaos tolerance: 74.21%
Interpretation: Good
======================================================================
PROBE 4: Clean vs Mixed Image Analysis
======================================================================
[4.1] Testing on clean images...
Accuracy: 0.0221
Mean confidence: 0.0120
[4.2] Testing on mixed images (Ξ±=1.0)...
Accuracy: 0.0160
Mean confidence: 0.0106
Performance gap: 0.0061
Confidence gap: 0.0014
Adaptation: Strong
======================================================================
PROBE 5: Geometric Stream Stability
======================================================================
[5.1] Testing Gaussian noise robustness...
Noise Ο=0.0: norm=22.2937
Noise Ο=0.1: norm=22.2945
Noise Ο=0.2: norm=22.2965
Noise Ο=0.3: norm=22.2948
[5.2] Testing CutMix stability...
Norm range across CutMix: 0.0704
Status: β Stable
======================================================================
Results saved to: probe_chaos_results/chaos_probe_results.json
======================================================================
======================================================================
Generating visualizations...
======================================================================
[Viz 1] Chaos tolerance curve...
[Viz 2] Stream correlation heatmap...
[Viz 3] Geometric stability...
Visualizations saved to: probe_chaos_results
======================================================================
CHAOS-NATIVE PROBE SUMMARY
======================================================================
[Dual-Stream Analysis]
Visual intrinsic dim: 1.04
Geometric intrinsic dim: 1.01
Stream independence: 0.2077
[Geometric Health]
Mean norm: 22.2937
Token diversity: 0.2820
[Chaos Tolerance]
Clean accuracy: 0.0221
Max chaos accuracy: 0.0164
Tolerance score: 74.21%
[Clean vs Mixed]
Clean accuracy: 0.0221
Mixed accuracy: 0.0160
Performance gap: 0.0061
[Geometric Stability]
Noise stable: True
CutMix stable: True
``` |