AbstractPhil committed
Commit c952a70 · verified · 1 Parent(s): 5cbc675

Epoch 50: 55.79%

Files changed (1)
  1. README.md +14 -157
README.md CHANGED
@@ -5,7 +5,7 @@ tags:
  - cifar100
  - geometric-learning
  - fractal-encoding
- - trained
  - no-attention
  - no-cross-entropy
  datasets:
@@ -15,7 +15,7 @@ metrics:
  library_name: pytorch
  pipeline_tag: image-classification
  model-index:
- - name: geo-beatrix-step4-feats100
  results:
  - task:
  type: image-classification
@@ -30,13 +30,13 @@ model-index:
  verified: false
  ---

- # geo-beatrix-step4-feats100

  **Geometric Basin Classification for CIFAR-100**

- 🎉 **Training Complete** 🎉

- Final Status: Epoch 200/200

  ---

@@ -46,26 +46,9 @@ Final Status: Epoch 200/200
  |--------|-------|
  | **Best Test Accuracy** | **69.08%** |
  | **Best Epoch** | 190 |
- | **Current Train Accuracy** | 68.80% |
- | **Current Test Accuracy** | 60.81% |
- | **Current α (Cantor param)** | 0.4320 |
- | **Total Parameters** | 45,356,337 |
- | **Training Time** | 0:48:43 |
-
- ### All Training Runs
-
- | Timestamp | Status | Best Epoch | Test Acc | Train Acc | α |
- |-----------|--------|------------|----------|-----------|---|
- | `20251010_154433` | ✅ | 190 | **69.08%** | 68.80% | 0.4320 |
-
- ### Comparison to State-of-the-Art
-
- | Model | Accuracy | Status |
- |-------|----------|--------|
- | **geo-beatrix (this model)** | **69.08%** | ✅ Complete |
- | vit-beatrix-dualstream | 66.0% | Vision Transformer + Cross-Entropy |
-
- ✅ **geo-beatrix has surpassed all baselines!**

  ---

@@ -74,137 +57,11 @@ Final Status: Epoch 200/200
  - **Base**: ResNet-style with residual blocks
  - **Channels**: 64 → 128 → 256 → 512 → 1024
  - **Positional Encoding**: Devil's Staircase (Cantor function, 1883)
- - **PE Levels**: 4
- - **PE Features/Level**: 100
- - **Classification**: Geometric Basin Compatibility (NO cross-entropy)
- - **Attention Mechanisms**: NONE
-
- ---
-
- ## Training Configuration
-
- ```json
- {
-   "model_name": "geo-beatrix-step4-feats100",
-   "model_type": "geometric_basin_classifier",
-   "num_classes": 100,
-   "batch_size": 256,
-   "num_epochs": 200,
-   "base_learning_rate": 0.001,
-   "weight_decay": 0.05,
-   "warmup_epochs": 10,
-   "pe_levels": 4,
-   "pe_features_per_level": 100,
-   "dropout": 0.1,
-   "upload_every_n_epochs": 50,
-   "alphamix": {
-     "enabled": true,
-     "range": [
-       0.3,
-       0.7
-     ],
-     "spatial_ratio": 0.25,
-     "curriculum_start": 0.05,
-     "curriculum_end": 0.25
-   },
-   "architecture": "ResNet-style with Devil's Staircase PE",
-   "loss_function": "Geometric Basin Compatibility",
-   "cross_entropy": false,
-   "attention_mechanisms": false,
-   "timestamp": "20251010_154433"
- }
- ```
-
- ---
-
- ## Files Structure
-
- ```
- ├── model.pt (BEST overall model - easy access!)
- ├── model.safetensors (BEST overall model - easy access!)
- ├── best_model_info.json (which epoch/run this came from)
- ├── runs_history.json (all training runs and their results)
- ├── README.md
- ├── weights/geo-beatrix-step4-feats100/20251010_154433/
- │   ├── model.pt (best from this training run)
- │   ├── model.safetensors (best from this training run)
- │   ├── config.json
- │   ├── training_log.txt
- │   └── checkpoints/
- │       ├── checkpoint_epoch_10.safetensors
- │       ├── checkpoint_epoch_20.safetensors
- │       └── checkpoint_epoch_30.safetensors
- │       (snapshots every 50 epochs)
- └── runs/geo-beatrix-step4-feats100/20251010_154433/
-     ├── events.out.tfevents.* (TensorBoard logs)
-     └── metrics.csv (training metrics)
- ```
-
- **Note**: The root `model.pt` and `model.safetensors` always contain the best model across all training runs!
-
- ---
-
- ## Usage
-
- ```python
- from huggingface_hub import hf_hub_download
- import torch
-
- # EASIEST: Download BEST overall model from root (recommended!)
- from safetensors.torch import load_file
- model_path = hf_hub_download(
-     repo_id="AbstractPhil/geo-beatrix",
-     filename="model.safetensors"
- )
- state_dict = load_file(model_path)
- # model.load_state_dict(state_dict)
-
- # Check which epoch/run the best model came from
- info_path = hf_hub_download(
-     repo_id="AbstractPhil/geo-beatrix",
-     filename="best_model_info.json"
- )
- with open(info_path) as f:
-     best_info = json.load(f)
- print(f"Best model: epoch {best_info['epoch']}, {best_info['test_accuracy']:.2f}%")
-
- # Or download from specific training run
- model_path = hf_hub_download(
-     repo_id="AbstractPhil/geo-beatrix",
-     filename="weights/geo-beatrix-step4-feats100/20251010_154433/model.safetensors"
- )
-
- # Download specific epoch checkpoint
- epoch_checkpoint = hf_hub_download(
-     repo_id="AbstractPhil/geo-beatrix",
-     filename="weights/geo-beatrix-step4-feats100/20251010_154433/checkpoints/checkpoint_epoch_100.safetensors"
- )
- ```
-
- ---
-
- ## Training History
-
- ### Best Checkpoint
- - Epoch: 190
- - Train Acc: 64.35%
- - Test Acc: 60.00%
- - Alpha: 0.4318
- - Loss: 0.9990
-
- ### Latest 5 Epochs
-
- - **Epoch 196**: Train 64.06%, Test 0.00%, α=0.4320, Loss=0.9867
- - **Epoch 197**: Train 67.90%, Test 0.00%, α=0.4320, Loss=1.0384
- - **Epoch 198**: Train 64.27%, Test 0.00%, α=0.4320, Loss=0.9999
- - **Epoch 199**: Train 65.64%, Test 0.00%, α=0.4320, Loss=1.0067
- - **Epoch 200**: Train 68.80%, Test 60.81%, α=0.4320, Loss=1.0461
-
- ### Training Milestones
- - 🎯 **50% Accuracy** reached at epoch 35
- - 🎯 **60% Accuracy** reached at epoch 135
- - 📊 **α ≥ 0.40** reached at epoch 8
- - 📊 **α ≥ 0.44** (near triadic equilibrium) at epoch 11

  ---

@@ -214,7 +71,7 @@ epoch_checkpoint = hf_hub_download(
  ✅ **NO cross-entropy loss**
  ✅ **Fractal positional encoding** (Cantor function from 1883)
  ✅ **Geometric compatibility classification**
- ✅ **Ancient convolutions** (ResNet-style from 1990s)

  ---

 
  - cifar100
  - geometric-learning
  - fractal-encoding
+ - in-training
  - no-attention
  - no-cross-entropy
  datasets:
 
  library_name: pytorch
  pipeline_tag: image-classification
  model-index:
+ - name: geo-beatrix-fractal
  results:
  - task:
  type: image-classification
 
  verified: false
  ---

+ # geo-beatrix-fractal

  **Geometric Basin Classification for CIFAR-100**

+ 🚧 **Training in Progress** 🚧

+ Current Status: Epoch 50/200

  ---

 
  |--------|-------|
  | **Best Test Accuracy** | **69.08%** |
  | **Best Epoch** | 190 |
+ | **Current α (Cantor param)** | 0.4165 |
+ | **Total Parameters** | 45,161,489 |
+ | **Mixing Mode** | Fractal (triadic) |

  ---

 
  - **Base**: ResNet-style with residual blocks
  - **Channels**: 64 → 128 → 256 → 512 → 1024
  - **Positional Encoding**: Devil's Staircase (Cantor function, 1883)
+ - **PE Levels**: 20
+ - **PE Features/Level**: 4
+ - **Classification**: Geometric Basin Compatibility
+ - **Attention**: NONE
+ - **Cross-Entropy**: NONE

  ---

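The architecture hunk names the Devil's Staircase (Cantor function) as the positional encoding and changes `PE Levels` from 4 to 20. The commit does not show how the model turns this function into features, so the following is only a minimal sketch of the scalar Cantor function itself. It assumes a "level" corresponds to one ternary digit of the input; the name `cantor_function` and its `levels` argument are hypothetical and do not come from the repository.

```python
def cantor_function(x: float, levels: int = 20) -> float:
    """Approximate the Cantor function (Devil's Staircase) on [0, 1].

    `levels` is a hypothetical truncation depth: the number of ternary
    digits of x that are inspected before stopping.
    """
    if not 0.0 <= x <= 1.0:
        raise ValueError("x must lie in [0, 1]")
    if x == 1.0:
        return 1.0

    value = 0.0   # binary expansion accumulated so far
    scale = 0.5   # weight of the next binary digit
    for _ in range(levels):
        x *= 3.0
        digit = int(x)      # next ternary digit: 0, 1, or 2
        x -= digit
        if digit == 1:
            # x lies in a removed middle third, where the function is flat.
            return value + scale
        # Ternary digit 0 maps to binary 0; ternary digit 2 maps to binary 1.
        value += scale * (digit // 2)
        scale *= 0.5
    return value


if __name__ == "__main__":
    for x in (0.0, 0.25, 0.5, 1.0):
        print(x, cantor_function(x))
```

With more levels the truncation error shrinks by a factor of two per level, which may be one reason to trade features per level for depth, though the commit itself gives no rationale.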
 
  ✅ **NO cross-entropy loss**
  ✅ **Fractal positional encoding** (Cantor function from 1883)
  ✅ **Geometric compatibility classification**
+ ✅ **Triadic fractal mixing** (base-3 aligned)

  ---

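To make the size of the configuration change concrete, the figures in the removed and added lines can be compared directly. The sketch below copies the numbers from this diff. It assumes that the total positional-encoding width is `PE Levels` times `PE Features/Level`, which the README does not state, and it does not claim that the parameter difference comes from the positional encoding alone.

```python
# Values copied from the removed (-) and added (+) README lines in this commit.
old = {"pe_levels": 4, "pe_features_per_level": 100, "total_params": 45_356_337}
new = {"pe_levels": 20, "pe_features_per_level": 4, "total_params": 45_161_489}


def pe_width(cfg: dict) -> int:
    # Assumption: total PE width is levels * features per level.
    return cfg["pe_levels"] * cfg["pe_features_per_level"]


pe_width_old = pe_width(old)
pe_width_new = pe_width(new)
param_delta = old["total_params"] - new["total_params"]

print(f"PE width: {pe_width_old} -> {pe_width_new}")
print(f"Parameters removed: {param_delta:,}")
```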