AbstractPhil commited on
Commit
9dd3adb
Β·
verified Β·
1 Parent(s): 16c09b5

Epoch 50: 39.68%

Browse files
Files changed (1) hide show
  1. README.md +28 -28
README.md CHANGED
@@ -5,7 +5,7 @@ tags:
5
  - cifar100
6
  - geometric-learning
7
  - fractal-encoding
8
- - trained
9
  - no-attention
10
  - no-cross-entropy
11
  datasets:
@@ -34,9 +34,9 @@ model-index:
34
 
35
  **Geometric Basin Classification for CIFAR-100**
36
 
37
- πŸŽ‰ **Training Complete** πŸŽ‰
38
 
39
- Final Status: Epoch 200/200
40
 
41
  ---
42
 
@@ -46,11 +46,11 @@ Final Status: Epoch 200/200
46
  |--------|-------|
47
  | **Best Test Accuracy** | **56.12%** |
48
  | **Best Epoch** | 160 |
49
- | **Current Train Accuracy** | 53.80% |
50
- | **Current Test Accuracy** | 39.51% |
51
- | **Current Ξ± (Cantor param)** | 0.4377 |
52
  | **Total Parameters** | 28,561,101 |
53
- | **Training Time** | 0:21:41 |
54
 
55
  ### All Training Runs
56
 
@@ -66,6 +66,7 @@ Final Status: Epoch 200/200
66
  | `20251011_023128` | βœ… | 160 | **56.12%** | 54.65% | 0.5005 |
67
  | `20251011_025919` | βœ… | 160 | **56.12%** | 57.80% | 0.4994 |
68
  | `20251011_032343` | βœ… | 160 | **56.12%** | 53.80% | 0.4377 |
 
69
  | `20251010_200842` | βœ… | 180 | **53.61%** | 67.53% | 0.4442 |
70
  | `20251010_185133` | βœ… | 200 | **52.97%** | 69.87% | 0.4452 |
71
 
@@ -73,7 +74,7 @@ Final Status: Epoch 200/200
73
 
74
  | Model | Accuracy | Status |
75
  |-------|----------|--------|
76
- | **geo-beatrix (this model)** | **56.12%** | βœ… Complete |
77
  | geo-beatrix (50M params) | 69.0% | Geometric Basin CONV architecture |
78
 
79
  🎯 **Current target**: Beat geo-beatrix (69.0%) - Currently -12.88%
@@ -90,7 +91,7 @@ Final Status: Epoch 200/200
90
  - **PE Features/Level**: 1000
91
  - **Classification**: Geometric Basin Compatibility (NO cross-entropy)
92
  - **Attention Mechanisms**: NONE
93
- - **Mixing**: Fractal (triadic multi-patch)
94
 
95
  ---
96
 
@@ -119,12 +120,12 @@ Final Status: Epoch 200/200
119
  },
120
  "alphamix": {
121
  "enabled": true,
122
- "fractal_mode": true,
123
  "range": [
124
  0.3,
125
  0.7
126
  ],
127
- "spatial_ratio": 0.25,
128
  "curriculum_start": 0.0,
129
  "curriculum_end": 0.5,
130
  "fractal_steps": [
@@ -141,7 +142,7 @@ Final Status: Epoch 200/200
141
  "loss_function": "Geometric Basin Compatibility",
142
  "cross_entropy": false,
143
  "attention_mechanisms": false,
144
- "timestamp": "20251011_032343"
145
  }
146
  ```
147
 
@@ -155,7 +156,7 @@ Final Status: Epoch 200/200
155
  β”œβ”€β”€ best_model_info.json (which epoch/run this came from)
156
  β”œβ”€β”€ runs_history.json (all training runs and their results)
157
  β”œβ”€β”€ README.md
158
- β”œβ”€β”€ weights/geo-beatrix-resnet34-step20-feats1000/20251011_032343/
159
  β”‚ β”œβ”€β”€ model.pt (best from this training run)
160
  β”‚ β”œβ”€β”€ model.safetensors (best from this training run)
161
  β”‚ β”œβ”€β”€ config.json
@@ -165,7 +166,7 @@ Final Status: Epoch 200/200
165
  β”‚ β”œβ”€β”€ checkpoint_epoch_100.safetensors
166
  β”‚ └── checkpoint_epoch_150.safetensors
167
  β”‚ (snapshots every 10 epochs)
168
- └── runs/geo-beatrix-resnet34-step20-feats1000/20251011_032343/
169
  β”œβ”€β”€ events.out.tfevents.* (TensorBoard logs)
170
  └── metrics.csv (training metrics)
171
  ```
@@ -201,13 +202,13 @@ with open(info_path) as f:
201
  # Or download from specific training run
202
  model_path = hf_hub_download(
203
  repo_id="AbstractPhil/geo-beatrix-resnet",
204
- filename="weights/geo-beatrix-resnet34-step20-feats1000/20251011_032343/model.safetensors"
205
  )
206
 
207
  # Download specific epoch checkpoint
208
  epoch_checkpoint = hf_hub_download(
209
  repo_id="AbstractPhil/geo-beatrix-resnet",
210
- filename="weights/geo-beatrix-resnet34-step20-feats1000/20251011_032343/checkpoints/checkpoint_epoch_100.safetensors"
211
  )
212
  ```
213
 
@@ -217,22 +218,21 @@ epoch_checkpoint = hf_hub_download(
217
 
218
  ### Best Checkpoint
219
  - Epoch: 160
220
- - Train Acc: 58.15%
221
- - Test Acc: 41.74%
222
- - Alpha: 0.4387
223
- - Loss: 0.8552
224
 
225
  ### Latest 5 Epochs
226
 
227
- - **Epoch 196**: Train 60.15%, Test 0.00%, Ξ±=0.4377, Loss=0.8412
228
- - **Epoch 197**: Train 54.50%, Test 0.00%, Ξ±=0.4377, Loss=0.7728
229
- - **Epoch 198**: Train 54.60%, Test 0.00%, Ξ±=0.4377, Loss=0.7710
230
- - **Epoch 199**: Train 54.85%, Test 0.00%, Ξ±=0.4377, Loss=0.7651
231
- - **Epoch 200**: Train 53.80%, Test 39.51%, Ξ±=0.4377, Loss=0.7805
232
 
233
  ### Training Milestones
234
- - πŸ“Š **Ξ± β‰₯ 0.40** reached at epoch 40
235
- - πŸ“Š **Ξ± β‰₯ 0.44** (near triadic equilibrium) at epoch 104
236
 
237
  ---
238
 
@@ -243,7 +243,7 @@ epoch_checkpoint = hf_hub_download(
243
  βœ… **Fractal positional encoding** (Cantor function from 1883)
244
  βœ… **Geometric compatibility classification**
245
  βœ… **ResNet34 backbone** (proven CNN architecture)
246
- βœ… **Triadic fractal mixing** (base-3 aligned)
247
 
248
  ---
249
 
 
5
  - cifar100
6
  - geometric-learning
7
  - fractal-encoding
8
+ - in-training
9
  - no-attention
10
  - no-cross-entropy
11
  datasets:
 
34
 
35
  **Geometric Basin Classification for CIFAR-100**
36
 
37
+ 🚧 **Training in Progress** 🚧
38
 
39
+ Current Status: Epoch 50/200
40
 
41
  ---
42
 
 
46
  |--------|-------|
47
  | **Best Test Accuracy** | **56.12%** |
48
  | **Best Epoch** | 160 |
49
+ | **Current Train Accuracy** | 40.78% |
50
+ | **Current Test Accuracy** | 39.68% |
51
+ | **Current Ξ± (Cantor param)** | 0.4132 |
52
  | **Total Parameters** | 28,561,101 |
53
+ | **Training Time** | 0:06:42 |
54
 
55
  ### All Training Runs
56
 
 
66
  | `20251011_023128` | βœ… | 160 | **56.12%** | 54.65% | 0.5005 |
67
  | `20251011_025919` | βœ… | 160 | **56.12%** | 57.80% | 0.4994 |
68
  | `20251011_032343` | βœ… | 160 | **56.12%** | 53.80% | 0.4377 |
69
+ | `20251011_034748` | πŸ”„ | 160 | **56.12%** | 40.78% | 0.4132 |
70
  | `20251010_200842` | βœ… | 180 | **53.61%** | 67.53% | 0.4442 |
71
  | `20251010_185133` | βœ… | 200 | **52.97%** | 69.87% | 0.4452 |
72
 
 
74
 
75
  | Model | Accuracy | Status |
76
  |-------|----------|--------|
77
+ | **geo-beatrix (this model)** | **56.12%** | πŸ”„ Training |
78
  | geo-beatrix (50M params) | 69.0% | Geometric Basin CONV architecture |
79
 
80
  🎯 **Current target**: Beat geo-beatrix (69.0%) - Currently -12.88%
 
91
  - **PE Features/Level**: 1000
92
  - **Classification**: Geometric Basin Compatibility (NO cross-entropy)
93
  - **Attention Mechanisms**: NONE
94
+ - **Mixing**: Standard (single patch)
95
 
96
  ---
97
 
 
120
  },
121
  "alphamix": {
122
  "enabled": true,
123
+ "fractal_mode": false,
124
  "range": [
125
  0.3,
126
  0.7
127
  ],
128
+ "spatial_ratio": 0.1,
129
  "curriculum_start": 0.0,
130
  "curriculum_end": 0.5,
131
  "fractal_steps": [
 
142
  "loss_function": "Geometric Basin Compatibility",
143
  "cross_entropy": false,
144
  "attention_mechanisms": false,
145
+ "timestamp": "20251011_034748"
146
  }
147
  ```
148
 
 
156
  β”œβ”€β”€ best_model_info.json (which epoch/run this came from)
157
  β”œβ”€β”€ runs_history.json (all training runs and their results)
158
  β”œβ”€β”€ README.md
159
+ β”œβ”€β”€ weights/geo-beatrix-resnet34-step20-feats1000/20251011_034748/
160
  β”‚ β”œβ”€β”€ model.pt (best from this training run)
161
  β”‚ β”œβ”€β”€ model.safetensors (best from this training run)
162
  β”‚ β”œβ”€β”€ config.json
 
166
  β”‚ β”œβ”€β”€ checkpoint_epoch_100.safetensors
167
  β”‚ └── checkpoint_epoch_150.safetensors
168
  β”‚ (snapshots every 10 epochs)
169
+ └── runs/geo-beatrix-resnet34-step20-feats1000/20251011_034748/
170
  β”œβ”€β”€ events.out.tfevents.* (TensorBoard logs)
171
  └── metrics.csv (training metrics)
172
  ```
 
202
  # Or download from specific training run
203
  model_path = hf_hub_download(
204
  repo_id="AbstractPhil/geo-beatrix-resnet",
205
+ filename="weights/geo-beatrix-resnet34-step20-feats1000/20251011_034748/model.safetensors"
206
  )
207
 
208
  # Download specific epoch checkpoint
209
  epoch_checkpoint = hf_hub_download(
210
  repo_id="AbstractPhil/geo-beatrix-resnet",
211
+ filename="weights/geo-beatrix-resnet34-step20-feats1000/20251011_034748/checkpoints/checkpoint_epoch_100.safetensors"
212
  )
213
  ```
214
 
 
218
 
219
  ### Best Checkpoint
220
  - Epoch: 160
221
+ - Train Acc: 40.78%
222
+ - Test Acc: 56.12%
223
+ - Alpha: 0.4132
224
+ - Loss: 0.0000
225
 
226
  ### Latest 5 Epochs
227
 
228
+ - **Epoch 46**: Train 39.24%, Test 0.00%, Ξ±=0.4101, Loss=1.7538
229
+ - **Epoch 47**: Train 39.59%, Test 0.00%, Ξ±=0.4090, Loss=1.8065
230
+ - **Epoch 48**: Train 40.28%, Test 0.00%, Ξ±=0.4128, Loss=1.8005
231
+ - **Epoch 49**: Train 40.06%, Test 0.00%, Ξ±=0.4142, Loss=1.7817
232
+ - **Epoch 50**: Train 40.78%, Test 39.68%, Ξ±=0.4132, Loss=1.7571
233
 
234
  ### Training Milestones
235
+ - πŸ“Š **Ξ± β‰₯ 0.40** reached at epoch 17
 
236
 
237
  ---
238
 
 
243
  βœ… **Fractal positional encoding** (Cantor function from 1883)
244
  βœ… **Geometric compatibility classification**
245
  βœ… **ResNet34 backbone** (proven CNN architecture)
246
+
247
 
248
  ---
249