AbstractPhil commited on
Commit
a570768
Β·
verified Β·
1 Parent(s): d576dd9

Epoch 50: 33.38%

Browse files
Files changed (1) hide show
  1. README.md +44 -43
README.md CHANGED
@@ -5,7 +5,7 @@ tags:
5
  - cifar100
6
  - geometric-learning
7
  - fractal-encoding
8
- - trained
9
  - no-attention
10
  - no-cross-entropy
11
  datasets:
@@ -15,7 +15,7 @@ metrics:
15
  library_name: pytorch
16
  pipeline_tag: image-classification
17
  model-index:
18
- - name: geo-beatrix-resnet34-step18-feats1000
19
  results:
20
  - task:
21
  type: image-classification
@@ -30,13 +30,13 @@ model-index:
30
  verified: false
31
  ---
32
 
33
- # geo-beatrix-resnet34-step18-feats1000
34
 
35
  **Geometric Basin Classification for CIFAR-100**
36
 
37
- πŸŽ‰ **Training Complete** πŸŽ‰
38
 
39
- Final Status: Epoch 200/200
40
 
41
  ---
42
 
@@ -46,11 +46,11 @@ Final Status: Epoch 200/200
46
  |--------|-------|
47
  | **Best Test Accuracy** | **56.12%** |
48
  | **Best Epoch** | 160 |
49
- | **Current Train Accuracy** | 64.44% |
50
- | **Current Test Accuracy** | 53.62% |
51
- | **Current Ξ± (Cantor param)** | 0.4419 |
52
- | **Total Parameters** | 27,846,901 |
53
- | **Training Time** | 0:27:37 |
54
 
55
  ### All Training Runs
56
 
@@ -59,6 +59,7 @@ Final Status: Epoch 200/200
59
  | `20251010_203717` | βœ… | 160 | **56.12%** | 67.82% | 0.4481 |
60
  | `20251010_211210` | πŸ”„ | 160 | **56.12%** | 16.21% | 0.3879 |
61
  | `20251010_213807` | βœ… | 160 | **56.12%** | 64.44% | 0.4419 |
 
62
  | `20251010_200842` | βœ… | 180 | **53.61%** | 67.53% | 0.4442 |
63
  | `20251010_185133` | βœ… | 200 | **52.97%** | 69.87% | 0.4452 |
64
 
@@ -66,8 +67,8 @@ Final Status: Epoch 200/200
66
 
67
  | Model | Accuracy | Status |
68
  |-------|----------|--------|
69
- | **geo-beatrix (this model)** | **56.12%** | βœ… Complete |
70
- | geo-beatrix | 69.0% | Geometric Basin CONV architecture - 50m Params |
71
 
72
  🎯 **Current target**: Beat geo-beatrix (69.0%) - Currently -12.88%
73
 
@@ -75,12 +76,12 @@ Final Status: Epoch 200/200
75
 
76
  ## Architecture
77
 
78
- - **Base**: ResNet18 (torchvision)
79
  - **Pretrained**: From scratch
80
- - **Features**: 512-dim from ResNet18
81
  - **Positional Encoding**: Devil's Staircase (Cantor function, 1883)
82
- - **PE Levels**: 18
83
- - **PE Features/Level**: 1000
84
  - **Classification**: Geometric Basin Compatibility (NO cross-entropy)
85
  - **Attention Mechanisms**: NONE
86
  - **Mixing**: Fractal (triadic multi-patch)
@@ -91,16 +92,16 @@ Final Status: Epoch 200/200
91
 
92
  ```json
93
  {
94
- "model_name": "geo-beatrix-resnet34-step18-feats1000",
95
  "model_type": "geometric_basin_classifier",
96
  "num_classes": 100,
97
- "batch_size": 512,
98
  "num_epochs": 200,
99
- "base_learning_rate": 0.002,
100
- "weight_decay": 0.05,
101
  "warmup_epochs": 10,
102
- "pe_levels": 18,
103
- "pe_features_per_level": 1000,
104
  "dropout": 0.1,
105
  "pretrained_resnet": false,
106
  "frozen_resnet": false,
@@ -117,8 +118,8 @@ Final Status: Epoch 200/200
117
  0.3,
118
  0.7
119
  ],
120
- "spatial_ratio": 0.25,
121
- "curriculum_start": 0.0,
122
  "curriculum_end": 0.5,
123
  "fractal_steps": [
124
  1,
@@ -130,11 +131,11 @@ Final Status: Epoch 200/200
130
  0.037037037037037035
131
  ]
132
  },
133
- "architecture": "ResNet18 + Beatrix's Staircase PE",
134
  "loss_function": "Geometric Basin Compatibility",
135
  "cross_entropy": false,
136
  "attention_mechanisms": false,
137
- "timestamp": "20251010_213807"
138
  }
139
  ```
140
 
@@ -148,7 +149,7 @@ Final Status: Epoch 200/200
148
  β”œβ”€β”€ best_model_info.json (which epoch/run this came from)
149
  β”œβ”€β”€ runs_history.json (all training runs and their results)
150
  β”œβ”€β”€ README.md
151
- β”œβ”€β”€ weights/geo-beatrix-resnet34-step18-feats1000/20251010_213807/
152
  β”‚ β”œβ”€β”€ model.pt (best from this training run)
153
  β”‚ β”œβ”€β”€ model.safetensors (best from this training run)
154
  β”‚ β”œβ”€β”€ config.json
@@ -158,7 +159,7 @@ Final Status: Epoch 200/200
158
  β”‚ β”œβ”€β”€ checkpoint_epoch_100.safetensors
159
  β”‚ └── checkpoint_epoch_150.safetensors
160
  β”‚ (snapshots every 10 epochs)
161
- └── runs/geo-beatrix-resnet34-step18-feats1000/20251010_213807/
162
  β”œβ”€β”€ events.out.tfevents.* (TensorBoard logs)
163
  └── metrics.csv (training metrics)
164
  ```
@@ -194,13 +195,13 @@ with open(info_path) as f:
194
  # Or download from specific training run
195
  model_path = hf_hub_download(
196
  repo_id="AbstractPhil/geo-beatrix-resnet",
197
- filename="weights/geo-beatrix-resnet34-step18-feats1000/20251010_213807/model.safetensors"
198
  )
199
 
200
  # Download specific epoch checkpoint
201
  epoch_checkpoint = hf_hub_download(
202
  repo_id="AbstractPhil/geo-beatrix-resnet",
203
- filename="weights/geo-beatrix-resnet34-step18-feats1000/20251010_213807/checkpoints/checkpoint_epoch_100.safetensors"
204
  )
205
  ```
206
 
@@ -210,23 +211,23 @@ epoch_checkpoint = hf_hub_download(
210
 
211
  ### Best Checkpoint
212
  - Epoch: 160
213
- - Train Acc: 65.96%
214
- - Test Acc: 54.14%
215
- - Alpha: 0.4427
216
- - Loss: 0.8208
217
 
218
  ### Latest 5 Epochs
219
 
220
- - **Epoch 196**: Train 65.52%, Test 0.00%, Ξ±=0.4420, Loss=0.7704
221
- - **Epoch 197**: Train 62.64%, Test 0.00%, Ξ±=0.4420, Loss=0.7315
222
- - **Epoch 198**: Train 64.22%, Test 0.00%, Ξ±=0.4419, Loss=0.7351
223
- - **Epoch 199**: Train 60.50%, Test 0.00%, Ξ±=0.4419, Loss=0.7178
224
- - **Epoch 200**: Train 64.44%, Test 53.62%, Ξ±=0.4419, Loss=0.7510
225
 
226
  ### Training Milestones
227
- - 🎯 **50% Accuracy** reached at epoch 85
228
- - πŸ“Š **Ξ± β‰₯ 0.40** reached at epoch 12
229
- - πŸ“Š **Ξ± β‰₯ 0.44** (near triadic equilibrium) at epoch 64
230
 
231
  ---
232
 
@@ -236,7 +237,7 @@ epoch_checkpoint = hf_hub_download(
236
  βœ… **NO cross-entropy loss**
237
  βœ… **Fractal positional encoding** (Cantor function from 1883)
238
  βœ… **Geometric compatibility classification**
239
- βœ… **ResNet18 backbone** (proven CNN architecture)
240
  βœ… **Triadic fractal mixing** (base-3 aligned)
241
 
242
  ---
 
5
  - cifar100
6
  - geometric-learning
7
  - fractal-encoding
8
+ - in-training
9
  - no-attention
10
  - no-cross-entropy
11
  datasets:
 
15
  library_name: pytorch
16
  pipeline_tag: image-classification
17
  model-index:
18
+ - name: geo-beatrix-resnet34-step12-feats100
19
  results:
20
  - task:
21
  type: image-classification
 
30
  verified: false
31
  ---
32
 
33
+ # geo-beatrix-resnet34-step12-feats100
34
 
35
  **Geometric Basin Classification for CIFAR-100**
36
 
37
+ 🚧 **Training in Progress** 🚧
38
 
39
+ Current Status: Epoch 50/200
40
 
41
  ---
42
 
 
46
  |--------|-------|
47
  | **Best Test Accuracy** | **56.12%** |
48
  | **Best Epoch** | 160 |
49
+ | **Current Train Accuracy** | 56.74% |
50
+ | **Current Test Accuracy** | 33.38% |
51
+ | **Current Ξ± (Cantor param)** | 0.4995 |
52
+ | **Total Parameters** | 21,846,001 |
53
+ | **Training Time** | 0:02:59 |
54
 
55
  ### All Training Runs
56
 
 
59
  | `20251010_203717` | βœ… | 160 | **56.12%** | 67.82% | 0.4481 |
60
  | `20251010_211210` | πŸ”„ | 160 | **56.12%** | 16.21% | 0.3879 |
61
  | `20251010_213807` | βœ… | 160 | **56.12%** | 64.44% | 0.4419 |
62
+ | `20251010_230300` | πŸ”„ | 160 | **56.12%** | 56.74% | 0.4995 |
63
  | `20251010_200842` | βœ… | 180 | **53.61%** | 67.53% | 0.4442 |
64
  | `20251010_185133` | βœ… | 200 | **52.97%** | 69.87% | 0.4452 |
65
 
 
67
 
68
  | Model | Accuracy | Status |
69
  |-------|----------|--------|
70
+ | **geo-beatrix (this model)** | **56.12%** | πŸ”„ Training |
71
+ | geo-beatrix (50M params) | 69.0% | Geometric Basin CONV architecture |
72
 
73
  🎯 **Current target**: Beat geo-beatrix (69.0%) - Currently -12.88%
74
 
 
76
 
77
  ## Architecture
78
 
79
+ - **Base**: ResNet34 (torchvision)
80
  - **Pretrained**: From scratch
81
+ - **Features**: 512-dim from ResNet34
82
  - **Positional Encoding**: Devil's Staircase (Cantor function, 1883)
83
+ - **PE Levels**: 12
84
+ - **PE Features/Level**: 100
85
  - **Classification**: Geometric Basin Compatibility (NO cross-entropy)
86
  - **Attention Mechanisms**: NONE
87
  - **Mixing**: Fractal (triadic multi-patch)
 
92
 
93
  ```json
94
  {
95
+ "model_name": "geo-beatrix-resnet34-step12-feats100",
96
  "model_type": "geometric_basin_classifier",
97
  "num_classes": 100,
98
+ "batch_size": 1024,
99
  "num_epochs": 200,
100
+ "base_learning_rate": 0.01,
101
+ "weight_decay": 0.0,
102
  "warmup_epochs": 10,
103
+ "pe_levels": 12,
104
+ "pe_features_per_level": 100,
105
  "dropout": 0.1,
106
  "pretrained_resnet": false,
107
  "frozen_resnet": false,
 
118
  0.3,
119
  0.7
120
  ],
121
+ "spatial_ratio": 0.05,
122
+ "curriculum_start": 0.25,
123
  "curriculum_end": 0.5,
124
  "fractal_steps": [
125
  1,
 
131
  0.037037037037037035
132
  ]
133
  },
134
+ "architecture": "ResNet34 + Devil's Staircase PE",
135
  "loss_function": "Geometric Basin Compatibility",
136
  "cross_entropy": false,
137
  "attention_mechanisms": false,
138
+ "timestamp": "20251010_230300"
139
  }
140
  ```
141
 
 
149
  β”œβ”€β”€ best_model_info.json (which epoch/run this came from)
150
  β”œβ”€β”€ runs_history.json (all training runs and their results)
151
  β”œβ”€β”€ README.md
152
+ β”œβ”€β”€ weights/geo-beatrix-resnet34-step12-feats100/20251010_230300/
153
  β”‚ β”œβ”€β”€ model.pt (best from this training run)
154
  β”‚ β”œβ”€β”€ model.safetensors (best from this training run)
155
  β”‚ β”œβ”€β”€ config.json
 
159
  β”‚ β”œβ”€β”€ checkpoint_epoch_100.safetensors
160
  β”‚ └── checkpoint_epoch_150.safetensors
161
  β”‚ (snapshots every 10 epochs)
162
+ └── runs/geo-beatrix-resnet34-step12-feats100/20251010_230300/
163
  β”œβ”€β”€ events.out.tfevents.* (TensorBoard logs)
164
  └── metrics.csv (training metrics)
165
  ```
 
195
  # Or download from specific training run
196
  model_path = hf_hub_download(
197
  repo_id="AbstractPhil/geo-beatrix-resnet",
198
+ filename="weights/geo-beatrix-resnet34-step12-feats100/20251010_230300/model.safetensors"
199
  )
200
 
201
  # Download specific epoch checkpoint
202
  epoch_checkpoint = hf_hub_download(
203
  repo_id="AbstractPhil/geo-beatrix-resnet",
204
+ filename="weights/geo-beatrix-resnet34-step12-feats100/20251010_230300/checkpoints/checkpoint_epoch_100.safetensors"
205
  )
206
  ```
207
 
 
211
 
212
  ### Best Checkpoint
213
  - Epoch: 160
214
+ - Train Acc: 56.74%
215
+ - Test Acc: 56.12%
216
+ - Alpha: 0.4995
217
+ - Loss: 0.0000
218
 
219
  ### Latest 5 Epochs
220
 
221
+ - **Epoch 46**: Train 60.77%, Test 33.07%, Ξ±=0.4970, Loss=0.9218
222
+ - **Epoch 47**: Train 60.55%, Test 0.00%, Ξ±=0.5002, Loss=0.9846
223
+ - **Epoch 48**: Train 57.74%, Test 33.91%, Ξ±=0.5023, Loss=0.9368
224
+ - **Epoch 49**: Train 56.59%, Test 0.00%, Ξ±=0.4993, Loss=0.9326
225
+ - **Epoch 50**: Train 56.74%, Test 33.38%, Ξ±=0.4995, Loss=0.8828
226
 
227
  ### Training Milestones
228
+ - πŸ“Š **Ξ± β‰₯ 0.40** reached at epoch 6
229
+ - πŸ“Š **Ξ± β‰₯ 0.44** (near triadic equilibrium) at epoch 8
230
+ - βš›οΈ **Ξ± = 0.50** (TRIADIC EQUILIBRIUM!) at epoch 21
231
 
232
  ---
233
 
 
237
  βœ… **NO cross-entropy loss**
238
  βœ… **Fractal positional encoding** (Cantor function from 1883)
239
  βœ… **Geometric compatibility classification**
240
+ βœ… **ResNet34 backbone** (proven CNN architecture)
241
  βœ… **Triadic fractal mixing** (base-3 aligned)
242
 
243
  ---