AbstractPhil commited on
Commit
242d1f5
Β·
verified Β·
1 Parent(s): 78aacb2

Update README - Run 20251012_231445

Browse files
Files changed (1) hide show
  1. README.md +28 -31
README.md CHANGED
@@ -12,7 +12,7 @@ datasets:
12
  metrics:
13
  - accuracy
14
  model-index:
15
- - name: David-decoupled-deep_efficiency
16
  results:
17
  - task:
18
  type: image-classification
@@ -21,7 +21,7 @@ model-index:
21
  type: imagenet-1k
22
  metrics:
23
  - type: accuracy
24
- value: 66.84
25
  ---
26
 
27
  # David: Multi-Scale Feature Classifier
@@ -36,12 +36,12 @@ exist simultaneously in the same shared space with the correct checks and spacin
36
  ## Model Details
37
 
38
  ### Architecture
39
- - **Preset**: high_accuracy
40
- - **Sharing Mode**: decoupled
41
- - **Fusion Mode**: deep_efficiency
42
- - **Scales**: [256, 512, 768, 1024, 1280]
43
  - **Feature Dim**: 512
44
- - **Parameters**: 14,877,593
45
 
46
  ### Training Configuration
47
  - **Dataset**: AbstractPhil/imagenet-clip-features-orderly
@@ -49,22 +49,19 @@ exist simultaneously in the same shared space with the correct checks and spacin
49
  - **Epochs**: 10
50
  - **Batch Size**: 1024
51
  - **Learning Rate**: 0.01
52
- - **Rose Loss Weight**: 0.2 β†’ 0.8
53
  - **Cayley Loss**: False
54
 
55
  ## Performance
56
 
57
  ### Best Results
58
- - **Validation Accuracy**: 66.84%
59
- - **Best Epoch**: 9
60
- - **Final Train Accuracy**: 66.12%
61
 
62
  ### Per-Scale Performance
63
- - **Scale 256**: 66.84%
64
- - **Scale 512**: 72.72%
65
- - **Scale 768**: 74.34%
66
- - **Scale 1024**: 75.09%
67
- - **Scale 1280**: 75.37%
68
 
69
 
70
  ## Usage
@@ -81,19 +78,19 @@ AbstractPhil/david-shared-space/
81
  β”œβ”€β”€ README.md # This file
82
  β”œβ”€β”€ best_model.json # Latest best model info
83
  β”œβ”€β”€ weights/
84
- β”‚ └── david_high_accuracy/
85
- β”‚ └── 20251012_221046/
86
  β”‚ β”œβ”€β”€ MODEL_SUMMARY.txt # 🎯 Human-readable performance summary
87
  β”‚ β”œβ”€β”€ training_history.json # πŸ“ˆ Epoch-by-epoch training curve
88
- β”‚ β”œβ”€β”€ best_model_acc66.84.safetensors # ⭐ Accuracy in filename!
89
- β”‚ β”œβ”€β”€ best_model_acc66.84_metadata.json
90
  β”‚ β”œβ”€β”€ final_model.safetensors
91
  β”‚ β”œβ”€β”€ checkpoint_epoch_X_accYY.YY.safetensors
92
  β”‚ β”œβ”€β”€ david_config.json
93
  β”‚ └── train_config.json
94
  └── runs/
95
- └── david_high_accuracy/
96
- └── 20251012_221046/
97
  └── events.out.tfevents.* # TensorBoard logs
98
  ```
99
 
@@ -106,9 +103,9 @@ from huggingface_hub import hf_hub_download
106
  # Browse available models in MODELS_INDEX.json first!
107
 
108
  # Specify model variant and run
109
- model_name = "david_high_accuracy"
110
- run_id = "20251012_221046"
111
- accuracy = "66.84" # From MODELS_INDEX.json
112
 
113
  # Download config
114
  config_path = hf_hub_download(
@@ -157,7 +154,7 @@ with torch.no_grad():
157
  ## Architecture Overview
158
 
159
  ### Multi-Scale Processing
160
- David processes inputs at multiple scales (256, 512, 768, 1024, 1280),
161
  allowing it to capture both coarse and fine-grained features.
162
 
163
  ### Shared Representation Space
@@ -178,20 +175,20 @@ score = w_anchor * sim(z, anchor) + w_need * sim(z, need) + ...
178
  ```
179
 
180
  ### Fusion Strategy
181
- **deep_efficiency**: Intelligently combines predictions from multiple scales.
182
 
183
  ## Training Details
184
 
185
  ### Loss Components
186
  - **Cross-Entropy**: Standard classification loss
187
- - **Rose Loss**: Pentachora role-weighted margin loss (weight: 0.2β†’0.8)
188
  - **Cayley Loss**: Geometric regularization (disabled)
189
 
190
  ### Optimization
191
  - **Optimizer**: AdamW
192
  - **Weight Decay**: 1e-05
193
  - **Scheduler**: cosine_restarts
194
- - **Gradient Clip**: 10.0
195
  - **Mixed Precision**: False
196
 
197
  ## Citation
@@ -202,7 +199,7 @@ score = w_anchor * sim(z, anchor) + w_need * sim(z, need) + ...
202
  author = {AbstractPhil},
203
  year = {2025},
204
  url = {https://huggingface.co/AbstractPhil/david-shared-space},
205
- note = {Run ID: 20251012_221046}
206
  }
207
  ```
208
 
@@ -217,4 +214,4 @@ Special thanks to Claude (Anthropic) for debugging assistance.
217
 
218
  ---
219
 
220
- *Generated on 2025-10-12 22:58:07*
 
12
  metrics:
13
  - accuracy
14
  model-index:
15
+ - name: David-fully_shared-weighted_sum
16
  results:
17
  - task:
18
  type: image-classification
 
21
  type: imagenet-1k
22
  metrics:
23
  - type: accuracy
24
+ value: 63.04
25
  ---
26
 
27
  # David: Multi-Scale Feature Classifier
 
36
  ## Model Details
37
 
38
  ### Architecture
39
+ - **Preset**: small_fast
40
+ - **Sharing Mode**: fully_shared
41
+ - **Fusion Mode**: weighted_sum
42
+ - **Scales**: [256, 512]
43
  - **Feature Dim**: 512
44
+ - **Parameters**: 656,898
45
 
46
  ### Training Configuration
47
  - **Dataset**: AbstractPhil/imagenet-clip-features-orderly
 
49
  - **Epochs**: 10
50
  - **Batch Size**: 1024
51
  - **Learning Rate**: 0.01
52
+ - **Rose Loss Weight**: 0.2 β†’ 0.6
53
  - **Cayley Loss**: False
54
 
55
  ## Performance
56
 
57
  ### Best Results
58
+ - **Validation Accuracy**: 63.04%
59
+ - **Best Epoch**: 0
60
+ - **Final Train Accuracy**: 54.92%
61
 
62
  ### Per-Scale Performance
63
+ - **Scale 256**: 62.12%
64
+ - **Scale 512**: 62.97%
 
 
 
65
 
66
 
67
  ## Usage
 
78
  β”œβ”€β”€ README.md # This file
79
  β”œβ”€β”€ best_model.json # Latest best model info
80
  β”œβ”€β”€ weights/
81
+ β”‚ └── david_small_fast/
82
+ β”‚ └── 20251012_231445/
83
  β”‚ β”œβ”€β”€ MODEL_SUMMARY.txt # 🎯 Human-readable performance summary
84
  β”‚ β”œβ”€β”€ training_history.json # πŸ“ˆ Epoch-by-epoch training curve
85
+ β”‚ β”œβ”€β”€ best_model_acc63.04.safetensors # ⭐ Accuracy in filename!
86
+ β”‚ β”œβ”€β”€ best_model_acc63.04_metadata.json
87
  β”‚ β”œβ”€β”€ final_model.safetensors
88
  β”‚ β”œβ”€β”€ checkpoint_epoch_X_accYY.YY.safetensors
89
  β”‚ β”œβ”€β”€ david_config.json
90
  β”‚ └── train_config.json
91
  └── runs/
92
+ └── david_small_fast/
93
+ └── 20251012_231445/
94
  └── events.out.tfevents.* # TensorBoard logs
95
  ```
96
 
 
103
  # Browse available models in MODELS_INDEX.json first!
104
 
105
  # Specify model variant and run
106
+ model_name = "david_small_fast"
107
+ run_id = "20251012_231445"
108
+ accuracy = "63.04" # From MODELS_INDEX.json
109
 
110
  # Download config
111
  config_path = hf_hub_download(
 
154
  ## Architecture Overview
155
 
156
  ### Multi-Scale Processing
157
+ David processes inputs at multiple scales (256, 512),
158
  allowing it to capture both coarse and fine-grained features.
159
 
160
  ### Shared Representation Space
 
175
  ```
176
 
177
  ### Fusion Strategy
178
+ **weighted_sum**: Intelligently combines predictions from multiple scales.
179
 
180
  ## Training Details
181
 
182
  ### Loss Components
183
  - **Cross-Entropy**: Standard classification loss
184
+ - **Rose Loss**: Pentachora role-weighted margin loss (weight: 0.2β†’0.6)
185
  - **Cayley Loss**: Geometric regularization (disabled)
186
 
187
  ### Optimization
188
  - **Optimizer**: AdamW
189
  - **Weight Decay**: 1e-05
190
  - **Scheduler**: cosine_restarts
191
+ - **Gradient Clip**: 5.0
192
  - **Mixed Precision**: False
193
 
194
  ## Citation
 
199
  author = {AbstractPhil},
200
  year = {2025},
201
  url = {https://huggingface.co/AbstractPhil/david-shared-space},
202
+ note = {Run ID: 20251012_231445}
203
  }
204
  ```
205
 
 
214
 
215
  ---
216
 
217
+ *Generated on 2025-10-12 23:19:29*