AbstractPhil committed · Commit 61f9d02 · verified · 1 Parent(s): b6ae463

Update README - Run 20251013_004438

Files changed (1)
  1. README.md +33 -26
README.md CHANGED
@@ -12,7 +12,7 @@ datasets:
  metrics:
  - accuracy
  model-index:
- - name: David-fully_shared-weighted_sum
    results:
    - task:
        type: image-classification
@@ -21,7 +21,7 @@ model-index:
        type: imagenet-1k
      metrics:
      - type: accuracy
-       value: 66.52
  ---

  # David: Multi-Scale Feature Classifier
@@ -36,12 +36,12 @@ exist simultaneously in the same shared space with the correct checks and spacin
  ## Model Details

  ### Architecture
- - **Preset**: small_fast
- - **Sharing Mode**: fully_shared
- - **Fusion Mode**: weighted_sum
- - **Scales**: [256, 512]
  - **Feature Dim**: 512
- - **Parameters**: 656,898

  ### Training Configuration
  - **Dataset**: AbstractPhil/imagenet-clip-features-orderly
@@ -55,13 +55,20 @@ exist simultaneously in the same shared space with the correct checks and spacin
  ## Performance

  ### Best Results
- - **Validation Accuracy**: 66.52%
- - **Best Epoch**: 9
- - **Final Train Accuracy**: 63.87%

  ### Per-Scale Performance
- - **Scale 256**: 65.96%
- - **Scale 512**: 66.43%


  ## Usage
@@ -78,19 +85,19 @@ AbstractPhil/david-shared-space/
  ├── README.md  # This file
  ├── best_model.json  # Latest best model info
  ├── weights/
- │   └── david_small_fast/
- │       └── 20251012_235237/
  │           ├── MODEL_SUMMARY.txt  # 🎯 Human-readable performance summary
  │           ├── training_history.json  # 📈 Epoch-by-epoch training curve
- │           ├── best_model_acc66.52.safetensors  # ⭐ Accuracy in filename!
- │           ├── best_model_acc66.52_metadata.json
  │           ├── final_model.safetensors
  │           ├── checkpoint_epoch_X_accYY.YY.safetensors
  │           ├── david_config.json
  │           └── train_config.json
  └── runs/
-     └── david_small_fast/
-         └── 20251012_235237/
              └── events.out.tfevents.*  # TensorBoard logs
  ```

@@ -103,9 +110,9 @@ from huggingface_hub import hf_hub_download
  # Browse available models in MODELS_INDEX.json first!

  # Specify model variant and run
- model_name = "david_small_fast"
- run_id = "20251012_235237"
- accuracy = "66.52" # From MODELS_INDEX.json

  # Download config
  config_path = hf_hub_download(
@@ -154,7 +161,7 @@ with torch.no_grad():
  ## Architecture Overview

  ### Multi-Scale Processing
- David processes inputs at multiple scales (256, 512),
  allowing it to capture both coarse and fine-grained features.

  ### Shared Representation Space
@@ -175,7 +182,7 @@ score = w_anchor * sim(z, anchor) + w_need * sim(z, need) + ...
  ```

  ### Fusion Strategy
- **weighted_sum**: Intelligently combines predictions from multiple scales.

  ## Training Details

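Note on the hunk above: the removed line describes `weighted_sum` only as combining predictions from multiple scales, and the diff does not show how the weights are applied. A minimal sketch of one way such a fusion could work, assuming each scale produces a logit vector and the per-scale weights are softmax-normalized before summing. The function name `weighted_sum_fusion` and the softmax normalization are illustrative assumptions, not taken from the David source:

```python
import math


def weighted_sum_fusion(per_scale_logits, raw_weights):
    """Fuse per-scale logits with softmax-normalized weights (hypothetical helper)."""
    # Softmax over the raw weights so the fusion weights sum to 1.
    peak = max(raw_weights)
    exps = [math.exp(w - peak) for w in raw_weights]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Weighted sum of the logit vectors, one class at a time.
    num_classes = len(per_scale_logits[0])
    fused = [0.0] * num_classes
    for weight, logits in zip(weights, per_scale_logits):
        for i, value in enumerate(logits):
            fused[i] += weight * value
    return fused


# Two scales (e.g. 256 and 512), three classes, equal raw weights.
logits_256 = [2.0, 0.0, 1.0]
logits_512 = [0.0, 4.0, 1.0]
fused = weighted_sum_fusion([logits_256, logits_512], [0.0, 0.0])
predicted_class = max(range(len(fused)), key=fused.__getitem__)
```

With equal raw weights each scale contributes half of its logits, so the fused prediction follows whichever scale is more confident about a class.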
@@ -188,7 +195,7 @@ score = w_anchor * sim(z, anchor) + w_need * sim(z, need) + ...
  - **Optimizer**: AdamW
  - **Weight Decay**: 1e-05
  - **Scheduler**: cosine_restarts
- - **Gradient Clip**: 15.0
  - **Mixed Precision**: False

  ## Citation
@@ -199,7 +206,7 @@ score = w_anchor * sim(z, anchor) + w_need * sim(z, need) + ...
  author = {AbstractPhil},
  year = {2025},
  url = {https://huggingface.co/AbstractPhil/david-shared-space},
- note = {Run ID: 20251012_235237}
  }
  ```

@@ -214,4 +221,4 @@ Special thanks to Claude (Anthropic) for debugging assistance.

  ---

- *Generated on 2025-10-13 00:38:47*
 
  metrics:
  - accuracy
  model-index:
+ - name: David-decoupled-deep_efficiency
    results:
    - task:
        type: image-classification
 
        type: imagenet-1k
      metrics:
      - type: accuracy
+       value: 58.40
  ---

  # David: Multi-Scale Feature Classifier
 
  ## Model Details

  ### Architecture
+ - **Preset**: gated_expert_team
+ - **Sharing Mode**: decoupled
+ - **Fusion Mode**: deep_efficiency
+ - **Scales**: [128, 256, 384, 448, 512, 576, 640, 768, 896]
  - **Feature Dim**: 512
+ - **Parameters**: 22,133,801

  ### Training Configuration
  - **Dataset**: AbstractPhil/imagenet-clip-features-orderly
 
  ## Performance

  ### Best Results
+ - **Validation Accuracy**: 58.40%
+ - **Best Epoch**: 0
+ - **Final Train Accuracy**: 51.66%

  ### Per-Scale Performance
+ - **Scale 128**: 58.40%
+ - **Scale 256**: 67.03%
+ - **Scale 384**: 69.55%
+ - **Scale 448**: 70.34%
+ - **Scale 512**: 70.84%
+ - **Scale 576**: 71.29%
+ - **Scale 640**: 71.60%
+ - **Scale 768**: 72.03%
+ - **Scale 896**: 72.25%


  ## Usage
 
  ├── README.md  # This file
  ├── best_model.json  # Latest best model info
  ├── weights/
+ │   └── david_gated_expert_team/
+ │       └── 20251013_004438/
  │           ├── MODEL_SUMMARY.txt  # 🎯 Human-readable performance summary
  │           ├── training_history.json  # 📈 Epoch-by-epoch training curve
+ │           ├── best_model_acc58.40.safetensors  # ⭐ Accuracy in filename!
+ │           ├── best_model_acc58.40_metadata.json
  │           ├── final_model.safetensors
  │           ├── checkpoint_epoch_X_accYY.YY.safetensors
  │           ├── david_config.json
  │           └── train_config.json
  └── runs/
+     └── david_gated_expert_team/
+         └── 20251013_004438/
              └── events.out.tfevents.*  # TensorBoard logs
  ```

 
  # Browse available models in MODELS_INDEX.json first!

  # Specify model variant and run
+ model_name = "david_gated_expert_team"
+ run_id = "20251013_004438"
+ accuracy = "58.40" # From MODELS_INDEX.json

  # Download config
  config_path = hf_hub_download(
 
  ## Architecture Overview

  ### Multi-Scale Processing
+ David processes inputs at multiple scales (128, 256, 384, 448, 512, 576, 640, 768, 896),
  allowing it to capture both coarse and fine-grained features.

  ### Shared Representation Space
 
  ```

  ### Fusion Strategy
+ **deep_efficiency**: Intelligently combines predictions from multiple scales.

  ## Training Details

 
  - **Optimizer**: AdamW
  - **Weight Decay**: 1e-05
  - **Scheduler**: cosine_restarts
+ - **Gradient Clip**: 10.0
  - **Mixed Precision**: False

  ## Citation
 
  author = {AbstractPhil},
  year = {2025},
  url = {https://huggingface.co/AbstractPhil/david-shared-space},
+ note = {Run ID: 20251013_004438}
  }
  ```

 

  ---

+ *Generated on 2025-10-13 00:49:36*
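
Note on the layout changed in this commit: the README's directory tree places each run under `weights/<model_name>/<run_id>/` and embeds the accuracy in the best-model filename. A minimal sketch of building those repo-relative paths from the three variables set in the usage snippet. The helper name `run_paths` is hypothetical, and the paths are inferred from the tree in the diff rather than from the repository's loading code:

```python
from pathlib import PurePosixPath


def run_paths(model_name: str, run_id: str, accuracy: str) -> dict:
    """Build repo-relative file paths for one run (hypothetical helper)."""
    # Layout inferred from the README tree: weights/<model_name>/<run_id>/...
    run_dir = PurePosixPath("weights") / model_name / run_id
    return {
        "weights": str(run_dir / f"best_model_acc{accuracy}.safetensors"),
        "metadata": str(run_dir / f"best_model_acc{accuracy}_metadata.json"),
        "config": str(run_dir / "david_config.json"),
    }


# Values from the new side of this diff.
paths = run_paths("david_gated_expert_team", "20251013_004438", "58.40")

# Each path could then be fetched with huggingface_hub, for example:
#   from huggingface_hub import hf_hub_download
#   config_path = hf_hub_download(
#       repo_id="AbstractPhil/david-shared-space", filename=paths["config"]
#   )
```

Keeping the accuracy as a string preserves the trailing zero in `58.40`, which a float would drop when formatted into the filename.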