AbstractPhil commited on
Commit
9fc6e13
Β·
verified Β·
1 Parent(s): 7ffdb24

Update README - Run 20251012_065325

Browse files
Files changed (1) hide show
  1. README.md +44 -42
README.md CHANGED
@@ -12,7 +12,7 @@ datasets:
12
  metrics:
13
  - accuracy
14
  model-index:
15
- - name: David-partial_shared-deep_efficiency
16
  results:
17
  - task:
18
  type: image-classification
@@ -21,7 +21,7 @@ model-index:
21
  type: imagenet-1k
22
  metrics:
23
  - type: accuracy
24
- value: 83.04
25
  ---
26
 
27
  # David: Multi-Scale Crystal Classifier
@@ -32,17 +32,17 @@ as class prototypes with role-weighted similarity computation (Rose Loss).
32
  ## Model Details
33
 
34
  ### Architecture
35
- - **Preset**: clip_vit_l14_deep
36
- - **Sharing Mode**: partial_shared
37
  - **Fusion Mode**: deep_efficiency
38
- - **Scales**: [256, 512, 768, 1024, 1280, 1536, 1792, 2048, 2304, 2560]
39
- - **Feature Dim**: 768
40
  - **Parameters**: ~8.8M
41
 
42
  ### Training Configuration
43
  - **Dataset**: AbstractPhil/imagenet-clip-features-orderly
44
- - **Model Variant**: clip_vit_l14
45
- - **Epochs**: 10
46
  - **Batch Size**: 1024
47
  - **Learning Rate**: 0.001
48
  - **Rose Loss Weight**: 0.1 β†’ 0.5
@@ -51,21 +51,12 @@ as class prototypes with role-weighted similarity computation (Rose Loss).
51
  ## Performance
52
 
53
  ### Best Results
54
- - **Validation Accuracy**: 83.04%
55
- - **Best Epoch**: 9
56
- - **Final Train Accuracy**: 91.00%
57
 
58
  ### Per-Scale Performance
59
- - **Scale 256**: 83.04%
60
- - **Scale 512**: 83.12%
61
- - **Scale 768**: 83.20%
62
- - **Scale 1024**: 83.21%
63
- - **Scale 1280**: 83.25%
64
- - **Scale 1536**: 83.13%
65
- - **Scale 1792**: 83.15%
66
- - **Scale 2048**: 83.14%
67
- - **Scale 2304**: 82.96%
68
- - **Scale 2560**: 82.71%
69
 
70
 
71
  ## Usage
@@ -75,17 +66,20 @@ as class prototypes with role-weighted similarity computation (Rose Loss).
75
  ```
76
  AbstractPhil/gated-david/
77
  β”œβ”€β”€ weights/
78
- β”‚ β”œβ”€β”€ best_model.pth # Best model weights (PyTorch)
79
- β”‚ β”œβ”€β”€ best_model.safetensors # Best model weights (SafeTensors)
80
- β”‚ β”œβ”€β”€ best_model_metadata.json # Training metadata
81
- β”‚ β”œβ”€β”€ final_model.pth # Final epoch weights
82
- β”‚ β”œβ”€β”€ final_model.safetensors
83
- β”‚ β”œβ”€β”€ david_config.json # Model architecture config
84
- β”‚ └── train_config.json # Training configuration
 
85
  β”œβ”€β”€ runs/
86
- β”‚ └── events.out.tfevents.* # TensorBoard logs
87
- β”œβ”€β”€ README.md # This file
88
- └── best_model.json # Performance summary
 
 
89
  ```
90
 
91
  ### Loading the Model
@@ -94,19 +88,27 @@ AbstractPhil/gated-david/
94
  from geovocab2.train.model.core.david import David, DavidArchitectureConfig
95
  from huggingface_hub import hf_hub_download
96
 
 
 
 
 
97
  # Download config
98
- config_path = hf_hub_download(repo_id="AbstractPhil/gated-david",
99
- filename="weights/david_config.json")
 
 
100
  config = DavidArchitectureConfig.from_json(config_path)
101
 
102
  # Download weights
103
- weights_path = hf_hub_download(repo_id="AbstractPhil/gated-david",
104
- filename="weights/best_model.pth")
 
 
105
 
106
- # Initialize model
 
107
  david = David.from_config(config)
108
- checkpoint = torch.load(weights_path)
109
- david.load_state_dict(checkpoint['model_state_dict'])
110
  david.eval()
111
  ```
112
 
@@ -131,7 +133,7 @@ with torch.no_grad():
131
  ## Architecture Overview
132
 
133
  ### Multi-Scale Processing
134
- David processes inputs at multiple scales (256, 512, 768, 1024, 1280, 1536, 1792, 2048, 2304, 2560),
135
  allowing it to capture both coarse and fine-grained features.
136
 
137
  ### Crystal Geometry
@@ -162,7 +164,7 @@ score = w_anchor * sim(z, anchor) + w_need * sim(z, need) + ...
162
  - **Optimizer**: AdamW
163
  - **Weight Decay**: 1e-05
164
  - **Scheduler**: cosine_restarts
165
- - **Gradient Clip**: 5.0
166
  - **Mixed Precision**: False
167
 
168
  ## Citation
@@ -173,7 +175,7 @@ score = w_anchor * sim(z, anchor) + w_need * sim(z, need) + ...
173
  author = {AbstractPhil},
174
  year = {2025},
175
  url = {https://huggingface.co/AbstractPhil/gated-david},
176
- note = {Run ID: 20251012_060013}
177
  }
178
  ```
179
 
@@ -188,4 +190,4 @@ Special thanks to Claude (Anthropic) for debugging assistance.
188
 
189
  ---
190
 
191
- *Generated on 2025-10-12 06:40:54*
 
12
  metrics:
13
  - accuracy
14
  model-index:
15
+ - name: David-decoupled-deep_efficiency
16
  results:
17
  - task:
18
  type: image-classification
 
21
  type: imagenet-1k
22
  metrics:
23
  - type: accuracy
24
+ value: 69.49
25
  ---
26
 
27
  # David: Multi-Scale Crystal Classifier
 
32
  ## Model Details
33
 
34
  ### Architecture
35
+ - **Preset**: high_accuracy
36
+ - **Sharing Mode**: decoupled
37
  - **Fusion Mode**: deep_efficiency
38
+ - **Scales**: [256, 512, 768, 1024, 1280]
39
+ - **Feature Dim**: 512
40
  - **Parameters**: ~8.8M
41
 
42
  ### Training Configuration
43
  - **Dataset**: AbstractPhil/imagenet-clip-features-orderly
44
+ - **Model Variant**: clip_vit_laion_b32
45
+ - **Epochs**: 20
46
  - **Batch Size**: 1024
47
  - **Learning Rate**: 0.001
48
  - **Rose Loss Weight**: 0.1 β†’ 0.5
 
51
  ## Performance
52
 
53
  ### Best Results
54
+ - **Validation Accuracy**: 69.49%
55
+ - **Best Epoch**: 0
56
+ - **Final Train Accuracy**: 65.85%
57
 
58
  ### Per-Scale Performance
59
+ - **Scale 256**: 69.49%
 
 
 
 
 
 
 
 
 
60
 
61
 
62
  ## Usage
 
66
  ```
67
  AbstractPhil/gated-david/
68
  β”œβ”€β”€ weights/
69
+ β”‚ └── david_high_accuracy/
70
+ β”‚ └── 20251012_065325/
71
+ β”‚ β”œβ”€β”€ best_model.safetensors
72
+ β”‚ β”œβ”€β”€ best_model_metadata.json
73
+ β”‚ β”œβ”€β”€ final_model.safetensors
74
+ β”‚ β”œβ”€β”€ checkpoint_epoch_X.safetensors
75
+ β”‚ β”œβ”€β”€ david_config.json
76
+ β”‚ └── train_config.json
77
  β”œβ”€β”€ runs/
78
+ β”‚ └── david_high_accuracy/
79
+ β”‚ └── 20251012_065325/
80
+ β”‚ └── events.out.tfevents.*
81
+ β”œβ”€β”€ README.md
82
+ └── best_model.json
83
  ```
84
 
85
  ### Loading the Model
 
88
  from geovocab2.train.model.core.david import David, DavidArchitectureConfig
89
  from huggingface_hub import hf_hub_download
90
 
91
+ # Specify model variant and run
92
+ model_name = "david_high_accuracy"
93
+ run_id = "20251012_065325"
94
+
95
  # Download config
96
+ config_path = hf_hub_download(
97
+ repo_id="AbstractPhil/gated-david",
98
+ filename=f"weights/{model_name}/{run_id}/david_config.json"
99
+ )
100
  config = DavidArchitectureConfig.from_json(config_path)
101
 
102
  # Download weights
103
+ weights_path = hf_hub_download(
104
+ repo_id="AbstractPhil/gated-david",
105
+ filename=f"weights/{model_name}/{run_id}/best_model.safetensors"
106
+ )
107
 
108
+ # Load model
109
+ from safetensors.torch import load_file
110
  david = David.from_config(config)
111
+ david.load_state_dict(load_file(weights_path))
 
112
  david.eval()
113
  ```
114
 
 
133
  ## Architecture Overview
134
 
135
  ### Multi-Scale Processing
136
+ David processes inputs at multiple scales (256, 512, 768, 1024, 1280),
137
  allowing it to capture both coarse and fine-grained features.
138
 
139
  ### Crystal Geometry
 
164
  - **Optimizer**: AdamW
165
  - **Weight Decay**: 1e-05
166
  - **Scheduler**: cosine_restarts
167
+ - **Gradient Clip**: 10.0
168
  - **Mixed Precision**: False
169
 
170
  ## Citation
 
175
  author = {AbstractPhil},
176
  year = {2025},
177
  url = {https://huggingface.co/AbstractPhil/gated-david},
178
+ note = {Run ID: 20251012_065325}
179
  }
180
  ```
181
 
 
190
 
191
  ---
192
 
193
+ *Generated on 2025-10-12 06:57:45*