AbstractPhil commited on
Commit
4c67308
·
verified ·
1 Parent(s): 4800afa

Upload runs/cifar100_weighted_20251119_144723/README.md with huggingface_hub

Browse files
runs/cifar100_weighted_20251119_144723/README.md ADDED
@@ -0,0 +1,64 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Run: cifar100_weighted_20251119_144723
2
+
3
+ ## Configuration
4
+ - **Dataset**: CIFAR100
5
+ - **Fusion Mode**: weighted
6
+ - **Parameters**: 15,894,644
7
+ - **Simplex**: 4-simplex (5 vertices)
8
+
9
+ ## Performance
10
+ - **Best Validation Accuracy**: 39.98%
11
+ - **Training Time**: 1.0 hours
12
+ - **Batch Size**: 128
13
+ - **Mixed Precision**: False
14
+ - **Final Epoch**: 100
15
+
16
+ ## Files
17
+ - `runs/cifar100_weighted_20251119_144723/checkpoints/best_model.safetensors` - Model weights (SafeTensors)
18
+ - `runs/cifar100_weighted_20251119_144723/checkpoints/best_training_state.pt` - Optimizer/scheduler state
19
+ - `runs/cifar100_weighted_20251119_144723/checkpoints/best_metadata.json` - Training metadata
20
+ - `runs/cifar100_weighted_20251119_144723/config.yaml` - Full configuration
21
+ - `runs/cifar100_weighted_20251119_144723/tensorboard/` - TensorBoard logs
22
+
23
+ ## Usage
24
+ ```python
25
+ from safetensors.torch import load_file
26
+ import torch
27
+
28
+ # Download from HuggingFace Hub
29
+ from huggingface_hub import hf_hub_download
30
+
31
+ model_path = hf_hub_download(
32
+ repo_id="AbstractPhil/vit-beans-v3",
33
+ filename="runs/cifar100_weighted_20251119_144723/checkpoints/best_model.safetensors"
34
+ )
35
+
36
+ # Load model weights (SafeTensors - no pickle!)
37
+ state_dict = load_file(model_path)
38
+ model.load_state_dict(state_dict)
39
+ ```
40
+
41
+ ## Training Configuration
42
+ ```yaml
43
+ embed_dim: 512
44
+ num_fusion_blocks: 6
45
+ num_heads: 8
46
+ fusion_mode: weighted
47
+ k_simplex: 4
48
+ learning_rate: 0.0003
49
+ batch_size: 128
50
+ epochs: 100
51
+ weight_decay: 0.05
52
+ ```
53
+
54
+ ## Details
55
+
56
+ Built with geometric consciousness-aware routing using the Devil's Staircase (Beatrix) and pentachoron parameterization.
57
+
58
+ **Training completed**: 2025-11-19 15:46:35
59
+
60
+ **Safe Format**: All model weights use SafeTensors (not pickle) for maximum security.
61
+
62
+ ---
63
+
64
+ [← Back to main repository](https://huggingface.co/AbstractPhil/vit-beans-v3)