Update README.md
README.md CHANGED
@@ -144,7 +144,7 @@ Sparse Autoencoder checkpoint:
 'model_state_dict': <OrderedDict>, # SAE weights
 'model_config': {
 'input_dim': 768, # ViT hidden dimension
-'hidden_dim':
+'hidden_dim': 768, # SAE latent dimension (768×1)
 'k': 16, # TopK sparsity (16 for CIFAR-10, 32 for Imagenette)
 'activation': 'topk' # Activation type
 },
@@ -195,7 +195,6 @@ Class-specific expert features:
 ### SAE Models
 
 - **Layers**: 8, 9, 10 (out of 12 ViT layers)
-- **Architecture**: Overcomplete (768 → 3072 → 768)
 - **Sparsity**: TopK activation
 - **CIFAR-10**: k=16 (only top 16 features active per sample)
 - **Imagenette**: k=32 (only top 32 features active per sample)
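
For readers unfamiliar with the `'activation': 'topk'` setting, here is a minimal pure-Python sketch of what "only top k features active per sample" means with the updated config values (`hidden_dim` 768, `k` 16). The helper name `topk_activation` and the toy latent vector are illustrative assumptions, not code from this repository. The README does not say how ties are broken or whether a ReLU is applied first, so this sketch does not model either.

```python
import heapq

# Values mirror the 'model_config' entries in the README diff;
# the variable name itself is illustrative.
model_config = {
    "input_dim": 768,
    "hidden_dim": 768,
    "k": 16,
    "activation": "topk",
}


def topk_activation(pre_activations, k):
    """Keep the k largest entries of one sample's latent vector, zero the rest."""
    if not 0 < k <= len(pre_activations):
        raise ValueError("k must be between 1 and the latent dimension")
    # Indices of the k largest values.
    keep = set(
        heapq.nlargest(k, range(len(pre_activations)), key=pre_activations.__getitem__)
    )
    return [v if i in keep else 0.0 for i, v in enumerate(pre_activations)]


# Toy latent vector with hidden_dim entries (deterministic, illustration only).
latents = [((i * 37) % 101) / 100.0 for i in range(model_config["hidden_dim"])]
sparse = topk_activation(latents, model_config["k"])
active = sum(1 for v in sparse if v != 0.0)
```

With `k` set to 16, `active` counts the 16 surviving latent features; setting `k` to 32 would correspond to the Imagenette setting listed in the README.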