chanind commited on
Commit
9e67a7c
·
verified ·
1 Parent(s): ec2c0ac

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +41 -11
README.md CHANGED
@@ -2,24 +2,54 @@
2
  library_name: saelens
3
  ---
4
 
5
- # Synthetic Model for SAE Training
6
 
7
- This repository contains a SyntheticModel for use with SAELens.
8
 
9
- ## Model Info
10
 
11
- - **Number of features**: 16,384
12
- - **Hidden dimension**: 512
13
- - **Hierarchy**: Yes
14
- - Root nodes: 128
15
- - Total nodes: 10880
16
- - Max depth: 4
17
- - **Feature correlation**: Yes (scale 0.1)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
 
19
  ## Usage
20
 
21
  ```python
22
  from sae_lens.synthetic import SyntheticModel
23
 
24
- model = SyntheticModel.from_pretrained("chanind/synth-sae-bench-variations", model_path="superposition/d-512")
25
  ```
 
2
  library_name: saelens
3
  ---
4
 
5
+ # SyntheticSAEBench Model Variations
6
 
7
+ This repository contains variations on the [SynthSAEBench-16k](https://huggingface.co/decoderesearch/synth-sae-bench-16k-v1) model, organized into subdirs based on the specific attribute that's different. Unless otherwise specified, all other attributes are identical to the original SynthSAEBench-16k model.
8
 
9
+ ### firing-magnitude-stdev
10
 
11
+ These models change the stev of firing magnitude, setting it to a constant for each feature in the model. The base model uses a random std per-feature with mean 0.5. Available variations:
12
+ - std-0
13
+ - std-0.1
14
+ - std-0.5
15
+ - std-2.5
16
+
17
+ ### superposition
18
+
19
+ These models change the hidden dimension of the model, changing the level of superposition in the model. Larger hidden dim means less superposition. The base model has hidden dim 768. Available variations:
20
+ - d-512
21
+ - d-1024
22
+ - d-1536
23
+
24
+ ### truncate-num-features
25
+
26
+ These models truncate the number of features in the original model, keeping the first N features. The base model has 16384 feature. Available variations:
27
+ - n-4096
28
+ - n-8192
29
+
30
+ ### relative-firing-probability
31
+
32
+ These models scale all the probabilities of the original model by the given multiplier (1.0 would be identical to the base model). This also scales the L0 of the model. Available variations:
33
+
34
+ - rel-p-0.1
35
+ - rel-p-0.25
36
+ - rel-p-0.5
37
+ - rel-p-0.75
38
+ - rel-p-1.25
39
+ - rel-p-1.5
40
+
41
+ ### misc
42
+
43
+ These models change several properties at once, typically using different hierarchy structures. However, the current models here are designed to keep the L0 of the first 4096 features at around 25 to match the standard model. Available variations:
44
+ - hierarchy-128-128-me-1.0-l0-40-4kl0-25
45
+ - rand-hierarchy-16-4-32-me-0.75-l0-30-4kl0-24
46
+
47
+ In these models, `me-0.75` means 75% of nodes in the hierarchy have mutually-exclusive children. The number after `hierarchy` is the number of root nodes. `rand-hierarchy` means there is a random number of children per parent. E.g. `rand-hierarchy-16-4-32` means 16 root nodes, and randomly between 4 and 32 child nodes per parent. For full details of the settings of `misc` models, it's best to look at the model config directly.
48
 
49
  ## Usage
50
 
51
  ```python
52
  from sae_lens.synthetic import SyntheticModel
53
 
54
+ model = SyntheticModel.from_pretrained("chanind/synth-sae-bench-variations", model_path="model/path")
55
  ```