Complete Sparse Autoencoder

#15
by juiceb0xc0de - opened

SAE Performance Metrics

Across all 30 layers, the SAEs achieved 0% dead features with stable L0 sparsity.

Note: Reconstruction loss scales up in later layers due to shifting activation magnitudes, but Explained Variance (EV) remains strong throughout.

Layer EV Mean L0 Recon Loss Dead %
0 0.9480 48.74 0.2074 0.0%
1 0.9599 43.65 0.3298 0.0%
2 0.9631 46.81 0.5021 0.0%
3 0.9508 46.56 0.7462 0.0%
4 0.9463 46.23 0.8936 0.0%
5 0.9350 47.57 1.1605 0.0%
6 0.9306 48.44 1.3838 0.0%
7 0.9318 49.51 1.5446 0.0%
8 0.9432 46.52 1.6598 0.0%
9 0.9373 47.15 2.0706 0.0%
10 0.9348 45.53 2.2983 0.0%
11 0.9905 48.58 5.8113 0.0%
12 0.9901 48.42 6.1039 0.0%
13 0.9891 46.15 6.9692 0.0%
14 0.9884 44.76 7.1844 0.0%
15 0.9863 47.63 8.6521 0.0%
16 0.9840 43.45 10.179 0.0%
17 0.9808 45.88 12.363 0.0%
18 0.9811 47.17 12.300 0.0%
19 0.9775 46.23 15.521 0.0%
20 0.9726 48.23 18.727 0.0%
21 0.9667 46.49 24.817 0.0%
22 0.9594 46.15 31.128 0.0%
23 0.9418 45.13 47.936 0.0%
24 0.9397 45.35 57.931 0.0%
25 0.9299 46.33 74.413 0.0%
26 0.9212 45.32 92.727 0.0%
27 0.9139 45.69 118.75 0.0%
28 0.8809 46.56 129.83 0.0%
29 0.8812 52.01 196.36 0.0%

https://huggingface.co/datasets/juiceb0xc0de/smollm2-135m-instruct-SAE

Sign up or log in to comment