Commit ·
94f23c8
1
Parent(s): fc7557c
replace model ID with plain description
Browse files
app.py
CHANGED
|
@@ -299,10 +299,8 @@ LATEX_APPENDIX = r"""\section{Arithmetic case study: interpretability analysis}
|
|
| 299 |
\end{table}
|
| 300 |
|
| 301 |
\paragraph{Setup.}
|
| 302 |
-
All interpretability analyses use
|
| 303 |
-
|
| 304 |
-
(\texttt{2L/1H/128d}, 2 layers, 1 head, hidden size 128; trained on 100K examples),
|
| 305 |
-
evaluated on 2{,}600 held-out problems across 26 splits.
|
| 306 |
This model achieves 95.5\% accuracy with \sorl{} abstraction tokens and 0.1\% without; making it the clearest test-bed for causal analysis.
|
| 307 |
All results are reproducible from the released code.
|
| 308 |
|
|
|
|
| 299 |
\end{table}
|
| 300 |
|
| 301 |
\paragraph{Setup.}
|
| 302 |
+
All interpretability analyses use a 2-layer, 1-head, 128-dimensional transformer
|
| 303 |
+
trained on 100K examples, evaluated on 2{,}600 held-out problems across 26 splits.
|
|
|
|
|
|
|
| 304 |
This model achieves 95.5\% accuracy with \sorl{} abstraction tokens and 0.1\% without; making it the clearest test-bed for causal analysis.
|
| 305 |
All results are reproducible from the released code.
|
| 306 |
|