Update README.md
Browse files
README.md
CHANGED
|
@@ -4,7 +4,7 @@ tags: []
|
|
| 4 |
---
|
| 5 |
# FW Medium
|
| 6 |
|
| 7 |
-
This is the
|
| 8 |
The primary purpose of this model is interpretability, most design choices were made with that in mind.
|
| 9 |
|
| 10 |
The code to run this custom model can be found [here](https://github.com/tdooms/bilinear-decomposition), along with many utility functions for weight-based interpretability.
|
|
|
|
| 4 |
---
|
| 5 |
# FW Medium
|
| 6 |
|
| 7 |
+
This is the medium version of the bilinear transformers trained on FineWeb-edu.
|
| 8 |
The primary purpose of this model is interpretability, most design choices were made with that in mind.
|
| 9 |
|
| 10 |
The code to run this custom model can be found [here](https://github.com/tdooms/bilinear-decomposition), along with many utility functions for weight-based interpretability.
|