nielsr HF Staff committed on
Commit 59e83a2 · verified · 1 Parent(s): 086f0a9

Add pipeline tag and GitHub link

This PR improves the model card by adding the `text-generation` pipeline tag to the metadata and providing a direct link to the official GitHub repository. The repository contains the custom model classes required to properly handle the continuous feature embeddings used by this model.
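
The metadata part of this change can be sanity-checked without any network access. The sketch below is only an illustration: `parse_front_matter` is a hypothetical helper written for this example (it is not part of the repository or of any Hugging Face library), and it handles only the flat scalars and lists that appear in this README's front matter, not general YAML.

```python
def parse_front_matter(readme_text):
    """Parse a simple YAML front-matter block (flat scalars and lists only).

    Hypothetical helper for illustration; not a general YAML parser.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter present

    metadata = {}
    current_key = None  # key whose list items are being collected
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter of the front matter
            break
        if not stripped:
            continue
        if stripped.startswith("- ") and current_key is not None:
            # List item belonging to the most recent "key:" line
            metadata[current_key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            key, value = key.strip(), value.strip()
            if value:
                metadata[key] = value  # scalar entry, e.g. "license: mit"
                current_key = None
            else:
                metadata[key] = []  # start of a list, e.g. "language:"
                current_key = key
    return metadata


# Front matter as it appears on the "+" side of this PR's diff.
README_AFTER = """---
base_model:
- meta-llama/Llama-3.1-8B-Instruct
language:
- en
license: mit
pipeline_tag: text-generation
---

# Model Card
"""

metadata = parse_front_matter(README_AFTER)
print(metadata["pipeline_tag"])
```

The existing keys (`base_model`, `language`, `license`) keep their values and are only reordered; `pipeline_tag` is the one new metadata entry.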

Files changed (1)
  1. README.md +9 -7
README.md CHANGED
@@ -1,18 +1,20 @@
  ---
- license: mit
- language:
- - en
  base_model:
  - meta-llama/Llama-3.1-8B-Instruct
+ language:
+ - en
+ license: mit
+ pipeline_tag: text-generation
  ---

  # Model Card

- This is a Llama-3.1-8B-Instruct model fine-tuned to explain continuous features from Llama-3.1-8B.
- This model was trained to map SAE features from Llama-3.1-8B's residual stream to their explanations derived from Neuronpedia.
- It generalizes to explaining any arbitrary continuous feature from Llama-3.1-8B's residual stream.
+ This is a Llama-3.1-8B-Instruct model fine-tuned to explain continuous features from Llama-3.1-8B, as described in the paper [Training Language Models to Explain Their Own Computations](https://arxiv.org/abs/2511.08579).
+
+ This model was trained to map SAE features from Llama-3.1-8B's residual stream to their explanations derived from Neuronpedia. It generalizes to explaining any arbitrary continuous feature from Llama-3.1-8B's residual stream.

- See [paper](https://arxiv.org/abs/2511.08579) for more details.
+ - **Repository:** [https://github.com/TransluceAI/introspective-interp](https://github.com/TransluceAI/introspective-interp)
+ - **Paper:** [https://arxiv.org/abs/2511.08579](https://arxiv.org/abs/2511.08579)

  ## Usage