Transluce
/

input_ablation_llama3.1_8b_instruct_llama3.1_8b_instruct

Text Generation

text-generation-inference

Model card Files Files and versions

belindazli commited on 16 days ago

Commit

81e6d46

·

verified ·

1 Parent(s): 927fd2f

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -8,8 +8,8 @@ pipeline_tag: text-generation
 This is a **Llama-3.1-8B-Instruct** explainer model fine-tuned for the **input ablations** task for the **Llama-3.1-8B-Instruct** target model, as described in [this paper](https://arxiv.org/abs/2511.08579). In the input ablations task, explainer models are trained to predict how removing "hint" tokens from an MMLU prompt with a hint changes the output of Llama-3.1-8B-Instruct. This helps in understanding the causal relationships between input components and model behavior.
-Repository: https://github.com/TransluceAI/introspective-interp
-Paper: https://arxiv.org/abs/2511.08579
 ## Sample Usage

 This is a **Llama-3.1-8B-Instruct** explainer model fine-tuned for the **input ablations** task for the **Llama-3.1-8B-Instruct** target model, as described in [this paper](https://arxiv.org/abs/2511.08579). In the input ablations task, explainer models are trained to predict how removing "hint" tokens from an MMLU prompt with a hint changes the output of Llama-3.1-8B-Instruct. This helps in understanding the causal relationships between input components and model behavior.
+[Repository](https://github.com/TransluceAI/introspective-interp) |
+[Paper](https://arxiv.org/abs/2511.08579)
 ## Sample Usage