Commit 9c4fb7d by nielsr (HF Staff) · verified · 1 parent: 01fd106

Improve model card: Add pipeline tag, paper, project page, and code links

This PR improves the model card by:

- Setting the `pipeline_tag` to `text-generation`, ensuring the model can be found under the correct pipeline on the Hugging Face Hub.
- Adding a prominent link to the paper: [Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic](https://arxiv.org/abs/2509.01363).
- Including a link to the project page: https://elm.baulab.info.
- Adding a link to the GitHub repository: https://github.com/rohitgandikota/erasing-llm.

These additions make the model card more informative and improve discoverability.

Files changed (1)
  1. README.md +10 -4
README.md CHANGED
@@ -4,11 +4,18 @@ library_name: transformers
 tags:
 - mergekit
 - merge
-
+pipeline_tag: text-generation
 ---
+
 # mergercoder
 
-This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
+This repository hosts a merged language model, based on the work presented in [Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic](https://arxiv.org/abs/2509.01363). This work demonstrates that reasoning ability, once learned, can be extracted and transferred between models as a compact task vector, offering a practical way to enhance models by recycling prior computational investments.
+
+* 📖 Paper: [Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic](https://arxiv.org/abs/2509.01363)
+* 🌐 Project Page: [https://elm.baulab.info](https://elm.baulab.info)
+* 💻 Code: [https://github.com/rohitgandikota/erasing-llm](https://github.com/rohitgandikota/erasing-llm)
+
+This model is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
 ## Merge Details
 ### Merge Method
@@ -36,5 +43,4 @@ models:
 weight: -1.0
 merge_method: task_arithmetic
 dtype: bfloat16
-```
-arxiv.org/abs/2509.01363
+```
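
The `merge_method: task_arithmetic` setting and the negative `weight: -1.0` in the config can be read as adding and subtracting weight deltas measured against a base model. The following is a minimal sketch of that idea on toy weights stored as plain Python lists. The helper names `task_vector` and `apply_task_vectors`, the parameter name `layer.w`, and all numeric values are hypothetical; the sketch does not reproduce mergekit's implementation or the models in this merge.

```python
from typing import Dict, List, Sequence, Tuple

# Toy stand-in for a model checkpoint: parameter name -> flat list of values.
Weights = Dict[str, List[float]]


def task_vector(finetuned: Weights, base: Weights) -> Weights:
    """Return the per-parameter difference (finetuned - base)."""
    return {
        name: [f - b for f, b in zip(finetuned[name], base[name])]
        for name in base
    }


def apply_task_vectors(
    base: Weights, vectors: Sequence[Tuple[Weights, float]]
) -> Weights:
    """Add each task vector to the base weights, scaled by its weight."""
    merged = {name: list(values) for name, values in base.items()}
    for vector, weight in vectors:
        for name, delta in vector.items():
            merged[name] = [m + weight * d for m, d in zip(merged[name], delta)]
    return merged


# Hypothetical values chosen only to make the arithmetic easy to follow.
base = {"layer.w": [1.0, 2.0]}
model_a = {"layer.w": [1.5, 2.0]}  # contributes with weight +1.0
model_b = {"layer.w": [1.0, 3.0]}  # contributes with weight -1.0

merged = apply_task_vectors(
    base,
    [
        (task_vector(model_a, base), 1.0),
        (task_vector(model_b, base), -1.0),
    ],
)
print(merged)  # {'layer.w': [1.5, 1.0]}
```

A weight of `-1.0` subtracts a model's delta from the base instead of adding it, which is how one fine-tune's contribution can be removed while another's is kept.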