Improve model card: Add pipeline tag, paper, project page, and code links
#1 by nielsr (HF Staff) - opened
README.md CHANGED
@@ -4,11 +4,18 @@ library_name: transformers
 tags:
 - mergekit
 - merge
-
+pipeline_tag: text-generation
 ---
+
 # mergercoder
 
-This
+This repository hosts a merged language model, based on the work presented in [Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic](https://arxiv.org/abs/2509.01363). This work demonstrates that reasoning ability, once learned, can be extracted and transferred between models as a compact task vector, offering a practical way to enhance models by recycling prior computational investments.
+
+* 📖 Paper: [Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic](https://arxiv.org/abs/2509.01363)
+* 🌐 Project Page: [https://elm.baulab.info](https://elm.baulab.info)
+* 💻 Code: [https://github.com/rohitgandikota/erasing-llm](https://github.com/rohitgandikota/erasing-llm)
+
+This model is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
 ## Merge Details
 ### Merge Method
@@ -36,5 +43,4 @@ models:
 weight: -1.0
 merge_method: task_arithmetic
 dtype: bfloat16
-```
-arxiv.org/abs/2509.01363
+```
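
For readers of this diff, the `merge_method: task_arithmetic` setting and the `weight: -1.0` entry in the config can be illustrated with a small sketch. Task arithmetic, as generally described, adds weighted task vectors (fine-tuned parameters minus base parameters) back onto a base model, so a negative weight subtracts a direction. The function name `task_arithmetic`, the toy checkpoints `with_skill` and `without_skill`, and the weights `[1.0, -1.0]` are hypothetical and chosen only for illustration. The full model list is not visible in this diff, and mergekit itself works on real tensors with more options than shown here.

```python
# Minimal sketch of task arithmetic on toy "checkpoints" (plain dicts of floats).
# Assumption: merged = base + sum_i w_i * (model_i - base), applied per parameter.
# All names and values below are hypothetical, not taken from the actual merge.

def task_arithmetic(base, models, weights):
    """Return base plus the weighted sum of task vectors (model - base)."""
    merged = {}
    for name, base_vals in base.items():
        out = list(base_vals)  # start from the base parameters
        for model, w in zip(models, weights):
            for i, (m, b) in enumerate(zip(model[name], base_vals)):
                out[i] += w * (m - b)  # add the scaled task vector
        merged[name] = out
    return merged


base = {"layer.weight": [1.0, 2.0]}
with_skill = {"layer.weight": [1.5, 2.0]}     # hypothetical fine-tune to add
without_skill = {"layer.weight": [1.0, 2.5]}  # hypothetical fine-tune to subtract

# A weight of -1.0 removes the second model's task vector from the result.
merged = task_arithmetic(base, [with_skill, without_skill], [1.0, -1.0])
print(merged)
```

In this toy case the first parameter moves toward `with_skill` and the second moves away from `without_skill`, which is the effect a negative weight is meant to have.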