Yulin-Li commited on
Commit
037a153
Β·
verified Β·
1 Parent(s): 38d482c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +117 -3
README.md CHANGED
@@ -1,3 +1,117 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - rebalance
5
+ - steering-vector
6
+ - reasoning
7
+ - llm
8
+ - iclr-2026
9
+ language:
10
+ - en
11
+ ---
12
+
13
+
14
+
15
+ <h1 align="center">ReBalance Steering Vectors</h1>
16
+
17
+ <p align="center">
18
+ Steering vectors for <strong>Efficient Reasoning with Balanced Thinking</strong> (ICLR 2026)
19
+ </p>
20
+
21
+ <p align="center">
22
+ <a href="https://huggingface.co/papers/2603.12372"><img src="https://img.shields.io/badge/Paper-Hugging_Face-b31b1b.svg" alt="Paper (Hugging Face)"></a>
23
+ <a href="https://openreview.net/forum?id=cJseWJJ5IM"><img src="https://img.shields.io/badge/Paper-Open_Review-8D1B12.svg" alt="Paper (Open Review)"></a>
24
+ <a href="https://github.com/yu-lin-li/ReBalance"><img src="https://img.shields.io/badge/Code-GitHub-black.svg" alt="Code (GitHub)"></a>
25
+ <a href="https://rebalance-ai.github.io"><img src="https://img.shields.io/badge/Project-Page-2EA44F.svg" alt="Project Page"></a>
26
+ <a href="https://github.com/yu-lin-li/ReBalance/blob/main/LICENSE"><img src="https://img.shields.io/badge/License-MIT-yellow.svg" alt="License: MIT"></a>
27
+ </p>
28
+
29
+ ## Overview
30
+ This repository provides **steering vectors only** for ReBalance.
31
+
32
+ It does **not** include:
33
+ - base model weights,
34
+ - inference/training code,
35
+ - datasets.
36
+
37
+ Use this repository together with the official codebase:
38
+ - Code: https://github.com/yu-lin-li/ReBalance
39
+ - Inference script: `transformer_inference_steer_dp.py`
40
+
41
+ ## Available Steering Vectors
42
+ | Base model | Vector path | Recommended `--steer_layer` |
43
+ |---|---|---|
44
+ | DeepSeek-R1-Distill-Qwen-1.5B | `vectors/DeepSeek-R1-Distill-Qwen-1.5B/steer_vector_layer19_conf_mixed.pt` | `19` |
45
+ | DeepSeek-R1-Distill-Qwen-7B | `vectors/DeepSeek-R1-Distill-Qwen-7B/steer_vector_layer22_conf_mixed.pt` | `22` |
46
+ | QwQ-32B | `vectors/QwQ-32B/steer_vector_layer58_conf_mixed.pt` | `58` |
47
+
48
+ ## Local Directory Layout (after download)
49
+ ```text
50
+ ReBalance/
51
+ β”œβ”€β”€ transformer_inference_steer_dp.py
52
+ └── vectors/
53
+ β”œβ”€β”€ DeepSeek-R1-Distill-Qwen-1.5B/
54
+ β”‚ └── steer_vector_layer19_conf_mixed.pt
55
+ β”œβ”€β”€ DeepSeek-R1-Distill-Qwen-7B/
56
+ β”‚ └── steer_vector_layer22_conf_mixed.pt
57
+ └── QwQ-32B/
58
+ └── steer_vector_layer58_conf_mixed.pt
59
+ ```
60
+
61
+ ## Download
62
+ ### Option 1: Clone the full model repository
63
+ ```bash
64
+ git lfs install
65
+ git clone https://huggingface.co/Yulin-Li/ReBalance
66
+ ```
67
+
68
+ Then copy the `vectors/` folder into your local `ReBalance/` root directory.
69
+
70
+ ### Option 2: Download only the vectors with `huggingface_hub`
71
+ ```python
72
+ from huggingface_hub import snapshot_download
73
+
74
+ snapshot_download(
75
+ repo_id="Yulin-Li/ReBalance",
76
+ repo_type="model",
77
+ allow_patterns="vectors/*",
78
+ local_dir="."
79
+ )
80
+ ```
81
+
82
+ ## Quick Usage with ReBalance
83
+ ```bash
84
+ python transformer_inference_steer_dp.py \
85
+ --model_name_or_path 'DeepSeek-R1-Distill-Qwen-1.5B' \
86
+ --dataset_dir "./Data/" \
87
+ --output_path "./outputs" \
88
+ --dataset "Math_AIME2024" \
89
+ --max_generated_tokens 16000 \
90
+ --num_gpus 8 \
91
+ --steer_vector_path ./vectors/DeepSeek-R1-Distill-Qwen-1.5B/steer_vector_layer19_conf_mixed.pt \
92
+ --steer_layer 19 \
93
+ --steer_coef -1
94
+ ```
95
+
96
+ ## Paper, Code, and Project Links
97
+ - Paper (Hugging Face): https://huggingface.co/papers/2603.12372
98
+ - Paper (Open Review): https://openreview.net/forum?id=cJseWJJ5IM
99
+ - Code: https://github.com/yu-lin-li/ReBalance
100
+ - Project page: https://rebalance-ai.github.io
101
+
102
+ ## Intended Use
103
+ - Research and reproducibility for ReBalance.
104
+ - Experiments on reasoning efficiency and accuracy trade-offs.
105
+ - Comparative studies on overthinking mitigation.
106
+
107
+ ## Citation
108
+ If you find ReBalance useful in your research, please cite our paper:
109
+
110
+ ```bibtex
111
+ @article{li2026efficient,
112
+ title={Efficient Reasoning with Balanced Thinking},
113
+ author={Li, Yulin and Tu, Tengyao and Ding, Li and Wang, Junjie and Zhen, Huiling and Chen, Yixin and Li, Yong and Tian, Zhuotao},
114
+ booktitle={Proceedings of the 14th International Conference on Learning Representations},
115
+ year={2026}
116
+ }
117
+ ```