Update README.md

037a153 verified 1 day ago

3.92 kB

	---
	license: mit
	tags:
	- rebalance
	- steering-vector
	- reasoning
	- llm
	- iclr-2026
	language:
	- en
	---



	<h1 align="center">ReBalance Steering Vectors</h1>

	<p align="center">
	Steering vectors for <strong>Efficient Reasoning with Balanced Thinking</strong> (ICLR 2026)
	</p>

	<p align="center">
	<a href="https://huggingface.co/papers/2603.12372"><img src="https://img.shields.io/badge/Paper-Hugging_Face-b31b1b.svg" alt="Paper (Hugging Face)"></a>
	<a href="https://openreview.net/forum?id=cJseWJJ5IM"><img src="https://img.shields.io/badge/Paper-Open_Review-8D1B12.svg" alt="Paper (Open Review)"></a>
	<a href="https://github.com/yu-lin-li/ReBalance"><img src="https://img.shields.io/badge/Code-GitHub-black.svg" alt="Code (GitHub)"></a>
	<a href="https://rebalance-ai.github.io"><img src="https://img.shields.io/badge/Project-Page-2EA44F.svg" alt="Project Page"></a>
	<a href="https://github.com/yu-lin-li/ReBalance/blob/main/LICENSE"><img src="https://img.shields.io/badge/License-MIT-yellow.svg" alt="License: MIT"></a>
	</p>

	## Overview
	This repository provides steering vectors only for ReBalance.

	It does not include:
	- base model weights,
	- inference/training code,
	- datasets.

	Use this repository together with the official codebase:
	- Code: https://github.com/yu-lin-li/ReBalance
	- Inference script: `transformer_inference_steer_dp.py`

	## Available Steering Vectors
	\| Base model \| Vector path \| Recommended `--steer_layer` \|
	\|---\|---\|---\|
	\| DeepSeek-R1-Distill-Qwen-1.5B \| `vectors/DeepSeek-R1-Distill-Qwen-1.5B/steer_vector_layer19_conf_mixed.pt` \| `19` \|
	\| DeepSeek-R1-Distill-Qwen-7B \| `vectors/DeepSeek-R1-Distill-Qwen-7B/steer_vector_layer22_conf_mixed.pt` \| `22` \|
	\| QwQ-32B \| `vectors/QwQ-32B/steer_vector_layer58_conf_mixed.pt` \| `58` \|

	## Local Directory Layout (after download)
	```text
	ReBalance/
	├── transformer_inference_steer_dp.py
	└── vectors/
	├── DeepSeek-R1-Distill-Qwen-1.5B/
	│ └── steer_vector_layer19_conf_mixed.pt
	├── DeepSeek-R1-Distill-Qwen-7B/
	│ └── steer_vector_layer22_conf_mixed.pt
	└── QwQ-32B/
	└── steer_vector_layer58_conf_mixed.pt
	```

	## Download
	### Option 1: Clone the full model repository
	```bash
	git lfs install
	git clone https://huggingface.co/Yulin-Li/ReBalance
	```

	Then copy the `vectors/` folder into your local `ReBalance/` root directory.

	### Option 2: Download only the vectors with `huggingface_hub`
	```python
	from huggingface_hub import snapshot_download

	snapshot_download(
	repo_id="Yulin-Li/ReBalance",
	repo_type="model",
	allow_patterns="vectors/*",
	local_dir="."
	)
	```

	## Quick Usage with ReBalance
	```bash
	python transformer_inference_steer_dp.py \
	--model_name_or_path 'DeepSeek-R1-Distill-Qwen-1.5B' \
	--dataset_dir "./Data/" \
	--output_path "./outputs" \
	--dataset "Math_AIME2024" \
	--max_generated_tokens 16000 \
	--num_gpus 8 \
	--steer_vector_path ./vectors/DeepSeek-R1-Distill-Qwen-1.5B/steer_vector_layer19_conf_mixed.pt \
	--steer_layer 19 \
	--steer_coef -1
	```

	## Paper, Code, and Project Links
	- Paper (Hugging Face): https://huggingface.co/papers/2603.12372
	- Paper (Open Review): https://openreview.net/forum?id=cJseWJJ5IM
	- Code: https://github.com/yu-lin-li/ReBalance
	- Project page: https://rebalance-ai.github.io

	## Intended Use
	- Research and reproducibility for ReBalance.
	- Experiments on reasoning efficiency and accuracy trade-offs.
	- Comparative studies on overthinking mitigation.

	## Citation
	If you find ReBalance useful in your research, please cite our paper:

	```bibtex
	@article{li2026efficient,
	title={Efficient Reasoning with Balanced Thinking},
	author={Li, Yulin and Tu, Tengyao and Ding, Li and Wang, Junjie and Zhen, Huiling and Chen, Yixin and Li, Yong and Tian, Zhuotao},
	booktitle={Proceedings of the 14th International Conference on Learning Representations},
	year={2026}
	}
	```