acts-controller / README.md
yuuxia's picture
Update README.md
cf7f1e6 verified
---
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation
tags:
- reasoning
- agent
- chain-of-thought
---
# ACTS: Agentic Chain-of-Thought Steering Controller
This repository contains a controller agent checkpoint for **ACTS (Agentic Chain-of-Thought Steering)**, presented in the paper [Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning](https://huggingface.co/papers/2606.03965).
ACTS is a framework where a lightweight controller agent adaptively steers a frozen reasoner (such as DeepSeek-R1) step-by-step under a thinking-token budget. By formulating reasoning steering as a Markov decision process, the controller chooses a reasoning strategy and a short steering phrase at each step to enable controllable accuracy–efficiency trade-offs.
## Resources
- **Paper:** [Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning](https://huggingface.co/papers/2606.03965)
- **Repository:** [Andree-9/ACTS](https://github.com/Andree-9/ACTS)
- **SFT Data:** [yuuxia/controller-sft-data](https://huggingface.co/datasets/yuuxia/controller-sft-data)
## Quick Start Inference
To use this controller to steer a reasoner, follow the setup instructions in the [GitHub repository](https://github.com/Andree-9/ACTS) and run the following command:
```bash
conda activate slime
./scripts/run_acts_inference.sh \
--controller yuuxia/acts-controller \
--reasoner deepseek-ai/DeepSeek-R1-Distill-Qwen-7B \
--benchmark aime2024 \
--budget 10000
```
## Citation
```bibtex
@misc{xia2026acts,
title={Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning},
author={Yu Xia and Zhouhang Xie and Xin Xu and Byungkyu Kang and Prarit Lamba and Xiang Gao and Julian McAuley},
year={2026},
eprint={2606.03965},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2606.03965},
}
```