nielsr HF Staff commited on
Commit
451deb6
·
verified ·
1 Parent(s): 8a18545

Add model card for ACTS Controller

Browse files

This PR adds a comprehensive model card for the ACTS (Agentic Chain-of-Thought Steering) controller agent.

It includes:
- Metadata for `pipeline_tag`, `library_name`, and `license`.
- Links to the research paper [Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning](https://huggingface.co/papers/2606.03965).
- A link to the official [GitHub repository](https://github.com/Andree-9/ACTS).
- Sample usage instructions for running the inference demo as found in the repository's README.
- The BibTeX citation from the paper.

Files changed (1) hide show
  1. README.md +47 -0
README.md ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ tags:
6
+ - reasoning
7
+ - agent
8
+ - chain-of-thought
9
+ ---
10
+
11
+ # ACTS: Agentic Chain-of-Thought Steering Controller
12
+
13
+ This repository contains the controller agent for **ACTS (Agentic Chain-of-Thought Steering)**, presented in the paper [Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning](https://huggingface.co/papers/2606.03965).
14
+
15
+ ACTS is a framework where a lightweight controller agent adaptively steers a frozen reasoner (such as DeepSeek-R1) step-by-step under a thinking-token budget. By formulating reasoning steering as a Markov decision process, the controller chooses a reasoning strategy and a short steering phrase at each step to enable controllable accuracy–efficiency trade-offs.
16
+
17
+ ## Resources
18
+ - **Paper:** [Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning](https://huggingface.co/papers/2606.03965)
19
+ - **Repository:** [Andree-9/ACTS](https://github.com/Andree-9/ACTS)
20
+ - **SFT Data:** [yuuxia/controller-sft-data](https://huggingface.co/datasets/yuuxia/controller-sft-data)
21
+
22
+ ## Quick Start Inference
23
+
24
+ To use this controller to steer a reasoner, follow the setup instructions in the [GitHub repository](https://github.com/Andree-9/ACTS) and run the following command:
25
+
26
+ ```bash
27
+ conda activate slime
28
+ ./scripts/run_acts_inference.sh \
29
+ --controller yuuxia/acts-controller \
30
+ --reasoner deepseek-ai/DeepSeek-R1-Distill-Qwen-7B \
31
+ --benchmark aime2024 \
32
+ --budget 10000
33
+ ```
34
+
35
+ ## Citation
36
+
37
+ ```bibtex
38
+ @misc{xia2026acts,
39
+ title={Agentic Chain-of-Thought Steering for Efficient and Controllable LLM Reasoning},
40
+ author={Yu Xia and Zhouhang Xie and Xin Xu and Byungkyu Kang and Prarit Lamba and Xiang Gao and Julian McAuley},
41
+ year={2026},
42
+ eprint={2606.03965},
43
+ archivePrefix={arXiv},
44
+ primaryClass={cs.CL},
45
+ url={https://arxiv.org/abs/2606.03965},
46
+ }
47
+ ```