Parallel-Reasoning
/

llama-apr_cond10_grpo

Model card Files Files and versions

Add model card and metadata

#1

by nielsr HF Staff - opened Apr 24, 2025

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +47 -0

README.md ADDED Viewed

	@@ -0,0 +1,47 @@

+---
+pipeline_tag: text-generation
+library_name: transformers
+license: apache-2.0
+---
+<h1 align="center"> Learning Adaptive Parallel Reasoning <br> with Language Models </h1>
+<p align="center">
+  <a href="https://www.jiayipan.com/" style="text-decoration: none;">Jiayi Pan</a><sup>*</sup>,
+  <a href="https://xiuyuli.com/" style="text-decoration: none;">Xiuyu Li</a><sup>*</sup>,
+  <a href="https://tonylian.com/" style="text-decoration: none;">Long Lian</a><sup>*</sup>,
+  <a href="https://sea-snell.github.io/" style="text-decoration: none;">Charlie Victor Snell</a>,
+  <a href="https://yifeizhou02.github.io/" style="text-decoration: none;">Yifei Zhou</a>,<br>
+  <a href="https://www.adamyala.org/" style="text-decoration: none;">Adam Yala</a>,
+  <a href="https://people.eecs.berkeley.edu/~trevor/" style="text-decoration: none;">Trevor Darrell</a>,
+  <a href="https://people.eecs.berkeley.edu/~keutzer/" style="text-decoration: none;">Kurt Keutzer</a>,
+  <a href="https://www.alanesuhr.com/" style="text-decoration: none;">Alane Suhr</a>
+</p>
+<p align="center">
+    UC Berkeley and UCSF &nbsp;&nbsp;&nbsp;<sup>*</sup> Equal Contribution
+</p>
+<p align="center">
+<a href="https://arxiv.org/abs/2504.15466">📃 Paper</a>
+•
+<a href="https://github.com/Parallel-Reasoning/APR" >💻 Code</a>
+</p>
+![APR](./assets/apr.png)
+**TL;DR**:
+We present Adaptive Parallel Reasoning (APR), a novel framework that enables language models to learn to orchestrate both serialized and parallel computations. APR trains language models to use `spawn()` and `join()` operations through end-to-end supervised training and reinforcement learning, allowing models to dynamically orchestrate their own computational workflows.
+APR efficiently distributes compute, reduces latency, overcomes context window limits, and achieves state‑of‑the‑art performance on complex reasoning tasks (e.g., 83.4% vs. 60.0% accuracy at 4K context on Countdown).
+## Citation
+If you find this work useful in your research, please consider citing:
+```bibtex
+@article{pan2025learning,
+  title   = {Learning Adaptive Parallel Reasoning with Language Models},
+  author  = {Jiayi Pan and Xiuyu Li and Long Lian and Charlie Snell and Yifei Zhou and Adam Yala and Trevor Darrell and Kurt Keutzer and Alane Suhr},
+  year    = {2025},
+  journal = {arXiv preprint arXiv: 2504.15466}
+}
+```