| ๐ **ARES** โ Adaptive Multimodal Reasoning Framework | |
| Two-stage adaptive reasoning: cold-start + entropy-shaped RL. | |
| ๐ Highlights | |
| Balanced reasoning across easy & hard tasks via token-level entropy shaping. | |
| SOTA efficiencyโaccuracy tradeoffs on diverse multimodal and textual benchmarks. | |
| ๐ Training Pipeline | |
| 1. **Adaptive Cold-Start** โ curate difficulty-aware reasoning traces | |
| 2. **Entropy-Shaped RL (AEPO)** โ trigger exploration via high-window entropy, hierarchical rewards | |
| ๐ Resources | |
| - **Paper**: ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping :contentReference[oaicite:0]{index=0} | |
| - **Code**: [GitHub โ shawn0728/ARES](https://github.com/shawn0728/ARES) :contentReference[oaicite:1]{index=1} | |
| ๐ Citation | |
| ``` | |
| @misc{chen2025aresmultimodaladaptivereasoning, | |
| title={ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping}, | |
| author={Shuang Chen and Yue Guo and Yimeng Ye and Shijue Huang and Wenbo Hu and Haoxi Li and Manyuan Zhang and Jiayu Chen and Song Guo and Nanyun Peng}, | |
| year={2025}, | |
| eprint={2510.08457}, | |
| archivePrefix={arXiv}, | |
| primaryClass={cs.CL}, | |
| url={https://arxiv.org/abs/2510.08457}, | |
| } | |
| ``` | |
| --- | |
| Give **ARES** a shot and tell us what reasoning challenges it helps you solve! ๐ |