ARES-Coldstart-7B / README.md
ares0728's picture
Create README.md
2bbe456 verified
๐ŸŒŸ **ARES** โ€” Adaptive Multimodal Reasoning Framework
Two-stage adaptive reasoning: cold-start + entropy-shaped RL.
๐Ÿ”‘ Highlights
Balanced reasoning across easy & hard tasks via token-level entropy shaping.
SOTA efficiencyโ€“accuracy tradeoffs on diverse multimodal and textual benchmarks.
๐Ÿ“š Training Pipeline
1. **Adaptive Cold-Start** โ€” curate difficulty-aware reasoning traces
2. **Entropy-Shaped RL (AEPO)** โ€” trigger exploration via high-window entropy, hierarchical rewards
๐Ÿ“‚ Resources
- **Paper**: ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping :contentReference[oaicite:0]{index=0}
- **Code**: [GitHub โ€“ shawn0728/ARES](https://github.com/shawn0728/ARES) :contentReference[oaicite:1]{index=1}
๐Ÿ“Œ Citation
```
@misc{chen2025aresmultimodaladaptivereasoning,
title={ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping},
author={Shuang Chen and Yue Guo and Yimeng Ye and Shijue Huang and Wenbo Hu and Haoxi Li and Manyuan Zhang and Jiayu Chen and Song Guo and Nanyun Peng},
year={2025},
eprint={2510.08457},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2510.08457},
}
```
---
Give **ARES** a shot and tell us what reasoning challenges it helps you solve! ๐Ÿš€