ares0728/ARES-Coldstart-7B

ares0728 committed on Oct 12, 2025

Commit 2bbe456 · verified · 1 parent: 5ba7f31

Create README.md

Files changed (1)

README.md +31 -0

README.md ADDED

	@@ -0,0 +1,31 @@

+🌟 **ARES** — Adaptive Multimodal Reasoning Framework
+Two-stage adaptive reasoning: cold-start + entropy-shaped RL.
+🔑 Highlights
+Balanced reasoning across easy & hard tasks via token-level entropy shaping.
+SOTA efficiency–accuracy tradeoffs on diverse multimodal and textual benchmarks.
+📚 Training Pipeline
+1. **Adaptive Cold-Start**: curate difficulty-aware reasoning traces.
+2. **Entropy-Shaped RL (AEPO)**: trigger exploration at high window-entropy tokens and shape it with hierarchical rewards.
+📂 Resources
+- **Paper**: ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping
+- **Code**: [GitHub – shawn0728/ARES](https://github.com/shawn0728/ARES)
+📌 Citation
+```
+@misc{chen2025aresmultimodaladaptivereasoning,
+      title={ARES: Multimodal Adaptive Reasoning via Difficulty-Aware Token-Level Entropy Shaping},
+      author={Shuang Chen and Yue Guo and Yimeng Ye and Shijue Huang and Wenbo Hu and Haoxi Li and Manyuan Zhang and Jiayu Chen and Song Guo and Nanyun Peng},
+      year={2025},
+      eprint={2510.08457},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2510.08457},
+}
+```
+---
+Give **ARES** a shot and tell us what reasoning challenges it helps you solve! 🚀
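
The README's second training stage mentions triggering exploration from windowed token entropy. The sketch below illustrates one plausible reading of that idea: compute per-token entropy from logits, average it over a trailing window, and flag the highest-scoring positions. It is not taken from the ARES codebase; the function names, the window size, and the quantile threshold are all assumptions made for illustration.

```python
import numpy as np


def token_entropy(logits):
    """Shannon entropy (in nats) of the softmax distribution at each position.

    logits: array of shape (T, V), one row of vocabulary logits per token.
    """
    z = logits - logits.max(axis=-1, keepdims=True)  # stabilise the softmax
    logp = z - np.log(np.exp(z).sum(axis=-1, keepdims=True))
    return -(np.exp(logp) * logp).sum(axis=-1)


def window_entropy(entropy, window=4):
    """Mean entropy over a trailing window ending at each token.

    The first few positions use a shorter window (hypothetical choice).
    """
    csum = np.concatenate([[0.0], np.cumsum(entropy)])
    end = np.arange(1, len(entropy) + 1)
    start = np.maximum(end - window, 0)
    return (csum[end] - csum[start]) / (end - start)


def exploration_triggers(logits, window=4, quantile=0.9):
    """Boolean mask of positions whose windowed entropy is in the top tail.

    The quantile cut-off is an assumed stand-in for whatever rule AEPO uses.
    """
    h = window_entropy(token_entropy(logits), window)
    return h >= np.quantile(h, quantile)
```

Averaging over a window smooths out single-token spikes, so a flagged position reflects sustained uncertainty rather than one noisy step. How the flagged positions feed into the reward is left out here because the README does not describe it.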