MUR: Momentum Uncertainty guided Reasoning for Large Language Models Paper β’ 2507.14958 β’ Published Jul 20, 2025 β’ 47 β’ 3
Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning Paper β’ 2504.08672 β’ Published Apr 11, 2025 β’ 55 β’ 2
$Ο$-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation Paper β’ 2503.13288 β’ Published Mar 17, 2025 β’ 51 β’ 2