Test-Time-Scaling-Papers MUR: Momentum Uncertainty guided Reasoning for Large Language Models Paper • 2507.14958 • Published Jul 20 • 46
MUR: Momentum Uncertainty guided Reasoning for Large Language Models Paper • 2507.14958 • Published Jul 20 • 46
Indic-SLMs A collection of general and task specific small language models trained on marathi sky-2002/Marathi-SmolLM2-145M Text Generation • 0.1B • Updated Oct 6 • 13 sky-2002/Marathi-SmolLM2-145M-Instruct Text Generation • Updated May 11 • 9
RL-Papers Group Sequence Policy Optimization Paper • 2507.18071 • Published Jul 24 • 316 DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 138
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 138
Test-Time-Scaling-Papers MUR: Momentum Uncertainty guided Reasoning for Large Language Models Paper • 2507.14958 • Published Jul 20 • 46
MUR: Momentum Uncertainty guided Reasoning for Large Language Models Paper • 2507.14958 • Published Jul 20 • 46
RL-Papers Group Sequence Policy Optimization Paper • 2507.18071 • Published Jul 24 • 316 DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 138
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 138
Indic-SLMs A collection of general and task specific small language models trained on marathi sky-2002/Marathi-SmolLM2-145M Text Generation • 0.1B • Updated Oct 6 • 13 sky-2002/Marathi-SmolLM2-145M-Instruct Text Generation • Updated May 11 • 9