Llms and reasoning
updated
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Paper
• 2501.09686
• Published • 41
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Paper
• 2501.12948
• Published • 447
Chain-of-Retrieval Augmented Generation
Paper
• 2501.14342
• Published • 58
RL + Transformer = A General-Purpose Problem Solver
Paper
• 2501.14176
• Published • 28
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction
Paper
• 2502.07316
• Published • 50
Logical Reasoning in Large Language Models: A Survey
Paper
• 2502.09100
• Published • 24
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
Paper
• 2502.09601
• Published • 14
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced
Chain-of-Thought in Large Language Models
Paper
• 2502.09390
• Published • 16
Small Models Struggle to Learn from Strong Reasoners
Paper
• 2502.12143
• Published • 39
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement
Learning
Paper
• 2502.14768
• Published • 47
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via
GRPO
Paper
• 2502.14669
• Published • 15
Self-rewarding correction for mathematical reasoning
Paper
• 2502.19613
• Published • 82
R1-Searcher: Incentivizing the Search Capability in LLMs via
Reinforcement Learning
Paper
• 2503.05592
• Published • 27
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale
Reinforcement Learning
Paper
• 2503.07365
• Published • 61
A Simple "Try Again" Can Elicit Multi-Turn LLM Reasoning
Paper
• 2507.14295
• Published • 14