Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing Paper ⢠2602.03845 ⢠Published Feb 3 ⢠27
VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning Paper ⢠2510.01444 ⢠Published Oct 1, 2025 ⢠20
CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models Paper ⢠2509.09675 ⢠Published Sep 11, 2025 ⢠28
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper ⢠2509.07980 ⢠Published Sep 9, 2025 ⢠105
R1-RE: Cross-Domain Relationship Extraction with RLVR Paper ⢠2507.04642 ⢠Published Jul 7, 2025 ⢠7
Learning to Reason via Mixture-of-Thought for Logical Reasoning Paper ⢠2505.15817 ⢠Published May 21, 2025 ⢠18
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation Paper ⢠2503.06594 ⢠Published Mar 9, 2025 ⢠6
Towards Optimal Multi-draft Speculative Decoding Paper ⢠2502.18779 ⢠Published Feb 26, 2025 ⢠5
Asymmetric Conflict and Synergy in Post-training for LLM-based Multilingual Machine Translation Paper ⢠2502.11223 ⢠Published Feb 16, 2025 ⢠1