The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published 8 days ago • 67
Co-GRPO: Co-Optimized Group Relative Policy Optimization for Masked Diffusion Model Paper • 2512.22288 • Published Dec 25, 2025 • 2
MemoryVLA Collection Checkpoints, data and logs of MemoryVLA & MemoryVLA+. https://github.com/shihao1895/MemoryVLA • 20 items • Updated 21 days ago • 7
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance Paper • 2509.26231 • Published Sep 30, 2025 • 18
LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS Paper • 2507.07136 • Published Jul 9, 2025 • 40
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2, 2025 • 188
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? Paper • 2504.13837 • Published Apr 18, 2025 • 139
CODA: Repurposing Continuous VAEs for Discrete Tokenization Paper • 2503.17760 • Published Mar 22, 2025 • 4
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models Paper • 2503.10437 • Published Mar 13, 2025 • 34
SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy Segment Optimization Paper • 2410.21411 • Published Oct 28, 2024 • 19