papers
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization
Paper
• 2507.15758
• Published
• 35
Hierarchical Budget Policy Optimization for Adaptive Reasoning
Paper
• 2507.15844
• Published
• 17
DriftMoE: A Mixture of Experts Approach to Handle Concept Drifts
Paper
• 2507.18464
• Published
• 12
Finding Dori: Memorization in Text-to-Image Diffusion Models Is Less Local Than Assumed
Paper
• 2507.16880
• Published
• 7
Beyond Context Limits: Subconscious Threads for Long-Horizon Reasoning
Paper
• 2507.16784
• Published
• 122
RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback
Paper
• 2507.15024
• Published
• 14
Re:Form -- Reducing Human Priors in Scalable Formal Software Verification with RL in LLMs: A Preliminary Study on Dafny
Paper
• 2507.16331
• Published
• 22
When Tokens Talk Too Much: A Survey of Multimodal Long-Context Token Compression across Images, Videos, and Audios
Paper
• 2507.20198
• Published
• 28