Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models Paper • 2603.25750 • Published 11 days ago • 12
Representation Alignment for Just Image Transformers is not Easier than You Think Paper • 2603.14366 • Published 16 days ago • 9
InCoder-32B: Code Foundation Model for Industrial Scenarios Paper • 2603.16790 • Published 14 days ago • 306
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published 15 days ago • 150
User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale Paper • 2601.08225 • Published Jan 13 • 53
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published Jan 8 • 229
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published Dec 18, 2025 • 96
PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation Paper • 2512.24551 • Published Dec 31, 2025 • 21
AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents Paper • 2512.23343 • Published Dec 29, 2025 • 30
Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling Paper • 2512.23959 • Published Dec 30, 2025 • 111