Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models Paper • 2604.02340 • Published 5 days ago • 6
COSPADI: Compressing LLMs via Calibration-Guided Sparse Dictionary Learning Paper • 2509.22075 • Published Sep 26, 2025 • 23
ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations Paper • 2505.02819 • Published May 5, 2025 • 26
Iterative Self-Training for Code Generation via Reinforced Re-Ranking Paper • 2504.09643 • Published Apr 13, 2025 • 34