T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning Paper • 2603.03790 • Published Mar 4, 2026 • 122
DraftAttention: Fast Video Diffusion via Low-Resolution Attention Guidance Paper • 2505.14708 • Published May 17, 2025
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation Paper • 2510.00515 • Published Oct 1, 2025 • 41
The Geometry of Reasoning: Flowing Logics in Representation Space Paper • 2510.09782 • Published Oct 10, 2025 • 7
Why Do Transformers Fail to Forecast Time Series In-Context? Paper • 2510.09776 • Published Oct 10, 2025 • 3
LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers Paper • 2412.12444 • Published Dec 17, 2024
Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs Paper • 2506.00577 • Published May 31, 2025 • 12
Multi-Layer Transformers Gradient Can be Approximated in Almost Linear Time Paper • 2408.13233 • Published Aug 23, 2024 • 23
The Effect of Intrinsic Dataset Properties on Generalization: Unraveling Learning Differences Between Natural and Medical Images Paper • 2401.08865 • Published Jan 16, 2024