UniPrefill: Universal Long-Context Prefill Acceleration via Block-wise Dynamic Sparsification Paper • 2605.06221 • Published 5 days ago • 19
Seedance 2.0: Advancing Video Generation for World Complexity Paper • 2604.14148 • Published 27 days ago • 157
view article Article ⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch zamal • Jun 28, 2025 • 41
FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling Paper • 2603.06199 • Published Mar 6 • 9
R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model Paper • 2503.05132 • Published Mar 7, 2025 • 57
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14, 2025 • 303
InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding Paper • 2403.01487 • Published Mar 3, 2024 • 16