Mamba-3: Improved Sequence Modeling using State Space Principles Paper • 2603.15569 • Published 3 days ago • 4 • 1
view changelog Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 9 days ago • 175
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 16 days ago • 97 • 5
Running 384 Visualize Dataset (v2.0+ latest dataset format) 💻 384 Visualize LeRobot datasets in an interactive web tool
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published 20 days ago • 95 • 3
Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published Feb 13 • 34 • 3