MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing Paper β’ 2509.22186 β’ Published Sep 26, 2025 β’ 162
view article Article Tiny Agents: an MCP-powered agent in 50 lines of code julien-c β’ Apr 25, 2025 β’ 308
view article Article SmolVLM2: Bringing Video Understanding to Every Device +5 orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova β’ Feb 20, 2025 β’ 337
π΅ The MusicBox Collection A collection full of musical tasks demos, for musicians & music enthusiasts β’ 39 items β’ Updated 23 days ago β’ 33
Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations Paper β’ 2410.10792 β’ Published Oct 14, 2024 β’ 31
How Do Your Code LLMs Perform? Empowering Code Instruction Tuning with High-Quality Data Paper β’ 2409.03810 β’ Published Sep 5, 2024 β’ 35
Configurable Foundation Models: Building LLMs from a Modular Perspective Paper β’ 2409.02877 β’ Published Sep 4, 2024 β’ 32
MobileQuant: Mobile-friendly Quantization for On-device Language Models Paper β’ 2408.13933 β’ Published Aug 25, 2024 β’ 16