MolmoAct2 Models Collection Collection of the base models for MolmoAct2 • 6 items • Updated May 5 • 23
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 244
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated Apr 22 • 45
💧 LFM2.5 Collection Collection of post-trained and base LFM2.5 models. • 14 items • Updated about 4 hours ago • 159
AR-Lightx2v Collection Efficient autoregressive video generation (i.e., the Self-Forcing family) checkpoints. • 2 items • Updated 29 days ago • 3
Granite Vision Collection Multimodal models built for visual document analysis and image understanding. • 7 items • Updated May 22 • 43
Unsloth Diffusion GGUFs Collection Find GGUFs and other variants of diffusion based models like Qwen-Image and FLUX. • 20 items • Updated 11 days ago • 90
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 10 items • Updated about 1 month ago • 100
Granite 4.0 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated Apr 29 • 220
Transformers.js V4 demos Collection A collection of demos built with Transformers.js V4 • 24 items • Updated Apr 16 • 64
Article Make your ZeroGPU Spaces go brrr with ahead-of-time compilation +2 cbensimon, sayakpaul, linoyts, multimodalart • Sep 2, 2025 • 78
Cosmos-Reason2 Collection ⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3 • 8 items • Updated 14 days ago • 26
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 164