SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published Mar 14, 2025 • 138
Aligning Text, Code, and Vision: A Multi-Objective Reinforcement Learning Framework for Text-to-Visualization Paper • 2601.04582 • Published 25 days ago • 10
NeMo Curator - Classifier Models Collection Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 11 items • Updated 3 days ago • 25
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers • 25 items • Updated 3 days ago • 114
MolmoAct Collection All models for the MolmoAct (Multimodal Open Language Model for Action) release. • 10 items • Updated Dec 23, 2025 • 35
Pi0 Fast (previous) Collection Pretrained checkpoints for Pi0+FAST models • 1 item • Updated 23 days ago • 6
Article A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality • Oct 24, 2024 • 64
Article PyTorchModelHubMixin: Bridging the Gap for Custom AI Models on Hugging Face • Nov 11, 2024 • 20