NextCoder Collection NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated Jul 9, 2025 • 74
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 205
Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model • Mar 10, 2025 • 146
🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19, 2025 • 182
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7, 2025 • 67
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published Jan 29, 2025 • 59
Cosmos Collection ⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 31 items • Updated 2 days ago • 299
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published Jan 1, 2025 • 109
Transformers Can Navigate Mazes With Multi-Step Prediction Paper • 2412.05117 • Published Dec 6, 2024 • 5
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 41
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 301
LayerSkip Collection Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 • 8 items • Updated Nov 21, 2024 • 48
Granite 3.0 Language Models Collection A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated Nov 17, 2025 • 101
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 654
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated Dec 23, 2025 • 309