Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 4 days ago • 36
Bolmo Collection Artifacts for the Bolmo release: https://allenai.org/papers/bolmo. • 4 items • Updated 4 days ago • 11
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 6 items • Updated 3 days ago • 103
Pearl Collection PEARL: A Multimodal Culturally-Aware Arabic Instruction Dataset • 4 items • Updated Oct 27 • 6
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 25 days ago • 80
Devstral 2 Collection A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 3 items • Updated 18 days ago • 37
MobileLLM-R1 Collection MobileLLM-R1, a series of sub-billion parameter reasoning models • 10 items • Updated Nov 21 • 27
A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation Paper • 2310.16656 • Published Oct 25, 2023 • 51
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9 • 36
⚛️ Liquid Nanos Collection Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 22 items • Updated about 22 hours ago • 101
👁️ LFM2-VL Collection LFM2-VL is our first series of vision-language models, designed for on-device deployment. • 10 items • Updated about 22 hours ago • 60