Olmo 3 Post-training Collection All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated Dec 23, 2025 • 56
ChemPile Collection The ChemPile is a dataset with over 77 billion curated multimodal tokens about chemistry. For more information, visit https://chempile.lamalab.org/. • 8 items • Updated May 5 • 19
view article Article Everything You Need to Know about Knowledge Distillation Kseniase • Mar 6, 2025 • 82
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language Paper • 2406.05629 • Published Jun 9, 2024 • 8
view article Article RAG Empowerment: Cohere C4AI Command-R and Transformers Unveiled Andyrasika • Apr 7, 2024 • 10