SmolTulu - a SultanR Collection

SultanR 's Collections

SmolTulu

updated Dec 17, 2024

A collection of models that use SmolLM2 as the pretrained base in conjunction with AllenAI's Tulu 3 post training pipeline.

SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs

Paper • 2412.08347 • Published Dec 11, 2024 • 4
SultanR/SmolTulu-1.7b-Reinforced

Text Generation • 2B • Updated Dec 17, 2024 • 8 • 5
SultanR/SmolTulu-1.7b-Instruct

Text Generation • 2B • Updated Dec 17, 2024 • 100 • 13
SultanR/SmolTulu-1.7b-RM

Text Classification • 2B • Updated Dec 17, 2024 • 4 • 2
SultanR/SmolTulu-1.7b-Instruct-GGUF

Text Generation • 2B • Updated Dec 1, 2024 • 14 • 2
SultanR/SmolTulu-1.7b-Reinforced-GGUF

Text Generation • 2B • Updated Dec 17, 2024 • 10 • 1