Upload 124M GPT trained from scratch with SmolLM distillation ca40472 verified farpluto commited on 18 days ago