Upload 124M GPT trained from scratch with SmolLM distillation ca40472 verified farpluto commited on 14 days ago