Upload 124M GPT trained from scratch with SmolLM distillation ca40472 verified farpluto commited on 17 days ago