DotLM Collection SimpleThoughts data spans four stages—pretraining, SFT, alignment, and reasoning - training DotLM-165M to prioritize reasoning over memorization. • 2 items • Updated about 6 hours ago
DotLM Collection SimpleThoughts data spans four stages—pretraining, SFT, alignment, and reasoning - training DotLM-165M to prioritize reasoning over memorization. • 2 items • Updated about 6 hours ago