SmolLM-135M fine-tuned on TinyStories and stored in full precision (BF16). It was trained for 12k steps on 200k training stories and evaluated on the published validation split, where it reaches a perplexity of about 6.3.
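
For readers comparing this checkpoint against quantized variants, it may help to recall how a perplexity figure relates to the training loss. The sketch below uses the standard definition (the exponential of the mean per-token negative log-likelihood in nats); the function name and the example loss values are hypothetical and are not taken from the model's actual evaluation run.

```python
import math

def perplexity(token_nlls):
    """Return exp(mean negative log-likelihood), with losses given in nats."""
    if not token_nlls:
        raise ValueError("need at least one token loss")
    return math.exp(sum(token_nlls) / len(token_nlls))

# Hypothetical per-token losses, chosen only for illustration.
example_nlls = [1.2, 2.4, 1.9, 1.86]
print(f"perplexity of example losses: {perplexity(example_nlls):.2f}")

# Going the other way: the mean loss implied by a reported perplexity.
print(f"mean loss for perplexity 6.3: {math.log(6.3):.2f} nats")
```

Under this definition, a perplexity of about 6.3 corresponds to a mean cross-entropy of roughly 1.84 nats per token.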

See ./generate_tinystories_fullprec.py for a simple demo. This model is intended only for generating toy story examples and for comparing quantization techniques.
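
The demo script is not reproduced here. As a rough idea of what loading and sampling from this checkpoint could look like, here is a minimal sketch using the `transformers` library. It assumes the repository ID `Dominic/smollm135_fullprec_tinystories` shown on this page, that `torch` and `transformers` are installed, and that the sampling settings (which are illustrative guesses, not the author's) suit the model. The function name `generate_story` is hypothetical.

```python
MODEL_ID = "Dominic/smollm135_fullprec_tinystories"  # repository ID from this page

def generate_story(prompt, max_new_tokens=120, temperature=0.8, top_p=0.95):
    """Sample a short continuation of `prompt` from the fine-tuned model."""
    # Imported lazily so the heavy dependencies load only when generating.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # Keep the weights in BF16, matching the stored tensor type.
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=torch.bfloat16)
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt")
    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            do_sample=True,              # illustrative sampling settings
            temperature=temperature,
            top_p=top_p,
            pad_token_id=tokenizer.eos_token_id,
        )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_story("Once upon a time")` downloads the weights on first use and returns the prompt followed by a sampled continuation. Because sampling is enabled, the output differs from run to run.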

Safetensors: model size 0.1B params, tensor type BF16

Model tree for Dominic/smollm135_fullprec_tinystories:
Base model: HuggingFaceTB/SmolLM-135M
Finetuned (122): this model