roneneldan/TinyStories
Viewer • Updated • 2.14M • 88.6k • 985
This is a custom implementation of Gemma-3 270M parameter model fine-tuned on the TinyStories dataset.
# Note: This model requires the custom Gemma3Model class from the training notebook
# You'll need to copy the model definition to use this model