Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
epoyraz
/
tinystories-25m
like
2
Text Generation
PyTorch
roneneldan/TinyStories
English
gpt
tinystories
from-scratch
rope
qk-norm
muon
multi-token-prediction
arxiv:
4 papers
License:
mit
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
tinystories-25m
78 MB
Ctrl+K
Ctrl+K
1 contributor
History:
4 commits
epoyraz
Upgrade to modded+Muon+zero-init checkpoint (val 2.65 -> 2.40)
4cae3f7
verified
1 day ago
.gitattributes
Safe
1.52 kB
initial commit
1 day ago
README.md
4.14 kB
Upgrade to modded+Muon+zero-init checkpoint (val 2.65 -> 2.40)
1 day ago
config.json
371 Bytes
Upgrade to modded+Muon+zero-init checkpoint (val 2.65 -> 2.40)
1 day ago
model.py
42.7 kB
Upgrade to modded+Muon+zero-init checkpoint (val 2.65 -> 2.40)
1 day ago
tinystories-25m.pt
76.8 MB
xet
Upgrade to modded+Muon+zero-init checkpoint (val 2.65 -> 2.40)
1 day ago
tokenizer.json
1.14 MB
Add TinyStories GPT (19M) checkpoint, model code, tokenizer, and card
1 day ago