jacksuuuu/tinystories
Tags: Text Generation, Transformers, PyTorch, roneneldan/TinyStories, English, nanogpt, gpt, pre-ln, causal-lm, custom_code
License: mit
Revision: refs/pr/3 (291 MB)
1 contributor
History: 30 commits
Latest commit: SFconvertbot, "Adding `safetensors` variant of this model" (d3cd026, verified), 4 months ago
| File | Size | Last commit message | Updated |
|---|---|---|---|
| .gitattributes | 1.52 kB | initial commit | 5 months ago |
| README.md | 7.19 kB | Update model card: professional format, remove MLX version reference | 4 months ago |
| config.json | 603 Bytes | Add auto_map to config.json for automatic model loading | 4 months ago |
| generation_config.json | 119 Bytes | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
| merges.txt | 456 kB | Upload merges.txt with huggingface_hub | 4 months ago |
| model.safetensors | 143 MB (xet) | Adding `safetensors` variant of this model | 4 months ago |
| modeling_nanogpt.py | 9.1 kB | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
| pytorch_model.bin | 143 MB (xet) | Update to checkpoint 35000 (loss 3.46, includes distillation) | 4 months ago |
| special_tokens_map.json | 438 Bytes | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
| tokenizer.json | 3.56 MB | Upload tokenizer.json with huggingface_hub | 4 months ago |
| tokenizer_config.json | 545 Bytes | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
| vocab.json | 999 kB | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
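
The listing shows a `custom_code` tag, a `modeling_nanogpt.py` file, and a commit that adds `auto_map` to `config.json`. Repos laid out this way are usually loaded through the `transformers` Auto classes with `trust_remote_code=True`. The following is a minimal sketch under assumptions, not the author's documented usage. It assumes that `auto_map` registers the model for `AutoModelForCausalLM` (the file contents are not shown here) and that the custom class supports `generate`. The helper names `build_load_kwargs`, `load_model_and_tokenizer`, and `generate_sample` are hypothetical.

```python
"""Sketch: loading a Hub repo that ships its own modeling code."""

REPO_ID = "jacksuuuu/tinystories"  # repo name taken from the listing
REVISION = "refs/pr/3"             # ref shown in the listing; "main" is the default branch


def build_load_kwargs(revision=REVISION):
    """Return keyword arguments for from_pretrained.

    trust_remote_code=True lets transformers import modeling_nanogpt.py
    from the repo, so only enable it for code you have reviewed.
    """
    return {"revision": revision, "trust_remote_code": True}


def load_model_and_tokenizer(repo_id=REPO_ID, revision=REVISION):
    """Download and instantiate the model and tokenizer (needs network access)."""
    # Imported lazily so the module can be inspected without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(repo_id, revision=revision)
    # Assumption: auto_map in config.json maps AutoModelForCausalLM to the custom class.
    model = AutoModelForCausalLM.from_pretrained(repo_id, **build_load_kwargs(revision))
    return model, tokenizer


def generate_sample(prompt="Once upon a time", max_new_tokens=40):
    """Generate a short continuation; assumes the custom model implements generate()."""
    model, tokenizer = load_model_and_tokenizer()
    input_ids = tokenizer(prompt, return_tensors="pt")["input_ids"]
    output_ids = model.generate(input_ids, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_sample()` downloads the weights from the pinned revision. Pinning `revision` to a commit hash instead of a moving ref keeps the remote code you reviewed from changing underneath you.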