jacksuuuu/tinystories
Tags: Text Generation, Transformers, PyTorch, roneneldan/TinyStories, English, nanogpt, gpt, pre-ln, causal-lm, custom_code
License: mit
Revision: refs/pr/1 (tinystories, 291 MB)
1 contributor
History: 23 commits
Latest commit: SFconvertbot, "Adding `safetensors` variant of this model" (1bc401a, verified), 4 months ago
| File | Size | Storage | Last commit message | Updated |
|---|---|---|---|---|
| .gitattributes | 1.52 kB | | initial commit | 5 months ago |
| README.md | 11.7 kB | | Upload model - 35000 iterations, loss: 3.4640 | 5 months ago |
| config.json | 464 Bytes | | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
| generation_config.json | 119 Bytes | | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
| merges.txt | 456 kB | | Upload merges.txt with huggingface_hub | 4 months ago |
| model.safetensors | 143 MB | xet | Adding `safetensors` variant of this model | 4 months ago |
| modeling_nanogpt.py | 9.1 kB | | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
| pytorch_model.bin | 143 MB | xet | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
| special_tokens_map.json | 438 Bytes | | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
| tokenizer.json | 3.56 MB | | Upload tokenizer.json with huggingface_hub | 4 months ago |
| tokenizer_config.json | 545 Bytes | | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |
| vocab.json | 999 kB | | Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46) | 4 months ago |