Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
jacksuuuu
/
tinystories
like
1
Text Generation
Transformers
PyTorch
roneneldan/TinyStories
English
nanogpt
gpt
pre-ln
causal-lm
custom_code
License:
mit
Model card
Files
Files and versions
xet
Community
6
Deploy
Use this model
d3cd026
tinystories
291 MB
Ctrl+K
Ctrl+K
1 contributor
History:
30 commits
SFconvertbot
Adding `safetensors` variant of this model
d3cd026
verified
5 months ago
.gitattributes
Safe
1.52 kB
initial commit
6 months ago
README.md
7.19 kB
Update model card: professional format, remove MLX version reference
5 months ago
config.json
Safe
603 Bytes
Add auto_map to config.json for automatic model loading
5 months ago
generation_config.json
Safe
119 Bytes
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago
merges.txt
Safe
456 kB
Upload merges.txt with huggingface_hub
5 months ago
model.safetensors
143 MB
xet
Adding `safetensors` variant of this model
5 months ago
modeling_nanogpt.py
Safe
9.1 kB
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago
pytorch_model.bin
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
143 MB
xet
Update to checkpoint 35000 (loss 3.46, includes distillation)
5 months ago
special_tokens_map.json
Safe
438 Bytes
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago
tokenizer.json
Safe
3.56 MB
Upload tokenizer.json with huggingface_hub
5 months ago
tokenizer_config.json
Safe
545 Bytes
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago
vocab.json
Safe
999 kB
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago