Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
jacksuuuu
/
tinystories
like
1
Text Generation
Transformers
PyTorch
roneneldan/TinyStories
English
nanogpt
gpt
pre-ln
causal-lm
custom_code
License:
mit
Model card
Files
Files and versions
xet
Community
6
Deploy
Use this model
08dbe3c
tinystories
360 MB
Ctrl+K
Ctrl+K
1 contributor
History:
11 commits
jacksuuuu
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
08dbe3c
verified
5 months ago
__pycache__
Upload model - 35000 iterations, loss: 3.4640
5 months ago
.gitattributes
Safe
1.52 kB
initial commit
6 months ago
README.md
Safe
11.7 kB
Upload model - 35000 iterations, loss: 3.4640
6 months ago
README_SCRIPTS.md
Safe
6.66 kB
Upload model - 35000 iterations, loss: 3.4640
6 months ago
add_tokenizer.py
Safe
3.85 kB
Upload model - 35000 iterations, loss: 3.4640
5 months ago
config.json
Safe
464 Bytes
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago
convert_to_hf.py
Safe
12.1 kB
Upload model - 35000 iterations, loss: 3.4640
5 months ago
generation_config.json
Safe
119 Bytes
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago
merges.txt
Safe
456 kB
Upload merges.txt with huggingface_hub
5 months ago
model.safetensors
212 MB
xet
Upload model - 20000 iterations, loss: 0.7583
5 months ago
modeling_nanogpt.py
Safe
9.1 kB
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago
publish_model.py
Safe
4.97 kB
Upload model - 35000 iterations, loss: 3.4640
6 months ago
pytorch_model.bin
Safe
pickle
Detected Pickle imports (3)
"torch.FloatStorage"
,
"collections.OrderedDict"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
143 MB
xet
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago
requirements.txt
Safe
361 Bytes
Upload model - 35000 iterations, loss: 3.4640
6 months ago
special_tokens_map.json
Safe
438 Bytes
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago
test_model.py
Safe
4.97 kB
Upload model - 35000 iterations, loss: 3.4640
6 months ago
tokenizer.json
Safe
3.56 MB
Upload tokenizer.json with huggingface_hub
5 months ago
tokenizer_config.json
Safe
545 Bytes
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago
training_metadata.json
Safe
445 Bytes
Upload model - 20000 iterations, loss: 0.7583
5 months ago
upload_to_hf.py
Safe
6.82 kB
Upload model - 35000 iterations, loss: 3.4640
6 months ago
vocab.json
Safe
999 kB
Update to checkpoint 35000 with fixed Pre-LN architecture (iter 35k, loss 3.46)
5 months ago