G3nadh/gemma3-270m-tinystories
Tags: Text Generation, PyTorch, roneneldan/TinyStories, English, gemma3, language-model, pre-training, from-scratch, tinystories, transformer, multi-query-attention, sliding-window-attention, rope
License: apache-2.0
Branch: main / gemma3-270m-tinystories (329 MB)
1 contributor. History: 2 commits.
Latest commit: G3nadh, "Upload folder using huggingface_hub" (71496e2, verified), about 2 months ago
| File | Size | Notes | Last commit | Updated |
| --- | --- | --- | --- | --- |
| .gitattributes | 1.57 kB | Safe | Upload folder using huggingface_hub | about 2 months ago |
| README.md | 3.58 kB | | Upload folder using huggingface_hub | about 2 months ago |
| config.json | 387 Bytes | | Upload folder using huggingface_hub | about 2 months ago |
| layer_types.json | 466 Bytes | | Upload folder using huggingface_hub | about 2 months ago |
| loss_curves.png | 103 kB | xet | Upload folder using huggingface_hub | about 2 months ago |
| lr_schedule.png | 40.9 kB | | Upload folder using huggingface_hub | about 2 months ago |
| pytorch_model.bin | 329 MB | pickle, xet. Detected Pickle imports (3): "collections.OrderedDict", "torch.BFloat16Storage", "torch._utils._rebuild_tensor_v2" | Upload folder using huggingface_hub | about 2 months ago |
| training_config.json | 451 Bytes | | Upload folder using huggingface_hub | about 2 months ago |
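
The pickle imports reported for pytorch_model.bin are the module-level names a pickle stream asks the loader to import. As a rough illustration of how such a list can be produced without unpickling anything, here is a minimal sketch built on the standard-library `pickletools` module. It is not the Hub's scanner: the helper name `list_pickle_imports` is hypothetical, the handling of `STACK_GLOBAL` is a simple heuristic, and it assumes you already have the raw pickle bytes in hand.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> list[str]:
    """Return the 'module.name' globals a pickle stream references, without loading it."""
    found = []
    recent_strings = []  # string operands seen so far, needed for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the operand is "module name", separated by a space.
            module, name = arg.split(" ", 1)
            found.append(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are taken from the stack. Heuristic:
            # assume they are the two most recently pushed strings (this ignores
            # strings fetched back from the memo).
            if len(recent_strings) >= 2:
                found.append(f"{recent_strings[-2]}.{recent_strings[-1]}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return sorted(set(found))


# Demo on a harmless, locally created pickle (not on the model file itself).
demo = pickle.dumps(collections.OrderedDict(a=1), protocol=4)
print(list_pickle_imports(demo))
```

The three imports listed for this checkpoint are the ones a plain tensor state dict typically needs. When loading such a file with PyTorch, passing `weights_only=True` to `torch.load` restricts unpickling to tensor-related types, which is a reasonable precaution for any pickle-based checkpoint downloaded from the internet.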