G3nadh/gemma3-270m-tinystories

Text Generation
PyTorch
English
gemma3
language-model
pre-training
from-scratch
tinystories
transformer
multi-query-attention
sliding-window-attention
rope
gemma3-270m-tinystories
329 MB
  • 1 contributor
History: 2 commits
G3nadh
Upload folder using huggingface_hub
71496e2 verified about 2 months ago
  • .gitattributes
    1.57 kB
    Upload folder using huggingface_hub about 2 months ago
  • README.md
    3.58 kB
    Upload folder using huggingface_hub about 2 months ago
  • config.json
    387 Bytes
    Upload folder using huggingface_hub about 2 months ago
  • layer_types.json
    466 Bytes
    Upload folder using huggingface_hub about 2 months ago
  • loss_curves.png
    103 kB
    xet
    Upload folder using huggingface_hub about 2 months ago
  • lr_schedule.png
    40.9 kB
    Upload folder using huggingface_hub about 2 months ago
  • pytorch_model.bin

    Detected Pickle imports (3)

    • "collections.OrderedDict",
    • "torch.BFloat16Storage",
    • "torch._utils._rebuild_tensor_v2"

    329 MB
    xet
    Upload folder using huggingface_hub about 2 months ago
  • training_config.json
    451 Bytes
    Upload folder using huggingface_hub about 2 months ago
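
The "Detected Pickle imports" entry for pytorch_model.bin lists the globals that the checkpoint's pickle stream references. The listing does not say how that scan is done, so what follows is only a minimal sketch of the general idea using Python's standard pickletools module: walk the opcodes without unpickling anything and collect the GLOBAL references. The function name list_pickle_imports is hypothetical, and the sketch only handles the GLOBAL opcode (pickle protocols 0 to 3), not STACK_GLOBAL from protocol 4 and later.

```python
import collections
import io
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> list[str]:
    """Return the sorted 'module.name' globals referenced by a pickle stream.

    The stream is only disassembled, never unpickled, so no code from the
    pickle is executed. Hypothetical helper: only the GLOBAL opcode is handled.
    """
    found = set()
    for opcode, arg, _pos in pickletools.genops(io.BytesIO(data)):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name" (space-separated).
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
    return sorted(found)


if __name__ == "__main__":
    # Stand-in for a checkpoint: an OrderedDict pickled with protocol 2.
    sample = pickle.dumps(
        collections.OrderedDict(a=1), protocol=2, fix_imports=False
    )
    print(list_pickle_imports(sample))
```

To try this on a real checkpoint, one would first need to extract the pickle bytes. Assuming the file uses the zip-based format written by recent versions of torch.save, those bytes would be the data.pkl member of the archive, which the standard zipfile module can read.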