Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

alexgara
/
llama-124m

Text Generation
PyTorch
TensorBoard
English
transformer
language-model
from-scratch
llama
decoder-only
causal-lm
Eval Results (legacy)
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-124m
85.4 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 17 commits
alexgara's picture
alexgara
Upload README.md with huggingface_hub
9d98c01 verified 25 days ago
  • config
    Upload config/llama_124M.json with huggingface_hub 25 days ago
  • data
    Upload data/val_10bt.npy with huggingface_hub 25 days ago
  • img
    Upload img/llama_124m.mp4 with huggingface_hub 25 days ago
  • runs
    Upload runs/events.out.tfevents.1773320553.af119ced32ad.13976.0 with huggingface_hub 25 days ago
  • .gitattributes
    1.67 kB
    Upload img/llama_124m.mp4 with huggingface_hub 25 days ago
  • README.md
    4.72 kB
    Upload README.md with huggingface_hub 25 days ago
  • model.pt

    Detected Pickle imports (3)

    • "torch.FloatStorage",
    • "collections.OrderedDict",
    • "torch._utils._rebuild_tensor_v2"

    What is a pickle import?

    498 MB
    xet
    Upload model.pt with huggingface_hub 25 days ago
  • params.json
    955 Bytes
    Upload params.json with huggingface_hub 25 days ago
  • tokenizer.model
    786 kB
    xet
    Upload tokenizer.model with huggingface_hub 25 days ago
  • tokenizer.vocab
    503 kB
    Upload tokenizer.vocab with huggingface_hub 25 days ago