farpluto/zubenelgenubi-124m

Text Generation
Safetensors
English
gpt2
knowledge-distillation
symbolic-reasoning
from-scratch
conversational
zubenelgenubi-124m (989 MB)
  • 1 contributor
History: 2 commits
farpluto
Upload 124M GPT trained from scratch with SmolLM distillation
ca40472 verified 14 days ago
  • .gitattributes
    1.52 kB
    initial commit 14 days ago
  • README.md
    761 Bytes
    Upload 124M GPT trained from scratch with SmolLM distillation 14 days ago
  • chat_template.jinja
    196 Bytes
    Upload 124M GPT trained from scratch with SmolLM distillation 14 days ago
  • config.json
    822 Bytes
    Upload 124M GPT trained from scratch with SmolLM distillation 14 days ago
  • generation_config.json
    194 Bytes
    Upload 124M GPT trained from scratch with SmolLM distillation 14 days ago
  • model.safetensors
    493 MB
    xet
    Upload 124M GPT trained from scratch with SmolLM distillation 14 days ago
  • raw_model.pt

    Detected Pickle imports (3)

    • "torch.FloatStorage",
    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict"

    492 MB
    xet
    Upload 124M GPT trained from scratch with SmolLM distillation 14 days ago
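
The pickle imports reported for raw_model.pt are the globals that the file would import when unpickled. A minimal sketch of how such a list could be derived with the Python standard library, without executing the pickle, is shown here. The function name `list_pickle_imports` is hypothetical, the handling of `STACK_GLOBAL` is a simple heuristic, and this is not necessarily how the Hub's own scanner works. It is assumed that for a zip-format PyTorch checkpoint the bytes would come from the pickle entry inside the archive.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set[str]:
    """Return the 'module.name' globals a pickle stream refers to, without unpickling it."""
    imports = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name", space-separated
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Heuristic: module and name are usually the two most recent strings.
            # This can miss cases where they are fetched from the memo instead.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


if __name__ == "__main__":
    # Demo on an in-memory pickle rather than a real checkpoint file.
    demo = pickle.dumps(collections.OrderedDict(a=1), protocol=4)
    print(sorted(list_pickle_imports(demo)))
```

Because model.safetensors holds the same upload in a format that is not pickle-based, it is generally the safer file to load when both are available.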
  • tokenizer.json
    3.52 MB
    Upload 124M GPT trained from scratch with SmolLM distillation 14 days ago
  • tokenizer_config.json
    405 Bytes
    Upload 124M GPT trained from scratch with SmolLM distillation 14 days ago
  • training_config.json
    334 Bytes
    Upload 124M GPT trained from scratch with SmolLM distillation 14 days ago