farpluto
/

zubenelgenubi-124m

Text Generation

knowledge-distillation

symbolic-reasoning

Model card Files Files and versions

zubenelgenubi-124m

989 MB

Ctrl+K

Ctrl+K

1 contributor

History: 2 commits

farpluto's picture

Upload 124M GPT trained from scratch with SmolLM distillation

ca40472 verified 3 months ago

.gitattributes

1.52 kB
initial commit 3 months ago
README.md

761 Bytes
Upload 124M GPT trained from scratch with SmolLM distillation 3 months ago
chat_template.jinja

196 Bytes
Upload 124M GPT trained from scratch with SmolLM distillation 3 months ago
config.json

822 Bytes
Upload 124M GPT trained from scratch with SmolLM distillation 3 months ago
generation_config.json

194 Bytes
Upload 124M GPT trained from scratch with SmolLM distillation 3 months ago
model.safetensors

493 MB
xet

Upload 124M GPT trained from scratch with SmolLM distillation 3 months ago
raw_model.pt
Detected Pickle imports (3)
- "torch.FloatStorage",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict"
What is a pickle import?
492 MB
xet

Upload 124M GPT trained from scratch with SmolLM distillation 3 months ago
tokenizer.json

3.52 MB
Upload 124M GPT trained from scratch with SmolLM distillation 3 months ago
tokenizer_config.json

405 Bytes
Upload 124M GPT trained from scratch with SmolLM distillation 3 months ago
training_config.json

334 Bytes
Upload 124M GPT trained from scratch with SmolLM distillation 3 months ago