Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Ashx098
/
Mini-LLM
like
4
Text Generation
Safetensors
PyTorch
English
doi:10.57967/hf/7332
llm
decoder-only
transformer
from-scratch
research
educational
80m
pretraining
custom-architecture
License:
mit
Model card
Files
Files and versions
xet
Community
Ashx098
commited on
Dec 8, 2025
Commit
6b16cfb
·
verified
·
1 Parent(s):
390f2ba
Upload folder using huggingface_hub
Browse files
Files changed (2)
hide
show
phase-1-pretraining/plots/loss_curve.png
+0
-0
phase-1-pretraining/plots/lr_curve.png
+0
-0
phase-1-pretraining/plots/loss_curve.png
ADDED
Viewed
phase-1-pretraining/plots/lr_curve.png
ADDED
Viewed