LLM From Scratch (Educational Model)

I trained this model while following the Train LLM From Scratch guide. It is intended primarily for educational and demonstration purposes. While it is not suitable for production use, feel free to utilize it for your own presentations or to save yourself the time and resources of training a model from zero.

The model was trained on a subset (about 20GB) of the The Pile (Uncopyrighted).

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train plamentotev/llm-from-scratch