[MyLLM](https://github.com/LF-Luis/MyLLM) is a personal deep-learning project in which I built a modern LLM (`LF_LLM-269M`) from the ground up. I focused on developing the core components required to pre-train an LLM: writing the model-architecture code, handling large datasets, training the model efficiently, and evaluating its performance.
For more information on the model itself and how it was trained, see https://github.com/LF-Luis/MyLLM.
The model was trained on `2024_12_07-05_14_UTC`.
---
license: gpl-3.0
---