llm-from-scratch / readme.md
Ibrahim Mansoor Khalid
Update readme.md
cd810f2
|
raw
history blame
788 Bytes

LLM From Scratch

Ibrahim Khalid


This repo is outdated due to github LFS limits
The updated project is now available on HuggingFace


The purpose of this project is to build a simple large language model from scratch.

This repo is following the guide from https://www.youtube.com/watch?v=UU1WVnMk4E8

In this repo:

  • ./shakespeare.txt - This is a sample text used for training a smaller scale model
  • ./bigram_testing.sync.ipynb - This notebook is where I test a basic BiGram model
  • ./gpt_shakespeare.sync.ipynb - Notebook implementing simple GPT model using entire works of shakespeare

Prepare environment

pip install -r ./requirements-base.txt  
pip install -r ./requirements-pytorch.txt