Spaces:
Running
Running
metadata
title: LLM From Scratch
emoji: 🧠
colorFrom: green
colorTo: red
sdk: streamlit
sdk_version: 1.30.0
app_file: app.py
pinned: false
LLM From Scratch
Ibrahim Khalid
This repo is outdated due to github LFS limits
The updated project is now available on HuggingFace
The purpose of this project is to build a simple large language model from scratch.
This repo is following the guide from https://www.youtube.com/watch?v=UU1WVnMk4E8
In this repo:
- ./shakespeare.txt - This is a sample text used for training a smaller scale model
- ./bigram_testing.sync.ipynb - This notebook is where I test a basic BiGram model
- ./gpt_shakespeare.sync.ipynb - Notebook implementing simple GPT model using entire works of shakespeare
- ./gpt_openwebtext.sync.ipynb - Notebook implementing GPT model based on the OpenWebText Corpus
Prepare environment
pip install -r ./requirements-base.txt
pip install -r ./requirements-pytorch.txt
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference