File size: 1,190 Bytes
2aaeb80
029c3bf
 
2aaeb80
 
 
 
 
 
 
 
b1f2aeb
3209fc4
b1f2aeb
 
3209fc4
cd810f2
b1f2aeb
 
 
 
 
68d81c9
3209fc4
 
 
 
 
68d81c9
 
3209fc4
7554be3
3209fc4
 
7554be3
3209fc4
2aaeb80
3209fc4
 
2aaeb80
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
---
title: LLM From Scratch
emoji: 🧠
colorFrom: green
colorTo: red
sdk: streamlit
sdk_version: 1.30.0
app_file: app.py
pinned: false
---

# LLM From Scratch

_Ibrahim Khalid_

The hosted project is available on [HuggingFace](https://huggingface.co/spaces/ibrahimmkhalid/llm-from-scratch)

The purpose of this project is to build a simple large language model from scratch.

This repo is following the guide from https://www.youtube.com/watch?v=UU1WVnMk4E8

In this repo:

- ./shakespeare.txt - This is a sample text used for training a smaller scale model
- ./bigram/bigram_testing.sync.ipynb - This notebook is where I test a basic BiGram model
- ./bigram/bigram_testing.sync.py - Python script for BiGram testing
- ./simple_gpt/gpt_shakespeare.sync.ipynb - Notebook implementing simple GPT model using entire works of shakespeare
- ./simple_gpt/gpt_shakespeare.sync.py - Python script for GPT shakespeare model

### Prepare environment

```
pip install -r ./pre-requirements.txt
pip install -r ./requirements.txt
```

---

[On GitHub](https://github.com/ibrahimmkhalid/llm-from-scratch)

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference