Frustrated-B4S1C committed on
Commit b78b32a · verified · 1 Parent(s): ca44c07

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ GPT-2_scratch.jpg filter=lfs diff=lfs merge=lfs -text
.gitkeep ADDED
File without changes
GPT-2_scratch.jpg ADDED

Git LFS Details

  • SHA256: 784ee8659b95c2a615a715c775ecb0061252b157bfd60066877ca508080a60a9
  • Pointer size: 131 Bytes
  • Size of remote file: 176 kB
README.md CHANGED
@@ -1,3 +1,10 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
1
+ # GPT-2 From Scratch
2
+ This repo contains a custom implementation of [GPT-2](https://openai.com/index/gpt-2-1-5b-release/). It was created by following the playlist ["LLMs from Scratch" by Vizuara](https://youtube.com/playlist?list=PLPTV0NXA_ZSgsLAr8YCgCwhPIJNNtexWu&si=eksVKcxWNTVzJRUa). The original course keeps all of its code in an `ipynb` notebook.
3
+
4
+ While following the course, I modularized that implementation and also created an OpenAI-weights-compatible implementation (not covered in the course). In the course, OpenAI's TensorFlow weights were preprocessed and then injected into the custom implementation. My OpenAI-compatible implementation (located in the [compat folder](compat/)) instead uses a single larger (combined) K, Q, V matrix that aligns with the OpenAI weights, whereas the custom implementation from the course keeps these matrices separate.
5
+
6
+ The script used to map the OpenAI weights (PyTorch weights downloaded from Hugging Face) to my OpenAI-compatible implementation is [mapper_v3.py](./mapper_v3.py).
7
+
8
+ ![GPT-2 Architecture](./GPT-2_scratch.jpg)
9
+
10
+ Please check the [GitHub repo](https://github.com/B4S1C-Coder/GPT-2-from-scratch) for usage.
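
Reviewer note: the README contrasts one combined attention projection with separate per-projection matrices. The minimal sketch below illustrates how the two layouts might relate. It assumes the combined weight has shape `d_model x 3*d_model` with column blocks ordered query, key, value; that ordering is an assumption and should be checked against the actual checkpoint. The function name and toy matrix are hypothetical and are not taken from `mapper_v3.py`.

```python
def split_combined_qkv(weight, d_model):
    """Split a d_model x 3*d_model matrix into three d_model x d_model blocks.

    Assumption: the column blocks are ordered query, key, value.
    """
    if len(weight) != d_model or any(len(row) != 3 * d_model for row in weight):
        raise ValueError("expected a d_model x 3*d_model matrix")
    q = [row[:d_model] for row in weight]
    k = [row[d_model:2 * d_model] for row in weight]
    v = [row[2 * d_model:] for row in weight]
    return q, k, v


# Toy example with d_model = 2 (values are illustrative only).
combined = [
    [1, 2, 3, 4, 5, 6],
    [7, 8, 9, 10, 11, 12],
]
q, k, v = split_combined_qkv(combined, d_model=2)
```

The same column slicing applies to tensors. A compatible implementation can skip this split and load the combined matrix directly, which appears to be the approach the README describes for the compat folder.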
gpt2_124m_compat_openai.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ee616ca7ab582f584a74dbbed83803d87508f15fa2e88406a9dc75b28087cecb
3
+ size 702539117
gpt2_124m_converted.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f88ad435e5d6c2b1750c96c9999d1e220e3df6d22a6a288d7b6256b5ded125e5
3
+ size 497811599
gpt2_124m_openai_checkpoint.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:86a0214a6b2badaab581b375589790b1eaf4dc2844f69f6cf622c29d5f6f5406
3
+ size 497812255
gpt2_124m_reduced_ctx__checkpoint.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6b2d5485523c262903be8dfb30eca8d86455fe37041b7a5f9e81d49fdb4810e5
3
+ size 1952401499
gpt2_355m_compat_openai.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:91fe1e26031f70a5e7513d65336d4a6fb6039436a91a62490bb8fd87ff68d45d
3
+ size 1725924021
gpt2_355m_it_custom.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d6b241be4ac14c160782e243c29e6178acba7e4e546577191f80c24df0e94422
3
+ size 1725927081
gpt2_355m_openai_checkpoint.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e3e042dc1011e5433b6ae98ea5bb0855038a84d43e5a32ae42914868c37f6b84
3
+ size 1419396991
gpt2_spam_classif_weights.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9838aa44ee0879e206fcc6f11d1087fb6749d97020ef60ee04ee4019742f67bc
3
+ size 548158667
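
Reviewer note: each `.pth` file above is stored as a Git LFS pointer with three fields (`version`, `oid`, `size`). As a hedged sketch, a downloaded checkpoint could be checked against its pointer roughly as follows. The helper names are hypothetical, and the code assumes the pointer uses the `key value` line format shown in this diff.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def stream_matches_pointer(chunks, pointer):
    """Hash an iterable of byte chunks and compare size and digest to the pointer."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    for chunk in chunks:
        hasher.update(chunk)
        total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the gpt2_124m_compat_openai.pth hunk above.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:ee616ca7ab582f584a74dbbed83803d87508f15fa2e88406a9dc75b28087cecb
size 702539117
"""
pointer = parse_lfs_pointer(POINTER_TEXT)
```

For a real file, the chunks could come from reading the checkpoint in fixed-size blocks, for example `iter(lambda: f.read(1 << 20), b"")` on a file opened in binary mode, so the whole file never has to sit in memory.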