Fredtt3 committed · Commit 4c53cdb · verified · 1 Parent(s): e8bab74

Update README.md

Files changed (1)
  1. README.md +5 -0
README.md CHANGED
@@ -10,3 +10,8 @@ pipeline_tag: text-generation
 
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/63cb46191b705cc951e88e6c/2LZWEcQcoLIT4Fm1SHMCB.png)
+
+
+ I originally planned to upload a checkpoint trained for 5,000 steps, but I made a mistake and had to restart training from scratch, so I could only
+ save the 2,000-step checkpoint. As soon as I have access to an NVIDIA A100 again, I plan to train a competent 100M model at a
+ larger scale and with more data, followed by a 1B model.