LH-Tech-AI/Apex-1-Instruct-350M

Text Generation


LH-Tech-AI committed on Feb 11

Commit ccf1f50 · verified · 1 Parent(s): ca613a7

Update README.md

Files changed (1)

README.md +2 -2


@@ -764,11 +764,11 @@ if __name__ == "__main__":
 # 5. Our training results
 We did the pretraining on a single RTX 5060 Ti 16GB for 30,000 iterations for ~3 days.
-Out final `val loss` value was **3.0450** and our final `train loss` was **3.0719**.
+Out final `val loss` value was **2.8175** and our final `train loss` was **2.8008**.
 # 6. Thanks to...
 1. Andrej Karpathy for his nanoGPT Code and his YouTube Videos in the make-mode-series
-2. HugginfaceTW for the Fineweb-Edu-10BT-Sample Training Dataset
+2. HuggingfaceTW for the Fineweb-Edu-10BT-Sample Training Dataset
 3. Yahma for the alpaca-cleaned dataset for the finetuning
 4. My dad for his support <3
 5. My GPU for training and running my new model ;-)