Update README.md
README.md CHANGED
@@ -758,7 +758,7 @@ We did the pretraining on a single RTX 5060 Ti 16GB for 30,000 iterations for ~3
 Our final `val loss` value was **3.0450** and our final `train loss` was **3.0719**.

 ## 5.2 Finetuning results
-After pretraining, we finetuned our model for
+After pretraining, we finetuned our model for 1500 iterations for ~3 hours:
 1. Final `val loss`: **?**
 2. Final `train loss`: **?**

@@ -773,7 +773,7 @@ We tested our finetuned model a lot:
 1. Andrej Karpathy for his nanoGPT code and his YouTube videos in the makemore series
 2. HuggingFaceFW for the FineWeb-Edu sample-10BT training dataset
 3. Yahma for the alpaca-cleaned dataset for the finetuning
-4. My dad for his support
+4. My dad for his support <3
 5. My GPU for training and running my new model ;-)

 ---
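For reference, here is a minimal sketch of how the two datasets credited above can be pulled from the Hugging Face Hub with the `datasets` library; the `streaming` flag and the field peeks are illustrative assumptions, not this repo's actual training code:

```python
# Minimal sketch: load the pretraining and finetuning datasets named in the
# acknowledgements. Dataset IDs are the published Hub names; everything else
# (streaming, field access) is illustrative, not this repo's training code.
from datasets import load_dataset

# Pretraining corpus: the FineWeb-Edu 10BT sample, streamed to avoid a full download.
pretrain = load_dataset(
    "HuggingFaceFW/fineweb-edu", name="sample-10BT", split="train", streaming=True
)

# Finetuning data: alpaca-cleaned, with instruction/input/output fields.
finetune = load_dataset("yahma/alpaca-cleaned", split="train")

print(next(iter(pretrain))["text"][:200])  # peek at one pretraining document
print(finetune[0]["instruction"])          # peek at one instruction example
```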