Update README.md
README.md
CHANGED
@@ -763,23 +763,10 @@ if __name__ == "__main__":
 ```
 
 # 5. Our training results
-## 5.1 Pretraining results
 We did the pretraining on a single RTX 5060 Ti 16GB for 30,000 iterations over ~3 days.
 Our final `val loss` was **3.0450** and our final `train loss` was **3.0719**.
 
-#
-After pretraining, we finetuned our model for 1,500 iterations over ~3 hours:
-1. Final `val loss`: **?**
-2. Final `train loss`: **?**
-
-# 6. Example prompts and results
-We tested our finetuned model extensively:
-
-1. Question: What is Artificial Intelligence?
---> Answer:
-2. ...
-
-# 7. Thanks to...
+# 6. Thanks to...
 1. Andrej Karpathy for his nanoGPT code and his YouTube videos in the makemore series
 2. HuggingFaceFW for the FineWeb-Edu 10BT sample training dataset
 3. Yahma for the alpaca-cleaned dataset used for finetuning
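The reported losses are more intuitive as perplexities: for a cross-entropy loss measured in nats, perplexity is simply `exp(loss)`. A minimal sketch (the two loss values come from the results above; the helper function is illustrative, not part of the repo):

```python
import math

def perplexity(cross_entropy_loss: float) -> float:
    """Convert a cross-entropy loss (in nats) to perplexity."""
    return math.exp(cross_entropy_loss)

# Final losses reported after pretraining
val_ppl = perplexity(3.0450)    # ~21.0
train_ppl = perplexity(3.0719)  # ~21.6
print(f"val perplexity:   {val_ppl:.1f}")
print(f"train perplexity: {train_ppl:.1f}")
```

So the pretrained model is, on average, about as uncertain as a uniform choice over ~21 tokens at each step.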