Update README.md
README.md
CHANGED
|
@@ -677,11 +677,11 @@ if __name__ == "__main__":
|
|
| 677 |
# 5. Our training results
|
| 678 |
## 5.1 Pretraining results
|
| 679 |
|
| 680 |
-
We did the pretraining on a single RTX 5060 Ti 16GB for 30,000 iterations.
|
| 681 |
Our final `val loss` value was **3.0450** and our final `train loss` was **3.0719**.
|
| 682 |
|
| 683 |
## 5.2 Finetuning results
|
| 684 |
-
After pretraining, we finetuned our model for
|
| 685 |
1. Final `val loss`: **?**
|
| 686 |
2. Final `train loss`: **?**
|
| 687 |
|
|
|
|
| 677 |
# 5. Our training results
|
| 678 |
## 5.1 Pretraining results
|
| 679 |
|
| 680 |
+
We did the pretraining on a single RTX 5060 Ti 16GB for 30,000 iterations, which took ~3 days.
|
| 681 |
Our final `val loss` value was **3.0450** and our final `train loss` was **3.0719**.
|
| 682 |
|
| 683 |
## 5.2 Finetuning results
|
| 684 |
+
After pretraining, we finetuned our model for 1,000 iterations, which took ~2 hours:
|
| 685 |
1. Final `val loss`: **?**
|
| 686 |
2. Final `train loss`: **?**
|
| 687 |
|
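For readers who want to put the figures in this change into perspective, here is a minimal sketch that converts the reported iteration counts and wall-clock times into an approximate time per iteration, and the reported losses into perplexity. It is not part of the README. The helper names `seconds_per_iteration` and `perplexity` are hypothetical, the durations are the rounded "~3 days" and "~2 hours" from the diff, and the perplexity conversion assumes the loss is a mean per-token cross-entropy measured in nats, which the README excerpt does not state.

```python
import math


def seconds_per_iteration(iterations: int, wall_clock_hours: float) -> float:
    """Average wall-clock seconds spent per training iteration."""
    return wall_clock_hours * 3600.0 / iterations


def perplexity(loss_nats: float) -> float:
    """Perplexity for a mean cross-entropy loss given in nats (assumed unit)."""
    return math.exp(loss_nats)


# Figures taken from the diff; the durations are approximate ("~").
pretrain_s_per_it = seconds_per_iteration(30_000, 3 * 24)  # ~3 days
finetune_s_per_it = seconds_per_iteration(1_000, 2)        # ~2 hours

# Pretraining losses reported in section 5.1.
val_ppl = perplexity(3.0450)
train_ppl = perplexity(3.0719)

print(f"pretraining: {pretrain_s_per_it:.2f} s/iteration")
print(f"finetuning:  {finetune_s_per_it:.2f} s/iteration")
print(f"val perplexity:   {val_ppl:.2f}")
print(f"train perplexity: {train_ppl:.2f}")
```

Because the durations are rounded, the per-iteration times are rough estimates, useful for comparing runs and not as a benchmark.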