Commit ·
a940cbb
1
Parent(s): 0e9cf15
Update README.md
Browse files
README.md
CHANGED
|
@@ -9,7 +9,7 @@ Instruction Tuned GPT-NeoXT-20B model on Instruction Tuning dataset as listed be
|
|
| 9 |
**Base Model:** togethercomputer/GPT-NeoXT-Chat-Base-20B (GPT-NeoXT-Chat-Base-20B-v0.16 - fine-tuned on feedback data)
|
| 10 |
|
| 11 |
**Training Details :**
|
| 12 |
-
* Epochs:
|
| 13 |
* Batch Size : 5 instantaneous per device x 1 gradient accumulation steps x 8 gpus = 40
|
| 14 |
* Block Size : 2020
|
| 15 |
* Weight Decay : 0
|
|
|
|
| 9 |
**Base Model:** togethercomputer/GPT-NeoXT-Chat-Base-20B (GPT-NeoXT-Chat-Base-20B-v0.16 - fine-tuned on feedback data)
|
| 10 |
|
| 11 |
**Training Details :**
|
| 12 |
+
* Epochs: 4
|
| 13 |
* Batch Size : 5 instantaneous per device x 1 gradient accumulation steps x 8 gpus = 40
|
| 14 |
* Block Size : 2020
|
| 15 |
* Weight Decay : 0
|