Update README.md
Browse files
README.md
CHANGED
|
@@ -20,11 +20,11 @@ The project is still a work in progress, new versions will come with time.
|
|
| 20 |
|
| 21 |
# Training
|
| 22 |
|
| 23 |
-
- epochs:
|
| 24 |
- lr: 1e-5
|
| 25 |
- optim: AdamW
|
| 26 |
-
- weight_decay: 1e-
|
| 27 |
|
| 28 |
# Eval
|
| 29 |
|
| 30 |
-
- perplexity:
|
|
|
|
| 20 |
|
| 21 |
# Training
|
| 22 |
|
| 23 |
+
- epochs: 250
|
| 24 |
- lr: 1e-5
|
| 25 |
- optim: AdamW
|
| 26 |
+
- weight_decay: 1e-4
|
| 27 |
|
| 28 |
# Eval
|
| 29 |
|
| 30 |
+
- perplexity: 45 (it's a 12MB model, don't expect this to be ChatGPT anytime soon :)
|