Update README.md
Browse files
README.md
CHANGED
|
@@ -9,4 +9,5 @@ tags:
|
|
| 9 |
---
|
| 10 |
3B rocm-rwkv pth record.
|
| 11 |
- rwkv-final-chnk5.pth: 3B rocm-rwkv model trained with Slim pajama chunk1-5 and with a loss of 2.456.
|
| 12 |
-
- rwkv-final-chnk17.pth: 3B rocm-rwkv model trained with Slim pajama chunk1-10 for the first epoch and an aditional training with chunk1-7 after the first epoch and with a loss of 2.281
|
|
|
|
|
|
| 9 |
---
|
| 10 |
3B rocm-rwkv pth record.
|
| 11 |
- rwkv-final-chnk5.pth: 3B rocm-rwkv model trained with Slim pajama chunk1-5 and with a loss of 2.456.
|
| 12 |
+
- rwkv-final-chnk17.pth: 3B rocm-rwkv model trained with Slim pajama chunk1-10 for the first epoch and an aditional training with chunk1-7 after the first epoch and with a loss of 2.281
|
| 13 |
+
- rwkv-code39-16012024.pth: 3B rocm-rwkv model trained with Slim pajama chunk1-10 for the first epoch and an aditional training with chunk1-8 after the first epoch; plus a little bit of code. This pth has a loss of 1.174.
|