Commit ·
d6fc432
1
Parent(s): 8ff1eaf
Update README.md
Browse files
README.md
CHANGED
|
@@ -17,7 +17,7 @@ This repository provides a Japanese-centric multilingual GPT-NeoX model of 10 bi
|
|
| 17 |
|
| 18 |
* **Pre-training**
|
| 19 |
|
| 20 |
-
The model was trained on around **600B** tokens from a mixture of the following corpora
|
| 21 |
|
| 22 |
- [Japanese C4](https://huggingface.co/datasets/mc4)
|
| 23 |
- [The Pile](https://huggingface.co/datasets/EleutherAI/pile)
|
|
|
|
| 17 |
|
| 18 |
* **Pre-training**
|
| 19 |
|
| 20 |
+
The model was trained on around **600B** tokens from a mixture of the following corpora.
|
| 21 |
|
| 22 |
- [Japanese C4](https://huggingface.co/datasets/mc4)
|
| 23 |
- [The Pile](https://huggingface.co/datasets/EleutherAI/pile)
|