Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -32,7 +32,7 @@ dataset contains educational web pages.
|
|
| 32 |
|
| 33 |
- **Developed by:** Allen Porter
|
| 34 |
|
| 35 |
-
### Model Sources
|
| 36 |
|
| 37 |
<!-- Provide the basic links for the model. -->
|
| 38 |
|
|
@@ -78,7 +78,7 @@ from the 10B token sample.
|
|
| 78 |
|
| 79 |
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
|
| 80 |
|
| 81 |
-
#### Preprocessing
|
| 82 |
|
| 83 |
The data was pre-tokenized using the `nano-gpt prepare_dataset` command
|
| 84 |
line tool.
|
|
|
|
| 32 |
|
| 33 |
- **Developed by:** Allen Porter
|
| 34 |
|
| 35 |
+
### Model Sources
|
| 36 |
|
| 37 |
<!-- Provide the basic links for the model. -->
|
| 38 |
|
|
|
|
| 78 |
|
| 79 |
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
|
| 80 |
|
| 81 |
+
#### Preprocessing
|
| 82 |
|
| 83 |
The data was pre-tokenized using the `nano-gpt prepare_dataset` command
|
| 84 |
line tool.
|