Update README.md
Browse files
README.md
CHANGED
|
@@ -14,7 +14,7 @@ The model was trained using data annotated by human annotators, who considered q
|
|
| 14 |
This model is used in the [NVIDIA NeMo Curator](https://github.com/NVIDIA/NeMo-Curator) as part of the qualitative filtering module.
|
| 15 |
# Model Architecture
|
| 16 |
The model architecture is Deberta V3 Base
|
| 17 |
-
Context length is
|
| 18 |
# Training (details)
|
| 19 |
## Training data:
|
| 20 |
- 1 million Common Crawl samples, labeled using Google Cloud’s Natural Language API: https://cloud.google.com/natural-language/docs/classifying-text
|
|
|
|
| 14 |
This model is used in the [NVIDIA NeMo Curator](https://github.com/NVIDIA/NeMo-Curator) as part of the qualitative filtering module.
|
| 15 |
# Model Architecture
|
| 16 |
The model architecture is Deberta V3 Base
|
| 17 |
+
Context length is 1024 tokens
|
| 18 |
# Training (details)
|
| 19 |
## Training data:
|
| 20 |
- 1 million Common Crawl samples, labeled using Google Cloud’s Natural Language API: https://cloud.google.com/natural-language/docs/classifying-text
|