nvidia
/

quality-classifier-deberta

pytorch_model_hub_mixin

model_hub_mixin

Model card Files Files and versions

ryantwolf commited on Aug 6, 2024

Commit

05e88a7

·

verified ·

1 Parent(s): deefa0c

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -14,7 +14,7 @@ The model was trained using data annotated by human annotators, who considered q
 This model is used in the [NVIDIA NeMo Curator](https://github.com/NVIDIA/NeMo-Curator) as part of the qualitative filtering module.
 # Model Architecture
 The model architecture is Deberta V3 Base
-Context length is 512 tokens
 # Training (details)
 ## Training data:
 - 1 million Common Crawl samples, labeled using Google Cloud’s Natural Language API: https://cloud.google.com/natural-language/docs/classifying-text

 This model is used in the [NVIDIA NeMo Curator](https://github.com/NVIDIA/NeMo-Curator) as part of the qualitative filtering module.
 # Model Architecture
 The model architecture is Deberta V3 Base
+Context length is 1024 tokens
 # Training (details)
 ## Training data:
 - 1 million Common Crawl samples, labeled using Google Cloud’s Natural Language API: https://cloud.google.com/natural-language/docs/classifying-text