Aleph-Alpha
/

Aleph-Alpha-GermanWeb-Quality-Classifier-fastText

Model card Files Files and versions

bastitx commited on Apr 24, 2025

Commit

e273b66

·

verified ·

1 Parent(s): 3340ddd

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ For each document, we calculated a combined educational quality score by taking
 We trained Aleph-Alpha-GermanWeb-Quality-Classifier-fastText using 185,403 documents in each class. We used 95% of the data (and the remaining 5% for validation) to train a fastText model to classify between high and low quality text data. It reached 92% precision and 91.5% recall on the validation set.
-Further details, including our LLM judging prompt, can be found in our [accompanying paper](arXiv link).
 ## Example Snippet

 We trained Aleph-Alpha-GermanWeb-Quality-Classifier-fastText using 185,403 documents in each class. We used 95% of the data (and the remaining 5% for validation) to train a fastText model to classify between high and low quality text data. It reached 92% precision and 91.5% recall on the validation set.
+Further details, including our LLM judging prompt, can be found in our accompanying paper (link to paper coming soon).
 ## Example Snippet