How to use Aleph-Alpha/Aleph-Alpha-GermanWeb-Quality-Classifier-fastText with fastText:

```python
from huggingface_hub import hf_hub_download
import fasttext

model = fasttext.load_model(hf_hub_download("Aleph-Alpha/Aleph-Alpha-GermanWeb-Quality-Classifier-fastText", "model.bin"))
```
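Once the model is loaded, scoring a document might look like the sketch below. It is a minimal illustration, not the authors' snippet: the helper names (`to_single_line`, `scores_by_label`, `classify`, `demo`) and the sample sentence are hypothetical, and the label names this model emits are not stated here, so the code only strips fastText's default `__label__` prefix and reports whatever labels come back.

```python
def to_single_line(text: str) -> str:
    """Collapse all whitespace, since fastText's predict() rejects newlines."""
    return " ".join(text.split())


def scores_by_label(labels, probs) -> dict:
    """Map each predicted label (default '__label__' prefix removed) to its probability."""
    prefix = "__label__"
    return {
        (label[len(prefix):] if label.startswith(prefix) else label): float(prob)
        for label, prob in zip(labels, probs)
    }


def classify(model, text: str) -> dict:
    """Return a probability for every label the model knows (k=-1 asks for all labels)."""
    labels, probs = model.predict(to_single_line(text), k=-1)
    return scores_by_label(labels, probs)


def demo() -> None:
    """Download the model and score one hypothetical German sentence."""
    # Imported here so the helpers above can be used without these packages installed.
    from huggingface_hub import hf_hub_download
    import fasttext

    model = fasttext.load_model(
        hf_hub_download(
            "Aleph-Alpha/Aleph-Alpha-GermanWeb-Quality-Classifier-fastText",
            "model.bin",
        )
    )
    # Assumption: the sample text is illustrative only.
    print(classify(model, "Die Photosynthese wandelt Lichtenergie\nin chemische Energie um."))
```

Calling `demo()` downloads `model.bin` on first use and prints one probability per label. Inspect the returned keys to see which label corresponds to high-quality text before thresholding on it.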
Update README.md
README.md
CHANGED
@@ -18,7 +18,7 @@ For each document, we calculated a combined educational quality score by taking

 We trained Aleph-Alpha-GermanWeb-Quality-Classifier-fastText using 185,403 documents in each class. We used 95% of the data (and the remaining 5% for validation) to train a fastText model to classify between high and low quality text data. It reached 77% precision and 77% recall on the validation set.

-Further details, including our LLM judging prompt, can be found in our accompanying paper
+Further details, including our LLM judging prompt, can be found in our [accompanying paper](https://arxiv.org/abs/2505.00022).

 ## Example Snippet