lapa-llm/alignment-score-model

Text Classification

Generated from Trainer

text-embeddings-inference

robinhad committed on Nov 11, 2025

Commit 363561d · verified · 1 Parent(s): 69e9e72

Update README.md

Files changed (1)

README.md +6 -4

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 # alignment-score-model
-This model is a fine-tuned version of [intfloat/multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) on the None dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.0610
 - Precision: 0.9802
@@ -26,17 +26,19 @@ It achieves the following results on the evaluation set:
 - F1 Macro: 0.9790
 - Accuracy: 0.9790
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure

 # alignment-score-model
+This model is a fine-tuned version of [intfloat/multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) on the alignment dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.0610
 - Precision: 0.9802
 - F1 Macro: 0.9790
 - Accuracy: 0.9790
+Training script is available here: https://github.com/lapa-llm/lapa-llm/blob/main/pretraining/quality-classifiers/alignment_score.py
 ## Model description
+This model measures how likely a given text is to be disinformation or misaligned with the Ukrainian context.
 ## Intended uses & limitations
+Data filtering and evaluation of pretraining data at scale
 ## Training and evaluation data
+See https://github.com/lapa-llm/lapa-llm/blob/main/pretraining/quality-classifiers/alignment_score.py
 ## Training procedure
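
The updated card lists data filtering of pretraining data as the intended use. A minimal sketch of what such a filter could look like follows. It is not taken from the linked training script, and several things in it are assumptions: the helper names `filter_by_score` and `make_model_scorer`, the 0.5 threshold, the convention that a higher score means "more likely disinformation or unaligned", and the idea that the model is a binary classifier whose flagged label name you pass in yourself. Check the label names in the model's config before relying on the adapter.

```python
from typing import Callable, Iterable, List, Tuple


def filter_by_score(
    texts: Iterable[str],
    score_fn: Callable[[str], float],
    threshold: float = 0.5,
) -> Tuple[List[str], List[str]]:
    """Split texts into (kept, dropped) using a per-text score.

    Assumption: score_fn returns a value in [0, 1] where higher means
    "more likely disinformation or unaligned". Texts scoring at or above
    the threshold are dropped; the rest are kept.
    """
    kept: List[str] = []
    dropped: List[str] = []
    for text in texts:
        if score_fn(text) >= threshold:
            dropped.append(text)
        else:
            kept.append(text)
    return kept, dropped


def make_model_scorer(
    flagged_label: str,
    model_id: str = "lapa-llm/alignment-score-model",
) -> Callable[[str], float]:
    """Hypothetical adapter around the transformers text-classification pipeline.

    flagged_label is the label that marks unaligned text. Its actual name is
    not stated in the model card, so the caller must supply it.
    """
    # Imported lazily so the filtering helper works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("text-classification", model=model_id)

    def score(text: str) -> float:
        # For a single string the pipeline returns [{"label": ..., "score": ...}]
        # describing the top-scoring label only.
        result = classifier(text, truncation=True)[0]
        if result["label"] == flagged_label:
            return result["score"]
        # Assumption: binary classifier, so the other label's probability
        # is the complement of the flagged one.
        return 1.0 - result["score"]

    return score


if __name__ == "__main__":
    # Toy scorer standing in for the real model so the sketch runs offline.
    def toy_scorer(text: str) -> float:
        return 0.9 if "fake" in text else 0.1

    kept, dropped = filter_by_score(["real news", "fake claim"], toy_scorer)
    print("kept:", kept)
    print("dropped:", dropped)
```

Keeping the scorer as a plain callable means the threshold logic can be tested without downloading the model, and the same loop works if scores are precomputed in batches.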