robinhad commited on
Commit
363561d
·
verified ·
1 Parent(s): 69e9e72

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -4
README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  # alignment-score-model
20
 
21
- This model is a fine-tuned version of [intfloat/multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) on the None dataset.
22
  It achieves the following results on the evaluation set:
23
  - Loss: 0.0610
24
  - Precision: 0.9802
@@ -26,17 +26,19 @@ It achieves the following results on the evaluation set:
26
  - F1 Macro: 0.9790
27
  - Accuracy: 0.9790
28
 
 
 
29
  ## Model description
30
 
31
- More information needed
32
 
33
  ## Intended uses & limitations
34
 
35
- More information needed
36
 
37
  ## Training and evaluation data
38
 
39
- More information needed
40
 
41
  ## Training procedure
42
 
 
18
 
19
  # alignment-score-model
20
 
21
+ This model is a fine-tuned version of [intfloat/multilingual-e5-base](https://huggingface.co/intfloat/multilingual-e5-base) on the alignment dataset.
22
  It achieves the following results on the evaluation set:
23
  - Loss: 0.0610
24
  - Precision: 0.9802
 
26
  - F1 Macro: 0.9790
27
  - Accuracy: 0.9790
28
 
29
+ Training script is available here: https://github.com/lapa-llm/lapa-llm/blob/main/pretraining/quality-classifiers/alignment_score.py
30
+
31
  ## Model description
32
 
33
+ This model measure how likely the given text is a disinformation or unaligned to Ukrainian context.
34
 
35
  ## Intended uses & limitations
36
 
37
+ Data filtering and evaluation of pretraining data at scale
38
 
39
  ## Training and evaluation data
40
 
41
+ Take a look into https://github.com/lapa-llm/lapa-llm/blob/main/pretraining/quality-classifiers/alignment_score.py
42
 
43
  ## Training procedure
44