samirmsallem committed on
Commit da80ee4 · verified · 1 Parent(s): b654646

Update README.md

Files changed (1)
  1. README.md +4 -1
README.md CHANGED
@@ -31,10 +31,13 @@ model-index:
 
 ## Text classification model for coherence evaluation in German scientific texts
 
-**gbert-base-coherence_evaluation** is a sequence classification model in the scientific domain in German, finetuned from the model [gbert-large](https://huggingface.co/deepset/gbert-large).
+**gbert-large-coherence_evaluation** is a sequence classification model in the scientific domain in German, finetuned from the model [gbert-large](https://huggingface.co/deepset/gbert-large).
 It was trained using a custom annotated dataset of around 12,000 training and 3,000 test examples containing coherent and incoherent text sequences from wikipedia articles in german.
 
 
+Compared to the [base version](https://huggingface.co/samirmsallem/gbert-base-coherence_evaluation), this model achieved a slightly higher peak accuracy (95.30%) on the validation set, observed at epoch 7. However, the base model reached its lowest evaluation loss (0.2347) earlier during training, suggesting that it converges faster but may underperform slightly in terms of generalization. These findings can inform future model selection depending on whether inference efficiency or accuracy is prioritized.
+
+
 |Text Classification Tag| Text Classification Label | Description |
 | :----: | :----: | :----: |
 | 0 | INCOHERENT | The text is not coherent or has any kind of cohesion. |
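
For readers who want to try the classifier described in this README change, the following is a minimal sketch, not the author's documented usage. It assumes the model is published under the repository id `samirmsallem/gbert-large-coherence_evaluation` (inferred from the base model's URL, not confirmed by this commit) and that the `transformers` library is installed. The helper names `load_classifier`, `is_incoherent`, and `classify` are hypothetical. Only tag 0 (`INCOHERENT`) is visible in this hunk, so the sketch checks for that label alone and also accepts the generic `LABEL_0` name in case the model config does not define label names.

```python
from typing import Callable, Dict, List

# Assumption: repository id inferred from the base model's URL in the README.
MODEL_ID = "samirmsallem/gbert-large-coherence_evaluation"


def load_classifier(model_id: str = MODEL_ID):
    """Build a Hugging Face text-classification pipeline (downloads the model)."""
    from transformers import pipeline  # imported lazily; only needed for real inference

    return pipeline("text-classification", model=model_id)


def is_incoherent(prediction: Dict[str, object]) -> bool:
    """Return True when the predicted label corresponds to tag 0 (INCOHERENT)."""
    label = str(prediction["label"]).upper()
    # LABEL_0 is the default name transformers assigns when no id2label is configured.
    return label in {"INCOHERENT", "LABEL_0"}


def classify(texts: List[str], classifier: Callable) -> List[bool]:
    """Run the classifier over the texts and flag the ones predicted incoherent."""
    return [is_incoherent(prediction) for prediction in classifier(texts)]
```

With network access, `classify(["Ein kurzer Beispieltext."], load_classifier())` would return one boolean per input text. Keeping the label check separate from model loading means the same helper could be pointed at the base model by passing a different `model_id`.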