ymoslem
/

ModernBERT-base-qe-v1

@@ -37,7 +37,47 @@ datasets:
 - ymoslem/tokenized-wmt-da-human-evaluation
 model-index:
 - name: Quality Estimation for Machine Translation
-  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -51,18 +91,23 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
-## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:

 - ymoslem/tokenized-wmt-da-human-evaluation
 model-index:
 - name: Quality Estimation for Machine Translation
+  results:
+  - task:
+      type: regression
+    dataset:
+      name: ymoslem/wmt-da-human-evaluation-long-context
+      type: QE
+    metrics:
+    - name: Pearson
+      type: Pearson Correlation
+      value: 0.4465
+    - name: MAE
+      type: Mean Absolute Error
+      value: 0.126
+    - name: RMSE
+      type: Root Mean Squared Error
+      value: 0.1623
+    - name: R-R2
+      type: R-Squared
+      value: 0.0801
+  - task:
+      type: regression
+    dataset:
+      name: ymoslem/wmt-da-human-evaluation
+      type: QE
+    metrics:
+    - name: Pearson
+      type: Pearson Correlation
+      value:
+    - name: MAE
+      type: Mean Absolute Error
+      value:
+    - name: RMSE
+      type: Root Mean Squared Error
+      value:
+    - name: R-R2
+      type: R-Squared
+      value:
+metrics:
+- pearsonr
+- mae
+- r_squared
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 ## Model description
+This model is for reference-free, sentence level quality estimation (QE) of machine translation (MT) systems.
+The long-context / document-level model can be found at: [ModernBERT-base-long-context-qe-v1](https://huggingface.co/ymoslem/ModernBERT-base-long-context-qe-v1),
+which is trained on a long-context / document-level QE dataset [ymoslem/wmt-da-human-evaluation-long-context](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation-long-context)
 ## Training and evaluation data
+This model is trained on the sentence-level quality estimation dataset: [ymoslem/wmt-da-human-evaluation](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation)
 ## Training procedure
+This version of the model uses the full lengthtokenizer.model_max_length=8192,
+but it is still trained on a sentence-level QE dataset [ymoslem/wmt-da-human-evaluation](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation)
+The long-context / document-level model can be found at: [ModernBERT-base-long-context-qe-v1](https://huggingface.co/ymoslem/ModernBERT-base-long-context-qe-v1),
+which is trained on a long-context / document-level QE dataset [ymoslem/wmt-da-human-evaluation-long-context](https://huggingface.co/datasets/ymoslem/wmt-da-human-evaluation-long-context)
 ### Training hyperparameters
 The following hyperparameters were used during training: