Update README.md
Browse files
README.md
CHANGED
|
@@ -6,4 +6,58 @@ base_model:
|
|
| 6 |
- benjamin/roberta-base-wechsel-german
|
| 7 |
tags:
|
| 8 |
- simplification
|
| 9 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 6 |
- benjamin/roberta-base-wechsel-german
|
| 7 |
tags:
|
| 8 |
- simplification
|
| 9 |
+
---
|
| 10 |
+
|
| 11 |
+
# 🧭 DETECT: Determining Ease and Textual Clarity of German Text Simplifications
|
| 12 |
+
|
| 13 |
+
This repository contains the **trained checkpoint for DETECT**, an automated **German Automatic Text Simplification (ATS)** quality evaluation metric introduced in
|
| 14 |
+
> *“DETECT: Determining Ease and Textual Clarity of German Text Simplifications”*.
|
| 15 |
+
|
| 16 |
+
DETECT provides fine-grained scoring across **simplicity**, **meaning preservation**, and **fluency**, along with a composite **total** score.
|
| 17 |
+
Further information about the metric can be found in the description of the [GitHub repository](https://github.com/ZurichNLP/DETECT) or in our accompanying paper.
|
| 18 |
+
|
| 19 |
+
> 🔎 **Note**
|
| 20 |
+
> - This repository hosts a **checkpoint file only**.
|
| 21 |
+
> - You must load it **through the DETECT codebase** (see usage below).
|
| 22 |
+
> - It is **not** directly compatible with `AutoModel.from_pretrained()`.
|
| 23 |
+
> - The model supports **reference-based** text simplification evaluation only — it does **not** provide reference-free evaluation.
|
| 24 |
+
|
| 25 |
+
---
|
| 26 |
+
|
| 27 |
+
## ⚙️ Usage
|
| 28 |
+
|
| 29 |
+
Clone and install the DETECT codebase:
|
| 30 |
+
```bash
|
| 31 |
+
git clone https://github.com/ZurichNLP/DETECT.git
|
| 32 |
+
cd DETECT/detect
|
| 33 |
+
pip install -e .
|
| 34 |
+
```
|
| 35 |
+
|
| 36 |
+
Then, in Python:
|
| 37 |
+
|
| 38 |
+
```from detect import DETECT
|
| 39 |
+
|
| 40 |
+
# Initialize model
|
| 41 |
+
detect = DETECT("ZurichNLP/DETECT/best-LENS_multi_wechsel_reducedhs-epoch=04.ckpt", rescale=True)
|
| 42 |
+
|
| 43 |
+
complex = [
|
| 44 |
+
"Sie sind kulturell den Küstenbewohnern von Papua-Neuguinea verwandt."
|
| 45 |
+
]
|
| 46 |
+
|
| 47 |
+
simple = [
|
| 48 |
+
"Sie sind kulturell den Menschen in Papua-Neuguinea ähnlich."
|
| 49 |
+
]
|
| 50 |
+
|
| 51 |
+
references = [[
|
| 52 |
+
"Sie sind kulturell den Küstenbewohnern von Papua-Neuguinea ähnlich.",
|
| 53 |
+
"Sie ähneln den Menschen aus Papua-Neuguinea, die an der Küste leben."
|
| 54 |
+
]]
|
| 55 |
+
|
| 56 |
+
scores = detect.score(complex, simple, references, batch_size=8, devices=[0])
|
| 57 |
+
print(scores)
|
| 58 |
+
# [{'simplicity': 78.6, 'meaning_preservation': 80.1, 'fluency': 77.3, 'total': 78.3}]
|
| 59 |
+
```
|
| 60 |
+
|
| 61 |
+
## Citation
|
| 62 |
+
|
| 63 |
+
If you use DETECT, please cite:
|