Update README.md
--- a/README.md
+++ b/README.md
@@ -5,13 +5,16 @@ tags:
 - evaluate
 - metric
 description: >-
-
-
-
+  ParaPLUIE is a metric for evaluating the semantic proximity of two sentences.
+  ParaPLUIE uses the perplexity of an LLM to compute a confidence score. It has
+  shown the highest correlation with human judgement on paraphrase
+  classification while keeping the computational cost low, as it is roughly
+  equal to the cost of generating one token.
 sdk: gradio
 sdk_version: 3.19.1
 app_file: app.py
 pinned: false
+short_description: ParaPLUIE is a metric for evaluating the semantic proximity
 ---
 
 # Metric Card for ParaPLUIE (Paraphrase Generation Evaluation Powered by an LLM)
@@ -140,4 +143,4 @@ This metric is based on an LLM and therefore is limited by the LLM used.
 year = "2025",
 url = "https://aclanthology.org/2025.coling-main.538/"
 }
-```
+```
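The new description says the score is derived from an LLM's perplexity and costs about as much as generating one token. One plausible reading, which is an assumption here and not something this diff confirms, is that a single forward pass yields next-token logits and the score compares the likelihood of an affirmative answer token against a negative one. The sketch below illustrates that idea on toy logits; `yes_no_log_ratio`, `yes_id`, and `no_id` are hypothetical names and not part of the ParaPLUIE code.

```python
import math


def yes_no_log_ratio(next_token_logits, yes_id, no_id):
    """Log-odds of a hypothetical 'Yes' token over a 'No' token for one step.

    Assumption: the logits come from a single forward pass of an LLM over a
    prompt that asks whether two sentences are paraphrases.
    """
    # Numerically stable log-sum-exp for the softmax normaliser.
    m = max(next_token_logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in next_token_logits))
    log_p_yes = next_token_logits[yes_id] - log_z
    log_p_no = next_token_logits[no_id] - log_z
    # Positive values favour 'Yes' (paraphrase), negative values favour 'No'.
    return log_p_yes - log_p_no


# Toy vocabulary of four tokens; indices 1 and 2 stand in for 'Yes' and 'No'.
toy_logits = [0.1, 2.0, -1.0, 0.5]
score = yes_no_log_ratio(toy_logits, yes_id=1, no_id=2)
```

Because only one next-token distribution is needed, a score of this shape requires a single forward pass, which would be consistent with the "one token generation" cost stated in the description.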