Update README.md
Browse files
README.md
CHANGED
|
@@ -29,7 +29,7 @@ model-index:
|
|
| 29 |
|
| 30 |
<h1 align="center">roberta-incoherence-classifier</h1>
|
| 31 |
|
| 32 |
-
Encoder-based classifier for document inconsistency detection in **Polish**. This model evaluates the semantic consistency between two text fragments (e.g. sections of legal, procurement or organizational documents). It follows an NLI-like setup but **redefines labels specifically for document coherence auditing**.
|
| 33 |
|
| 34 |
---
|
| 35 |
|
|
@@ -96,7 +96,7 @@ Finetuning on specific domain data is recommended for best production accuracy.
|
|
| 96 |
|
| 97 |
---
|
| 98 |
|
| 99 |
-
## Evaluation (
|
| 100 |
|
| 101 |
```
|
| 102 |
precision recall f1-score support
|
|
|
|
| 29 |
|
| 30 |
<h1 align="center">roberta-incoherence-classifier</h1>
|
| 31 |
|
| 32 |
+
Encoder-based classifier for document inconsistency detection in **Polish**. This model evaluates the semantic consistency between two text fragments (e.g. sections of legal, procurement or organizational documents). It follows an NLI-like setup but **redefines labels specifically for document coherence auditing**. This model was **initalized from [PKOBP/polish-roberta-8k](https://huggingface.co/PKOBP/polish-roberta-8k)** and **adapted into an inconsistency classifier** through supervised training on high-quality document-style pairs.
|
| 33 |
|
| 34 |
---
|
| 35 |
|
|
|
|
| 96 |
|
| 97 |
---
|
| 98 |
|
| 99 |
+
## Evaluation (on [asseco-group/incoherence-bench](https://huggingface.co/datasets/asseco-group/incoherence-bench), test split)
|
| 100 |
|
| 101 |
```
|
| 102 |
precision recall f1-score support
|