Update README.md
Browse files
README.md
CHANGED
|
@@ -12,7 +12,7 @@
|
|
| 12 |
|
| 13 |
## Introduction
|
| 14 |
|
| 15 |
-
Recent
|
| 16 |
|
| 17 |
## Code Repository
|
| 18 |
|
|
|
|
| 12 |
|
| 13 |
## Introduction
|
| 14 |
|
| 15 |
+
Recent agentic workflows have automated professional document generation but focus narrowly on textual quality, overlooking structural and stylistic professionalism that is equally critical for readability. This gap stems mainly from a lack of effective reward models capable of guiding agents toward producing documents with high structural and stylistic professionalism. We introduce DocReward, a Document Reward Model that evaluates documents based on their structure and style. To achieve this, we propose a textual-quality-agnostic framework that ensures assessments are not confounded by content quality, and construct DocPair, a dataset of 117K paired documents, covering 32 domains and 267 types. DocReward is trained using the Bradley-Terry loss. On a manually annotated benchmark, DocReward outperforms GPT-5 by 14.6 percentage points in accuracy. Reinforcement learning experiments further show that DocReward effectively guides agents toward generating documents of greater structural and stylistic quality.
|
| 16 |
|
| 17 |
## Code Repository
|
| 18 |
|