nielsr HF Staff committed on
Commit
ff8f983
·
verified ·
1 Parent(s): 745ad3e

Improve model card: add metadata, paper link and repository information


Hi! I'm Niels from the Hugging Face community science team.

This PR improves the model card for the DRPG Judge Model by:
- Adding the `library_name: transformers` metadata (verified by the `config.json` architecture and version).
- Adding the `pipeline_tag: text-generation` for better discoverability.
- Linking the model to its associated paper on Hugging Face: [DRPG (Decompose, Retrieve, Plan, Generate): An Agentic Framework for Academic Rebuttal](https://huggingface.co/papers/2601.18081).
- Including a BibTeX citation for researchers using this model.

Please feel free to merge if this looks good!

Files changed (1)
  1. README.md +41 -2
README.md CHANGED
@@ -1,5 +1,44 @@
 ---
 license: apache-2.0
+library_name: transformers
+pipeline_tag: text-generation
+base_model: qwen/Qwen3-8B
+tags:
+- academic-rebuttal
+- agentic-framework
+- rl
 ---
-A judge model to evaluate rebuttal quality, trained from Qwen3-8B using RL.
-Refer to https://github.com/ulab-uiuc/DRPG-RebuttalAgent for usage.
+
+# DRPG Judge Model
+
+This repository contains the Judge Model for the **DRPG (Decompose, Retrieve, Plan, Generate)** framework, as introduced in the paper [DRPG (Decompose, Retrieve, Plan, Generate): An Agentic Framework for Academic Rebuttal](https://huggingface.co/papers/2601.18081).
+
+The model is specifically designed to evaluate the quality of academic rebuttals. It was trained from **Qwen3-8B** using Reinforcement Learning (RL) to provide accurate and persuasive assessment scores.
+
+## Links
+- **Paper:** [DRPG: An Agentic Framework for Academic Rebuttal](https://huggingface.co/papers/2601.18081)
+- **Repository:** [ulab-uiuc/DRPG-RebuttalAgent](https://github.com/ulab-uiuc/DRPG-RebuttalAgent)
+
+## About DRPG
+DRPG is an agentic framework for automatic academic rebuttal generation that operates through four steps:
+1. **Decompose**: Breaking reviews into atomic concerns.
+2. **Retrieve**: Finding relevant evidence from the paper.
+3. **Plan**: Identifying feasible rebuttal strategies.
+4. **Generate**: Creating targeted responses.
+
+The Judge Model is used within this pipeline to assess rebuttal quality, achieving performance beyond the average human level in experimental evaluations.
+
+## Usage
+Refer to the official [GitHub repository](https://github.com/ulab-uiuc/DRPG-RebuttalAgent) for instructions on running the evaluation scripts and using the model within the DRPG pipeline.
+
+## Citation
+If you find this model useful in your research, please cite:
+```bibtex
+@article{han2025drpg,
+title={DRPG (Decompose, Retrieve, Plan, Generate): An Agentic Framework for Academic Rebuttal},
+author={Han, Peixuan and Yu, Yingjie and Xu, Jingjun and You, Jiaxuan},
+journal={arXiv preprint arXiv:2601.18081},
+url={https://arxiv.org/pdf/2601.18081},
+year={2026}
+}
+```
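
The metadata this commit adds lives in the YAML front matter between the two `---` lines of README.md. As a rough illustration of how those fields could be checked before merging, here is a minimal sketch in plain Python. The `parse_front_matter` helper and the `CARD` sample are hypothetical and written only for this example. The helper handles just the flat `key: value` pairs and `- item` lists used in this card, so it is not a general YAML parser and not how the Hub itself reads model cards.

```python
def parse_front_matter(card_text):
    """Parse simple front matter from the top of a model card.

    Hypothetical helper: supports only flat `key: value` pairs and
    `- item` lists, which is all the README in this commit uses.
    """
    lines = card_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the file
    metadata = {}
    current_key = None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            break  # closing delimiter ends the metadata block
        if not stripped:
            continue
        if stripped.startswith("- "):
            # List item: attach it to the most recent key if that key
            # opened a list (for example `tags:`), otherwise skip it.
            if isinstance(metadata.get(current_key), list):
                metadata[current_key].append(stripped[2:].strip())
            continue
        key, _, value = stripped.partition(":")
        current_key = key.strip()
        value = value.strip()
        # A key with no inline value is assumed to start a list.
        metadata[current_key] = value if value else []
    return metadata


# Sample text copied from the new side of the diff (front matter only).
CARD = """---
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation
base_model: qwen/Qwen3-8B
tags:
- academic-rebuttal
- agentic-framework
- rl
---

# DRPG Judge Model
"""

metadata = parse_front_matter(CARD)
# Fields the PR description says it adds.
missing = [k for k in ("library_name", "pipeline_tag") if k not in metadata]
print(metadata)
print("missing fields:", missing)
```

For real validation, a full YAML parser would be the safer choice, since model cards can contain nested or quoted values that this sketch does not handle.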