File size: 1,544 Bytes
aa60d75
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
---
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation
---

# CoCoReviewBench Model

This repository contains a model checkpoint from the paper [CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers](https://huggingface.co/papers/2605.07905).

## Introduction
CoCoReviewBench is a benchmark designed for reliable and fine-grained evaluation of AI reviewers. It curates 3,900 papers from ICLR and NeurIPS, focusing on:
- **Completeness**: Evaluating by category to avoid penalizing models for issues missing in human references.
- **Correctness**: Filtering human reviews using reviewer-author-meta-reviewer discussions to ensure accuracy.

## Links
- **Paper**: [CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers](https://huggingface.co/papers/2605.07905)
- **Code**: [Official GitHub Repository](https://github.com/hexuandeng/CoCoReviewBench)

## Citation

```bibtex
@inproceedings{deng2026cocoreviewbench,
  title     = {{CoCoReviewBench}: A Completeness- and Correctness-Oriented Benchmark for {AI} Reviewers},
  author    = {Deng, Hexuan and Li, Yichen and Ke, Xiaopeng and Hu, Ruina and Wong, Derek F. and Wang, Yue and Liu, Xuebo and Huang, Dehao and Zhang, Min},
  booktitle = {Proceedings of the 43rd International Conference on Machine Learning},
  series    = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
  year      = {2026},
  note      = {To appear},
  url       = {https://github.com/hexuandeng/CoCoReviewBench}
}
```