hexuandeng
/

ReviewSplit

Model card Files Files and versions

ReviewSplit / README.md

nielsr's picture

nielsr HF Staff

Add model card and metadata

aa60d75 verified 8 days ago

|

1.54 kB

	---
	license: apache-2.0
	library_name: transformers
	pipeline_tag: text-generation
	---

	# CoCoReviewBench Model

	This repository contains a model checkpoint from the paper [CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers](https://huggingface.co/papers/2605.07905).

	## Introduction
	CoCoReviewBench is a benchmark designed for reliable and fine-grained evaluation of AI reviewers. It curates 3,900 papers from ICLR and NeurIPS, focusing on:
	- Completeness: Evaluating by category to avoid penalizing models for issues missing in human references.
	- Correctness: Filtering human reviews using reviewer-author-meta-reviewer discussions to ensure accuracy.

	## Links
	- Paper: [CoCoReviewBench: A Completeness- and Correctness-Oriented Benchmark for AI Reviewers](https://huggingface.co/papers/2605.07905)
	- Code: [Official GitHub Repository](https://github.com/hexuandeng/CoCoReviewBench)

	## Citation

	```bibtex
	@inproceedings{deng2026cocoreviewbench,
	title = {{CoCoReviewBench}: A Completeness- and Correctness-Oriented Benchmark for {AI} Reviewers},
	author = {Deng, Hexuan and Li, Yichen and Ke, Xiaopeng and Hu, Ruina and Wong, Derek F. and Wang, Yue and Liu, Xuebo and Huang, Dehao and Zhang, Min},
	booktitle = {Proceedings of the 43rd International Conference on Machine Learning},
	series = {Proceedings of Machine Learning Research},
	publisher = {PMLR},
	year = {2026},
	note = {To appear},
	url = {https://github.com/hexuandeng/CoCoReviewBench}
	}
	```