IAAR-Shanghai/xVerify-32B-I


nielsr (HF Staff) committed on Jan 1

Commit d58b5ed (verified) · 1 parent: cf02a7a

Add pipeline_tag, library_name and link to paper (#1)


- Add pipeline_tag, library_name and link to paper (cb936155815809e4214605481ba465ddee1436dc)

Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1)

README.md +70 -60


@@ -1,61 +1,71 @@
----
-inference: false
-language:
-- en
-- zh
-tags:
-- instruction-finetuning
-task_categories:
-- text-generation
-base_model:
-- Qwen/Qwen2.5-32B-Instruct
-license: cc-by-nc-nd-4.0
----
-<h1 align="center">
-🔍 xVerify-32B-I
-</h1>
-<p align="center">
-  <div style="display: flex; justify-content: center; gap: 10px;">
-    <a href="https://github.com/IAAR-Shanghai/xVerify">
-      <img src="https://img.shields.io/badge/GitHub-Repository-blue?logo=github" alt="GitHub"/>
-    </a>
-    <a href="https://huggingface.co/IAAR-Shanghai/xVerify-32B-I">
-      <img src="https://img.shields.io/badge/🤗%20Hugging%20Face-xVerify--32B--I-yellow" alt="Hugging Face"/>
-    </a>
-  </div>
-</p>
-xVerify is an evaluation tool fine-tuned from a pre-trained large language model, designed specifically for objective questions with a single correct answer. It accurately extracts the final answer from lengthy reasoning processes and efficiently identifies equivalence across different forms of expressions.
----
-## ✨ Key Features
-### 📊 Broad Applicability
-Suitable for various objective question evaluation scenarios including math problems, multiple-choice questions, classification tasks, and short-answer questions.
-### ⛓️ Handles Long Reasoning Chains
-Effectively processes answers with extensive reasoning steps to extract the final answer, regardless of complexity.
-### 🌐 Multilingual Support
-Primarily handles Chinese and English responses while remaining compatible with other languages.
-### 🔄 Powerful Equivalence Judgment
-- ✓ Recognizes basic transformations like letter case changes and Greek letter conversions
-- ✓ Identifies equivalent mathematical expressions across formats (LaTeX, fractions, scientific notation)
-- ✓ Determines semantic equivalence in natural language answers
-- ✓ Matches multiple-choice responses by content rather than just option identifiers
----
-## 📚 Citation
-```bibtex
-@article{xVerify,
-      title={xVerify: Efficient Answer Verifier for Reasoning Model Evaluations},
-      author={Ding Chen and Qingchen Yu and Pengyuan Wang and Wentao Zhang and Bo Tang and Feiyu Xiong and Xinchi Li and Minchuan Yang and Zhiyu Li},
-      journal={arXiv preprint arXiv:2504.10481},
-      year={2025},
-}
 ```

+---
+base_model:
+- Qwen/Qwen2.5-32B-Instruct
+language:
+- en
+- zh
+license: cc-by-nc-nd-4.0
+tags:
+- instruction-finetuning
+- evaluation
+- reasoning
+inference: false
+pipeline_tag: text-generation
+library_name: transformers
+arxiv: 2504.10481
+---
+<h1 align="center">
+🔍 xVerify-32B-I
+</h1>
+<p align="center">
+  <div style="display: flex; justify-content: center; gap: 10px;">
+    <a href="https://github.com/IAAR-Shanghai/xVerify">
+      <img src="https://img.shields.io/badge/GitHub-Repository-blue?logo=github" alt="GitHub"/>
+    </a>
+    <a href="https://huggingface.co/IAAR-Shanghai/xVerify-32B-I">
+      <img src="https://img.shields.io/badge/🤗%20Hugging%20Face-xVerify--32B--I-yellow" alt="Hugging Face"/>
+    </a>
+    <a href="https://huggingface.co/papers/2504.10481">
+      <img src="https://img.shields.io/badge/Paper-arXiv-red?logo=arxiv" alt="Paper"/>
+    </a>
+  </div>
+</p>
+xVerify is an evaluation tool fine-tuned from a pre-trained large language model, designed specifically for objective questions with a single correct answer. It accurately extracts the final answer from lengthy reasoning processes and efficiently identifies equivalence across different forms of expressions.
+This model is presented in the paper [xVerify: Efficient Answer Verifier for Reasoning Model Evaluations](https://huggingface.co/papers/2504.10481).
+---
+## ✨ Key Features
+### 📊 Broad Applicability
+Suitable for various objective question evaluation scenarios including math problems, multiple-choice questions, classification tasks, and short-answer questions.
+### ⛓️ Handles Long Reasoning Chains
+Effectively processes answers with extensive reasoning steps to extract the final answer, regardless of complexity.
+### 🌐 Multilingual Support
+Primarily handles Chinese and English responses while remaining compatible with other languages.
+### 🔄 Powerful Equivalence Judgment
+- ✓ Recognizes basic transformations like letter case changes and Greek letter conversions
+- ✓ Identifies equivalent mathematical expressions across formats (LaTeX, fractions, scientific notation)
+- ✓ Determines semantic equivalence in natural language answers
+- ✓ Matches multiple-choice responses by content rather than just option identifiers
+---
+## 📚 Citation
+```bibtex
+@article{xVerify,
+      title={xVerify: Efficient Answer Verifier for Reasoning Model Evaluations},
+      author={Ding Chen and Qingchen Yu and Pengyuan Wang and Wentao Zhang and Bo Tang and Feiyu Xiong and Xinchi Li and Minchuan Yang and Zhiyu Li},
+      journal={arXiv preprint arXiv:2504.10481},
+      year={2025},
+}
 ```
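
Reviewer note: the "Powerful Equivalence Judgment" bullets in the README name several kinds of answer equivalence (letter case, Greek letters, LaTeX fractions, scientific notation). The snippet below is a minimal rule-based sketch of what those equivalences look like on concrete strings. It is not how xVerify works: the model card describes a fine-tuned LLM, and the helper names (`normalize`, `to_number`, `roughly_equivalent`) and the small `GREEK` table are hypothetical, introduced only for illustration.

```python
import re
from fractions import Fraction

# Hypothetical lookup table for illustration only; not taken from xVerify.
GREEK = {"α": "alpha", "β": "beta", "π": "pi"}


def normalize(answer: str) -> str:
    """Lowercase, spell out a few Greek letters, and flatten \\frac{a}{b} to a/b."""
    text = answer.strip().lower()
    for symbol, name in GREEK.items():
        text = text.replace(symbol, name)
    # Rewrite a simple (non-nested) LaTeX fraction as "numerator/denominator".
    return re.sub(r"\\frac\{([^{}]+)\}\{([^{}]+)\}", r"\1/\2", text)


def to_number(text: str):
    """Parse '1/2', '0.5', or '5e-1' as an exact Fraction; return None otherwise."""
    try:
        return Fraction(text)
    except (ValueError, ZeroDivisionError):
        return None


def roughly_equivalent(a: str, b: str) -> bool:
    """True if the two answers match after normalization or as exact numbers."""
    na, nb = normalize(a), normalize(b)
    if na == nb:
        return True
    xa, xb = to_number(na), to_number(nb)
    return xa is not None and xa == xb


if __name__ == "__main__":
    print(roughly_equivalent(r"\frac{1}{2}", "0.5"))  # LaTeX fraction vs decimal
    print(roughly_equivalent("5E-1", "1/2"))          # scientific notation vs fraction
    print(roughly_equivalent("Alpha", "α"))           # spelled-out vs Greek letter
```

Hand-written rules like these stop working as soon as the answer is embedded in a long reasoning chain or phrased in natural language, which are the cases the README says the model is meant to handle.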