Hush-cd nielsr HF Staff commited on
Commit
fdfe1e6
Β·
verified Β·
1 Parent(s): 3b9ae0b

Add paper link and library metadata (#1)

Browse files

- Add paper link and library metadata (64d6c909faecc7a6c77214781e4ccb118a2a7c2d)


Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +66 -60
README.md CHANGED
@@ -1,61 +1,67 @@
1
- ---
2
- inference: false
3
- language:
4
- - en
5
- - zh
6
- tags:
7
- - instruction-finetuning
8
- task_categories:
9
- - text-generation
10
- base_model:
11
- - google/gemma-2-9b-it
12
- license: cc-by-nc-nd-4.0
13
- ---
14
- <h1 align="center">
15
- πŸ” xVerify-9B-I
16
- </h1>
17
-
18
- <p align="center">
19
- <div style="display: flex; justify-content: center; gap: 10px;">
20
- <a href="https://github.com/IAAR-Shanghai/xVerify">
21
- <img src="https://img.shields.io/badge/GitHub-Repository-blue?logo=github" alt="GitHub"/>
22
- </a>
23
- <a href="https://huggingface.co/IAAR-Shanghai/xVerify-9B-I">
24
- <img src="https://img.shields.io/badge/πŸ€—%20Hugging%20Face-xVerify--9B--I-yellow" alt="Hugging Face"/>
25
- </a>
26
- </div>
27
- </p>
28
- xVerify is an evaluation tool fine-tuned from a pre-trained large language model, designed specifically for objective questions with a single correct answer. It accurately extracts the final answer from lengthy reasoning processes and efficiently identifies equivalence across different forms of expressions.
29
-
30
- ---
31
-
32
- ## ✨ Key Features
33
-
34
- ### πŸ“Š Broad Applicability
35
- Suitable for various objective question evaluation scenarios including math problems, multiple-choice questions, classification tasks, and short-answer questions.
36
-
37
- ### ⛓️ Handles Long Reasoning Chains
38
- Effectively processes answers with extensive reasoning steps to extract the final answer, regardless of complexity.
39
-
40
- ### 🌐 Multilingual Support
41
- Primarily handles Chinese and English responses while remaining compatible with other languages.
42
-
43
- ### πŸ”„ Powerful Equivalence Judgment
44
- - βœ“ Recognizes basic transformations like letter case changes and Greek letter conversions
45
- - βœ“ Identifies equivalent mathematical expressions across formats (LaTeX, fractions, scientific notation)
46
- - βœ“ Determines semantic equivalence in natural language answers
47
- - βœ“ Matches multiple-choice responses by content rather than just option identifiers
48
-
49
- ---
50
-
51
-
52
- ## πŸ“š Citation
53
-
54
- ```bibtex
55
- @article{xVerify,
56
- title={xVerify: Efficient Answer Verifier for Reasoning Model Evaluations},
57
- author={Ding Chen and Qingchen Yu and Pengyuan Wang and Wentao Zhang and Bo Tang and Feiyu Xiong and Xinchi Li and Minchuan Yang and Zhiyu Li},
58
- journal={arXiv preprint arXiv:2504.10481},
59
- year={2025},
60
- }
 
 
 
 
 
 
61
  ```
 
1
+ ---
2
+ base_model:
3
+ - google/gemma-2-9b-it
4
+ language:
5
+ - en
6
+ - zh
7
+ license: cc-by-nc-nd-4.0
8
+ library_name: transformers
9
+ pipeline_tag: text-generation
10
+ tags:
11
+ - instruction-finetuning
12
+ ---
13
+
14
+ <h1 align="center">
15
+ πŸ” xVerify-9B-I
16
+ </h1>
17
+
18
+ <p align="center">
19
+ <div style="display: flex; justify-content: center; gap: 10px;">
20
+ <a href="https://github.com/IAAR-Shanghai/xVerify">
21
+ <img src="https://img.shields.io/badge/GitHub-Repository-blue?logo=github" alt="GitHub"/>
22
+ </a>
23
+ <a href="https://huggingface.co/IAAR-Shanghai/xVerify-9B-I">
24
+ <img src="https://img.shields.io/badge/πŸ€—%20Hugging%20Face-xVerify--9B--I-yellow" alt="Hugging Face"/>
25
+ </a>
26
+ <a href="https://huggingface.co/papers/2504.10481">
27
+ <img src="https://img.shields.io/badge/Paper-Arxiv-red?logo=arxiv" alt="Paper"/>
28
+ </a>
29
+ </div>
30
+ </p>
31
+
32
+ xVerify is an evaluation tool fine-tuned from a pre-trained large language model, designed specifically for objective questions with a single correct answer. It was introduced in the paper [xVerify: Efficient Answer Verifier for Reasoning Model Evaluations](https://huggingface.co/papers/2504.10481).
33
+
34
+ The model accurately extracts the final answer from lengthy reasoning processes and efficiently identifies equivalence across different forms of expressions.
35
+
36
+ ---
37
+
38
+ ## ✨ Key Features
39
+
40
+ ### πŸ“Š Broad Applicability
41
+ Suitable for various objective question evaluation scenarios including math problems, multiple-choice questions, classification tasks, and short-answer questions.
42
+
43
+ ### ⛓️ Handles Long Reasoning Chains
44
+ Effectively processes answers with extensive reasoning steps to extract the final answer, regardless of complexity.
45
+
46
+ ### 🌐 Multilingual Support
47
+ Primarily handles Chinese and English responses while remaining compatible with other languages.
48
+
49
+ ### πŸ”„ Powerful Equivalence Judgment
50
+ - βœ“ Recognizes basic transformations like letter case changes and Greek letter conversions
51
+ - βœ“ Identifies equivalent mathematical expressions across formats (LaTeX, fractions, scientific notation)
52
+ - βœ“ Determines semantic equivalence in natural language answers
53
+ - βœ“ Matches multiple-choice responses by content rather than just option identifiers
54
+
55
+ ---
56
+
57
+
58
+ ## πŸ“š Citation
59
+
60
+ ```bibtex
61
+ @article{xVerify,
62
+ title={xVerify: Efficient Answer Verifier for Reasoning Model Evaluations},
63
+ author={Ding Chen and Qingchen Yu and Pengyuan Wang and Wentao Zhang and Bo Tang and Feiyu Xiong and Xinchi Li and Minchuan Yang and Zhiyu Li},
64
+ journal={arXiv preprint arXiv:2504.10481},
65
+ year={2025},
66
+ }
67
  ```