seven-cat committed on
Commit aa6938e · verified · 1 Parent(s): 9d86507

Update README.md

  1. README.md +3 -9
README.md CHANGED
@@ -11,13 +11,7 @@ pipeline_tag: text-classification

  ## Model Description

- `ReasonEval-7B` is a 7B parameter decoder-only language model fine-tuned from [`WizardMath-7B-V1.1`](https://huggingface.co/WizardLM/WizardMath-7B-V1.1).
-
- <p align="center">
- <img src="introduction.jpg" alt="error" style="width:95%;">
- </p>
-
- `ReasonEval-7B` assesses the problem-solving process in a step-by-step format from the following perspectives:
+ `ReasonEval-7B` is a 7B parameter decoder-only language model fine-tuned from [`WizardMath-7B-V1.1`](https://huggingface.co/WizardLM/WizardMath-7B-V1.1). Given a mathematical problem and the solution generated by LLMs, `ReasonEval-7B` assesses the problem-solving process in a step-by-step format from the following perspectives:
  - **Validity**: The step contains no mistakes in calculation and logic.
  - **Redundancy**: The step lacks utility in solving the problem but is still valid.

@@ -35,12 +29,12 @@ With ReasonEval, you can
  classification head for next-token prediction is replaced with a classification head for outputting the
  possibilities of each class of reasong steps.
  * **Language(s)**: English
- * **Paper**: [Evaluating Mathematical Reasoning Beyond Accuracy](https://drive.google.com/file/d/1Lw1uGFzTUWxo3mB91sfdusSrxnCCO9mR/view?usp=sharing)
+ * **Paper**: [Evaluating Mathematical Reasoning Beyond Accuracy]()
  * **Github**: [https://github.com/GAIR-NLP/ReasonEval](https://github.com/GAIR-NLP/ReasonEval)
  * **Finetuned from model**: [https://huggingface.co/WizardLM/WizardMath-7B-V1.1](https://huggingface.co/WizardLM/WizardMath-7B-V1.1)
  * **Fine-tuning Data**: [PRM800K](https://github.com/openai/prm800k)

- For detailed instructions on how to use the ReasonEval-7B model, visit our GitHub repository at [https://github.com/GAIR-NLP/ReasonEval](https://github.com/GAIR-NLP/ReasonEval).
+ For detailed instructions on how to use the ReasonEval-7B model, visit our GitHub repository at [https://github.com/GAIR-NLP/ReasonEval](https://github.com/GAIR-NLP/ReasonEval) and the [paper]() .
  ## How to Cite
  ```bibtex
  ```