Update README.md
README.md
CHANGED
|
@@ -11,13 +11,7 @@ pipeline_tag: text-classification
|
|
| 11 |
|
| 12 |
## Model Description
|
| 13 |
|
| 14 |
-
`ReasonEval-7B` is a 7B parameter decoder-only language model fine-tuned from [`WizardMath-7B-V1.1`](https://huggingface.co/WizardLM/WizardMath-7B-V1.1).
|
| 15 |
-
|
| 16 |
-
<p align="center">
|
| 17 |
-
<img src="introduction.jpg" alt="error" style="width:95%;">
|
| 18 |
-
</p>
|
| 19 |
-
|
| 20 |
-
`ReasonEval-7B` assesses the problem-solving process in a step-by-step format from the following perspectives:
|
| 21 |
- **Validity**: The step contains no mistakes in calculation and logic.
|
| 22 |
- **Redundancy**: The step lacks utility in solving the problem but is still valid.
|
| 23 |
|
|
@@ -35,12 +29,12 @@ With ReasonEval, you can
|
|
| 35 |
classification head for next-token prediction is replaced with a classification head for outputting the
|
| 36 |
probabilities of each class of reasoning steps.
|
| 37 |
* **Language(s)**: English
|
| 38 |
-
* **Paper**: [Evaluating Mathematical Reasoning Beyond Accuracy](
|
| 39 |
* **Github**: [https://github.com/GAIR-NLP/ReasonEval](https://github.com/GAIR-NLP/ReasonEval)
|
| 40 |
* **Finetuned from model**: [https://huggingface.co/WizardLM/WizardMath-7B-V1.1](https://huggingface.co/WizardLM/WizardMath-7B-V1.1)
|
| 41 |
* **Fine-tuning Data**: [PRM800K](https://github.com/openai/prm800k)
|
| 42 |
|
| 43 |
-
For detailed instructions on how to use the ReasonEval-7B model, visit our GitHub repository at [https://github.com/GAIR-NLP/ReasonEval](https://github.com/GAIR-NLP/ReasonEval).
|
| 44 |
## How to Cite
|
| 45 |
```bibtex
|
| 46 |
```
|
|
|
|
| 11 |
|
| 12 |
## Model Description
|
| 13 |
|
| 14 |
+
`ReasonEval-7B` is a 7B-parameter decoder-only language model fine-tuned from [`WizardMath-7B-V1.1`](https://huggingface.co/WizardLM/WizardMath-7B-V1.1). Given a mathematical problem and a solution generated by an LLM, `ReasonEval-7B` assesses the problem-solving process in a step-by-step format from the following perspectives:
|
| 15 |
- **Validity**: The step contains no mistakes in calculation and logic.
|
| 16 |
- **Redundancy**: The step lacks utility in solving the problem but is still valid.
|
| 17 |
|
|
|
|
| 29 |
classification head for next-token prediction is replaced with a classification head for outputting the
|
| 30 |
probabilities of each class of reasoning steps.
|
| 31 |
* **Language(s)**: English
|
| 32 |
+
* **Paper**: [Evaluating Mathematical Reasoning Beyond Accuracy]()
|
| 33 |
* **Github**: [https://github.com/GAIR-NLP/ReasonEval](https://github.com/GAIR-NLP/ReasonEval)
|
| 34 |
* **Finetuned from model**: [https://huggingface.co/WizardLM/WizardMath-7B-V1.1](https://huggingface.co/WizardLM/WizardMath-7B-V1.1)
|
| 35 |
* **Fine-tuning Data**: [PRM800K](https://github.com/openai/prm800k)
|
| 36 |
|
| 37 |
+
For detailed instructions on how to use the ReasonEval-7B model, visit our GitHub repository at [https://github.com/GAIR-NLP/ReasonEval](https://github.com/GAIR-NLP/ReasonEval) and the [paper]().
|
| 38 |
## How to Cite
|
| 39 |
```bibtex
|
| 40 |
```
|