GAIR/ReasonEval-7B

seven-cat committed on Apr 7, 2024

Commit aa6938e · verified · 1 Parent(s): 9d86507

Update README.md

Files changed (1)

README.md +3 -9

@@ -11,13 +11,7 @@ pipeline_tag: text-classification
 ## Model Description
-`ReasonEval-7B` is a 7B parameter decoder-only language model fine-tuned from [`WizardMath-7B-V1.1`](https://huggingface.co/WizardLM/WizardMath-7B-V1.1).
-<p align="center">
-<img src="introduction.jpg" alt="error" style="width:95%;">
-</p>
-`ReasonEval-7B` assesses the problem-solving process in a step-by-step format from the following perspectives:
+`ReasonEval-7B` is a 7B parameter decoder-only language model fine-tuned from [`WizardMath-7B-V1.1`](https://huggingface.co/WizardLM/WizardMath-7B-V1.1). Given a mathematical problem and the solution generated by LLMs, `ReasonEval-7B` assesses the problem-solving process in a step-by-step format from the following perspectives:
 - **Validity**: The step contains no mistakes in calculation and logic.
 - **Redundancy**: The step lacks utility in solving the problem but is still valid.
@@ -35,12 +29,12 @@ With ReasonEval, you can
 classification head for next-token prediction is replaced with a classification head for outputting the
 possibilities of each class of reasong steps.
 * **Language(s)**: English
-* **Paper**: [Evaluating Mathematical Reasoning Beyond Accuracy](https://drive.google.com/file/d/1Lw1uGFzTUWxo3mB91sfdusSrxnCCO9mR/view?usp=sharing)
+* **Paper**: [Evaluating Mathematical Reasoning Beyond Accuracy]()
 * **Github**: [https://github.com/GAIR-NLP/ReasonEval](https://github.com/GAIR-NLP/ReasonEval)
 * **Finetuned from model**: [https://huggingface.co/WizardLM/WizardMath-7B-V1.1](https://huggingface.co/WizardLM/WizardMath-7B-V1.1)
 * **Fine-tuning Data**: [PRM800K](https://github.com/openai/prm800k)
-For detailed instructions on how to use the ReasonEval-7B model, visit our GitHub repository at [https://github.com/GAIR-NLP/ReasonEval](https://github.com/GAIR-NLP/ReasonEval).
+For detailed instructions on how to use the ReasonEval-7B model, visit our GitHub repository at [https://github.com/GAIR-NLP/ReasonEval](https://github.com/GAIR-NLP/ReasonEval) and the [paper]() .
 ## How to Cite
 ```bibtex
 ```