How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-classification", model="GAIR/ReasonEval-7B")
# Load model directly
from transformers import AutoTokenizer, ReasonEval_7B

tokenizer = AutoTokenizer.from_pretrained("GAIR/ReasonEval-7B")
model = ReasonEval_7B.from_pretrained("GAIR/ReasonEval-7B")
Quick Links

ReasonEval-7B Model Card

Model Description

ReasonEval-7B is a 7B parameter decoder-only language model fine-tuned from WizardMath-7B-V1.1. Given a mathematical problem and the solution, ReasonEval-7B assesses the problem-solving process in a step-by-step format from the following perspectives:

  • Validity: The step contains no mistakes in calculation and logic.
  • Redundancy: The step lacks utility in solving the problem but is still valid.

With ReasonEval, you can

  • ๐Ÿ“ quantify the quality of reasoning steps free of human or close-source models.

  • ๐Ÿค– find the potential invalid or redundant steps in the solutions even with the correct results.

  • ๐Ÿ› ๏ธ select high-quality training data for downstream tasks (e.g., fine-tuning).

Model Details

For detailed instructions on how to use the ReasonEval-7B model, visit our GitHub repository at https://github.com/GAIR-NLP/ReasonEval.

How to Cite

@article{xia2024evaluating,
        title={Evaluating Mathematical Reasoning Beyond Accuracy}, 
        author={Xia, Shijie and Li, Xuefeng and Liu, Yixin and Wu, Tongshuang and Liu, Pengfei},
        journal={arXiv preprint arXiv:2404.05692},
        year={2024},
}
Downloads last month
34
Safetensors
Model size
7B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for GAIR/ReasonEval-7B

Adapters
6 models

Paper for GAIR/ReasonEval-7B