license: llama3.1
base_model:
  - meta-llama/Llama-3.1-8B

This model was trained with IF-RLVR on IFTrain and IFEval constraints, using meta-llama/Llama-3.1-8B as the base model.
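To give a rough idea of what a verifiable constraint is, here is a minimal sketch of two programmatic checks and a reward computed from them. The function names (`check_min_words`, `check_includes_keyword`, `verifiable_reward`) and the fraction-satisfied reward are illustrative assumptions. They are not the actual IFTrain or IFEval verifiers, nor the reward used to train this model.

```python
import re
from typing import Callable, Sequence


def check_min_words(response: str, min_words: int) -> bool:
    """Hypothetical verifier: the response contains at least `min_words` words."""
    return len(response.split()) >= min_words


def check_includes_keyword(response: str, keyword: str) -> bool:
    """Hypothetical verifier: `keyword` appears as a whole word, case-insensitively."""
    pattern = rf"\b{re.escape(keyword)}\b"
    return re.search(pattern, response, flags=re.IGNORECASE) is not None


def verifiable_reward(response: str, checks: Sequence[Callable[[str], bool]]) -> float:
    """Assumed reward: the fraction of constraint checks the response satisfies."""
    if not checks:
        raise ValueError("at least one constraint check is required")
    passed = sum(1 for check in checks if check(response))
    return passed / len(checks)


# Example: one constraint is satisfied, the other is not.
example_reward = verifiable_reward(
    "The model answers briefly",
    [
        lambda r: check_min_words(r, 3),
        lambda r: check_includes_keyword(r, "verify"),
    ],
)
```

Because each check is a deterministic function of the response text, the reward can be computed without a learned judge, which is the property the "verifiable" in the name refers to.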

Citation

If IFBench, IFTrain, or any of the related materials were helpful to your work, please cite:

@article{pyatkin2025generalizing,
  title={Generalizing Verifiable Instruction Following},
  author={Pyatkin, Valentina and Malik, Saumya and Graf, Victoria and Ivison, Hamish and Huang, Shengyi and Dasigi, Pradeep and Lambert, Nathan and Hajishirzi, Hannaneh},
  journal={Advances in Neural Information Processing Systems},
  volume={38},
  year={2025}
}