RoadQAQ
/

ReLIFT-Qwen2.5-Math-7B-Zero

Question Answering

text-generation

text-generation-inference

Model card Files Files and versions

ReLIFT-Qwen2.5-Math-7B-Zero / README.md

RoadQAQ's picture

Add model card (#1)

6424ae6 verified 7 months ago

|

history blame contribute delete

432 Bytes

	---
	license: cc-by-nc-4.0
	library_name: transformers
	pipeline_tag: question-answering
	---

	This repository contains the ReLIFT model presented in [Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions](https://huggingface.co/papers/2506.07527).

	Code: https://github.com/TheRoadQaQ/ReLIFT

	Hugging Face Collection: https://huggingface.co/collections/RoadQAQ/relift-684535e199a909cad16d8b05