nielsr's picture
nielsr HF Staff
Add model card
315f971 verified
|
raw
history blame
432 Bytes
---
license: cc-by-nc-4.0
library_name: transformers
pipeline_tag: question-answering
---
This repository contains the ReLIFT model presented in [Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions](https://huggingface.co/papers/2506.07527).
Code: https://github.com/TheRoadQaQ/ReLIFT
Hugging Face Collection: https://huggingface.co/collections/RoadQAQ/relift-684535e199a909cad16d8b05