File size: 419 Bytes
f3318ea
 
 
 
 
 
 
 
 
 
 
 
 
 
101c8bc
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
---
library_name: transformers
pipeline_tag: text-generation
base_model:
- Qwen/Qwen2.5-0.5B
---

## UFT

This repository contains the model presented in [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://huggingface.co/papers/2505.16984).

Code: https://github.com/liumy2010/UFT
    
    ## References

    * [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)