File size: 419 Bytes
16e2d0b d236a09 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 | ---
library_name: transformers
pipeline_tag: text-generation
base_model:
- Qwen/Qwen2.5-1.5B
---
## UFT
This repository contains the model presented in [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://huggingface.co/papers/2505.16984).
Code: https://github.com/liumy2010/UFT
## References
* [UFT: Unifying Supervised and Reinforcement Fine-Tuning](https://arxiv.org/abs/2505.16984)
|