Uploaded model
- Developed by: mmmanuel
- License: apache-2.0
- Finetuned from model: mmmanuel/SFT_nochat_FULL_DATA
This Qwen3 model was trained 2x faster with Unsloth and Hugging Face's TRL library.
Model tree for mmmanuel/DPO_ONSFT_HALF_DATA_3
Base model
mmmanuel/SFT_nochat_FULL_DATA