NoManDeRY
/

DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-1.0

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Model card Files Files and versions

DPO-Shift-Qwen-2-7B-Ultrafeedback-fixed-1.0 / merges.txt

NoManDeRY's picture

Upload folder using huggingface_hub

3f33393 verified 10 months ago

history contribute delete

1.67 MB

File too large to display, you can check the raw version instead.