jasonhuang3/Pro6000-ipo-qwen-2-5-7b-math_lora_28k_merged Text Generation • 8B • Updated Oct 7, 2025 • 7
jasonhuang3/99-caldpo-dataset-dpop-our-13-5-zephyr-7b-sft-full-merged-28k 7B • Updated Oct 3, 2025 • 8