feat: update to DPO v6 merged model (BF16 safetensors) ffbdde5 verified intrect committed on 3 days ago
fix: generation_config rep_penalty=1.05, top_k=20, top_p=0.8 (vela-v5-merged) a0a3973 verified intrect committed on Feb 18
fix: match llama-cpp-python defaults (top_k=40, top_p=0.95, rep_penalty=1.0) dea4e49 verified intrect committed on Feb 17
fix: rollback generation_config to safe Qwen2.5 defaults (rep_penalty 1.3→1.1) bfc3043 verified intrect committed on Feb 16
fix: update generation params: repetition_penalty 1.05→1.3, top_k 20→50, top_p 0.8→0.92 d4a1479 verified intrect committed on Feb 16
feat: update to DPO v4 merged model (SFT + DPO v4 language leak fix) c34c4f4 verified intrect committed on Feb 15