feat: update to DPO v6 merged model (BF16 safetensors) ffbdde5 verified intrect commited on 4 days ago
feat: update to DPO v4 merged model (SFT + DPO v4 language leak fix) c34c4f4 verified intrect commited on Feb 15