CNN-DPO-LoRA / tokenizer.json

Commit History

LoRA adapter: SFT (CNN style) + DPO (anti-hallucination)
b414e1e
verified

aryan14072001 commited on