Text Generation
PEFT
lora
trl
naming
brand-generation
controllable-generation
nomen-ai / scripts /train_dpo.py

Commit History

Set left-padding tokenizer for DPOTrainer
439dbca
verified

krystv commited on

Add DPO training script
70f3ce3
verified

krystv commited on