trl-mcsd/trl/experimental/bco/bco_trainer.py

Commit History

Implement MCSD for experimental SDPO
1fa3c6c
verified

ihbkaiser committed on