trl-mcsd / VERSION
ihbkaiser
Implement MCSD for experimental SDPO
1fa3c6c verified
1.3.0.dev0