trl-mcsd / examples / cli_configs
423 Bytes
Implement MCSD for experimental SDPO
1fa3c6c verified