ihbkaiser's picture
Implement MCSD for experimental SDPO
1fa3c6c verified
distributed_type: DEEPSPEED
deepspeed_config:
zero_stage: 2
num_processes: 2