DeepSeekMath-Base-SFT-Step-DPO / trainer_state.json
xinlai
upload model
07ddda3