a2_rl_minimal_instructions / generation_config.json

Commit History

Upload export at step 60. Base model: Qwen/Qwen3-32B. Training type: RL.
c9688b0
verified

atutej commited on