Anna4242's picture
Upload RL-trained model from outputs/nemotron-multihop-qwen2.5-7b-rl/final_model
9997bba verified