Tags: Text Classification · Transformers · Safetensors · English · Chinese · qwen2 · feature-extraction · reward model · custom_code · text-embeddings-inference
Instructions to use Qwen/Qwen2.5-Math-PRM-7B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Qwen/Qwen2.5-Math-PRM-7B with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-classification", model="Qwen/Qwen2.5-Math-PRM-7B", trust_remote_code=True)
```

```python
# Load model directly
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-Math-PRM-7B", trust_remote_code=True)
model = AutoModel.from_pretrained("Qwen/Qwen2.5-Math-PRM-7B", trust_remote_code=True)
```

- Notebooks
- Google Colab
- Kaggle
question about the step separator "\n\n"
#3
by pixas
I wonder whether the step separator "\n\n" is required. As you suggest, each step of the solution should be connected by a "\n\n" separator. But in your example, you only insert the special token "<extra_0>" to connect two steps. So I wonder whether the "\n\n" separator is required to compute the reward score?
When using responses from Qwen2.5-Math-Instruct, we recommend splitting the response on '\n\n' to build the list of steps; this does not mean you should join the steps with '\n\n' when calculating step rewards.
In the example responses, the steps are already separated by '\n\n' and represented as a list. Special tokens are used to mark the positions at which step rewards are calculated.
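To make the two roles of the separators concrete, here is a minimal sketch: '\n\n' is used only to split the raw response into a step list, while "<extra_0>" joins the steps and marks where rewards are read. It assumes the "<extra_0>" convention and the two-class per-token output shown in the model card; the question, response, and system prompt below are made up for illustration.

```python
# Minimal sketch: split a response on "\n\n" into steps, join the steps with
# the "<extra_0>" special token, and read one reward per <extra_0> position.
import torch
from transformers import AutoModel, AutoTokenizer

model_name = "Qwen/Qwen2.5-Math-PRM-7B"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModel.from_pretrained(model_name, trust_remote_code=True).eval()

# Illustrative inputs (not from the model card).
question = "What is 2 + 3 * 4?"
response = "First, compute 3 * 4 = 12.\n\nThen add 2 to get 14.\n\nThe answer is 14."

# '\n\n' is only used here, to build the step list from the raw response.
steps = response.split("\n\n")
messages = [
    {"role": "system", "content": "Please reason step by step."},
    {"role": "user", "content": question},
    # Steps are joined with <extra_0>, not with "\n\n".
    {"role": "assistant", "content": "<extra_0>".join(steps) + "<extra_0>"},
]
conversation = tokenizer.apply_chat_template(messages, tokenize=False)
input_ids = tokenizer.encode(conversation, return_tensors="pt")

with torch.no_grad():
    logits = model(input_ids=input_ids)[0]  # assumed shape: (1, seq_len, 2)

# Rewards are read only at the <extra_0> positions, one per step.
step_sep_id = tokenizer.encode("<extra_0>")[0]
mask = (input_ids == step_sep_id)[0]
step_rewards = logits.softmax(dim=-1)[0][mask][:, 1].tolist()
print(step_rewards)  # one probability per step that the step is correct
```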