Spaces: qgallouedec/tito

fix(article): EOS-trim the prefix in the tool-response delta

by kashif HF Staff - opened May 28

base: refs/heads/main ← from: refs/pr/4

Files changed: +10 -4

kashif

May 28

Verified against apply_chat_template on the post's own Qwen2.5 example: before the fix, buffer + delta != full render; after the fix, the two are equal.

Fix: trim the prefix back to the last <|im_end|> before subtracting, so the separator that follows it lands in delta (loss-masked scaffolding) instead of vanishing between the two renders. Llama 3's template emits nothing after its stop token <|eot_id|>, so the trim is a no-op there, which is why the bug was easy to miss. Matches what TRL's _get_tool_suffix_ids already does.
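A minimal sketch of the mismatch and the trim, for illustration only. It works on strings rather than token IDs, and `render` is a hypothetical toy ChatML-style renderer standing in for apply_chat_template (the real Qwen2.5 template formats tool messages differently). `tool_delta` and its `trim_to_eos` flag are likewise assumed names, not the article's or TRL's code.

```python
IM_END = "<|im_end|>"


def render(messages, add_generation_prompt=False):
    """Toy ChatML-style renderer (hypothetical stand-in for apply_chat_template)."""
    out = "".join(
        f"<|im_start|>{m['role']}\n{m['content']}{IM_END}\n" for m in messages
    )
    if add_generation_prompt:
        out += "<|im_start|>assistant\n"
    return out


def tool_delta(prefix_msgs, tool_msg, trim_to_eos):
    """Return the text that the tool turn adds on top of the prefix render."""
    prefix = render(prefix_msgs)
    full = render(prefix_msgs + [tool_msg], add_generation_prompt=True)
    if trim_to_eos:
        # Cut the prefix right after its last stop token, so whatever the
        # template writes after it (here "\n") ends up in the delta.
        prefix = prefix[: prefix.rindex(IM_END) + len(IM_END)]
    assert full.startswith(prefix)
    return full[len(prefix):]


user = {"role": "user", "content": "Weather in Paris?"}
assistant = {"role": "assistant", "content": "get_weather(city='Paris')"}
tool = {"role": "tool", "content": "22C"}

# The rollout buffer holds the prompt plus the sampled completion, which
# stops at the stop token: no trailing "\n" was ever generated.
buffer = render([user], add_generation_prompt=True) + assistant["content"] + IM_END
full = render([user, assistant, tool], add_generation_prompt=True)

naive = tool_delta([user, assistant], tool, trim_to_eos=False)
fixed = tool_delta([user, assistant], tool, trim_to_eos=True)

print(buffer + naive == full)  # separator lost between the two renders
print(buffer + fixed == full)  # separator recovered inside the delta
```

With a template that writes nothing after its stop token, `prefix.rindex(IM_END) + len(IM_END)` already equals `len(prefix)`, so the slice changes nothing, which is consistent with the Llama 3 observation above.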

🤖 Generated with Claude Code

fix(article): EOS-trim the prefix in the tool-response delta (986b6fc2)

Ready to merge

This branch is ready to get merged automatically.
