Is this tokenizer messed up?

by joeofportland - opened Nov 12, 2023

Nov 12, 2023

•

edited Nov 12, 2023

I've noticed sometimes the model returns \n\nUSER: at the end of some responses however I don't encounter this issue on your 13b-v2 version. Are they different between the models? I'm using vicuna formatting for multi turn.

Thanks to the team for all the work they put into this model btw, it's very impressive.

dittops

Bud org Nov 12, 2023

I have noticed this happening when the prompt templating is not correct. Try checking if the prompts are in the right format.

joeofportland

Nov 12, 2023

•

edited Nov 12, 2023

Ahh that's what it is: I was using USER: ASSISTANT: template for 13b but it looks like 70b is ### User:\nWrite a python flask code for login management\n\n### Assistant:\n, switching to that format fixed it, thank you!

Out of curiosity are the new lines needed? eg. \n after User: Assistant: and after the response? \n\n? This formatting template stuff has been hard for me to understand while experimenting with LLMs.

Thanks again!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment