the vocab is mismatched with the original phi-2
#2
by Spico - opened
Thanks for sharing the model~
It seems the vocab size (50296) is smaller than the original phi-2's (51200). Were any special operations performed to drop tokens from the vocab?
lxuechen changed discussion status to closed
Hi, thanks for the message. Yeah, the embedding size changed when I added the padding token. The original embedding size had been padded up to a multiple of 64.
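For anyone hitting the same mismatch: in `transformers` this typically happens via `resize_token_embeddings` after adding a pad token, which shrinks (or grows) the embedding matrix to the tokenizer length. A minimal sketch of the arithmetic with a toy embedding (the token counts come from this thread; the exact tokenizer length of 50295 is an assumption, and the tiny embedding dim is just for illustration):

```python
import torch
import torch.nn as nn

# Original phi-2 rounds its vocab up to a multiple of 64 (51200 = 800 * 64),
# so the embedding has more rows than the tokenizer has tokens.
orig_vocab = 51200
tokenizer_len = 50295       # hypothetical raw tokenizer length
new_vocab = tokenizer_len + 1  # + 1 pad token -> 50296

# Toy embedding standing in for the real token-embedding matrix.
emb = nn.Embedding(orig_vocab, 8)

# Resizing to the tokenizer length keeps the first new_vocab rows and
# drops the unused padding rows at the end (what resize_token_embeddings
# does when shrinking).
resized = nn.Embedding(new_vocab, 8)
resized.weight.data.copy_(emb.weight.data[:new_vocab])

print(orig_vocab % 64, resized.weight.shape[0])  # 0 50296
```

The dropped rows were never reachable from the tokenizer, so shrinking them away doesn't lose any real tokens.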