Is the config.json wrong?

by kkyeer - opened Mar 13, 2025

Mar 13, 2025

The 4b-pt config.json is much same as 4b-it, and diffrent from 1b-pt version.For example,no vocab_size prop in 4b but can be found in 1b.Is this intended or a mistake?

edmond

Mar 21, 2025

Same problem here

lkv

Google org Aug 26, 2025

Hi ,

Thank you for pointing this out! You're correct — the config.json for the 4b-pt and 4b-it versions appears to omit the vocab_size field, while it's present in the 1b-pt version.

This could be intentional if the tokenizer is assumed to be loaded independently and the model config relies on tokenizer metadata. In many recent transformer-based models, the vocab_size is not strictly required in config.json during inference. Thank you.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment