I am getting error for shape mismatch with the provided code

#7
by abhgarg - opened

This is a complete description:

RuntimeError: Error(s) in loading state_dict for Gemma3ForConditionalGeneration:
size mismatch for language_model.model.embed_tokens.weight: copying a param with shape torch.Size([262208, 2560]) from checkpoint, the shape in current model is torch.Size([262144, 2560]).

Also I think the pip install should have a hypen between Gemma and 3 ?

pip install git+https://github.com/huggingface/transformers@v4.49.0-Gemma-3

Seems to be working with latest code.

abhgarg changed discussion status to closed

Sign up or log in to comment