AutoModel.from_pretrained error in loading state_dict

by Srymaker - opened Feb 26, 2025

Discussion

Srymaker

Feb 26, 2025

Why does this happen? I have already tried updating transformers to the latest version.

hehesang

Feb 26, 2025

same problem

Xiangtai

Feb 26, 2025

edited Feb 26, 2025

I hit the same error. It seems that the text prediction head (weights and bias) has shape [1152, 1152] in the current transformers release, while the weights the authors provided have shape [1536, 1152] to match the visual token output.
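To make that kind of mismatch easy to spot, here is a minimal sketch that compares the shapes a model expects against the shapes stored in a checkpoint. The helper name find_shape_mismatches and the parameter name "text_head.weight" are hypothetical, used only for illustration; the two shapes are the ones reported above.

```python
from typing import Dict, List, Tuple

Shape = Tuple[int, ...]


def find_shape_mismatches(
    expected: Dict[str, Shape],
    checkpoint: Dict[str, Shape],
) -> List[Tuple[str, Shape, Shape]]:
    """Return (name, expected_shape, checkpoint_shape) for shared keys whose shapes differ."""
    mismatches = []
    for name, exp_shape in expected.items():
        ckpt_shape = checkpoint.get(name)
        # Only report keys present in both dicts; missing keys are a separate problem.
        if ckpt_shape is not None and tuple(ckpt_shape) != tuple(exp_shape):
            mismatches.append((name, tuple(exp_shape), tuple(ckpt_shape)))
    return mismatches


# Hypothetical parameter name; the shapes are the ones quoted in this thread.
expected = {"text_head.weight": (1152, 1152)}
checkpoint = {"text_head.weight": (1536, 1152)}

mismatches = find_shape_mismatches(expected, checkpoint)
for name, exp, got in mismatches:
    print(f"{name}: model expects {exp}, checkpoint has {got}")
```

With PyTorch, the two dictionaries could be built with something like {k: tuple(v.shape) for k, v in model.state_dict().items()} for the instantiated model and the same expression over the loaded checkpoint tensors.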

Xiangtai

Feb 26, 2025

The bug is in the current transformers source code.

hehesang

Feb 26, 2025

This version (https://github.com/huggingface/transformers/releases/tag/v4.49.0-SigLIP-2) should fix the problem.
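After switching versions, it can help to confirm which transformers build the environment actually picks up, since an old install may still be on the path. A minimal sketch using only the standard library; the helper name installed_version is hypothetical, and the exact version string that the tagged release reports is not stated in this thread.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version string of `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Prints None if transformers is not installed in the active environment.
print("transformers:", installed_version("transformers"))
```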
