ValueError: The state dictionary of the model you are trying to load is corrupted.

#55

by dsbyprateekg - opened Jan 23, 2024

dsbyprateekg

Jan 23, 2024

Code-

Error-

Environment-
Colab T4 GPU

Muennighoff

BigScience Workshop org Jan 23, 2024

edited Jan 23, 2024

You're trying to load mT0 with the BLOOM model class; you need to load it with the mT0 model class instead (i.e. T5, I think). The script is in its model card.
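
To make the mismatch concrete, here is a minimal sketch of how the right Auto class could be chosen from a checkpoint's config fields. `pick_auto_class` is a hypothetical helper, not part of transformers, and the two config dictionaries are trimmed, illustrative fragments rather than copies of the real `config.json` files. It assumes that encoder-decoder checkpoints (the T5 family, which mT0 belongs to) set `is_encoder_decoder`, while decoder-only checkpoints such as BLOOM do not.

```python
# Hypothetical helper: map config.json fields to the name of a transformers Auto class.
def pick_auto_class(config: dict) -> str:
    """Return the Auto class name that matches the checkpoint's architecture family."""
    # Encoder-decoder checkpoints (T5, mT5, and so mT0) are expected to set this flag.
    if config.get("is_encoder_decoder", False):
        return "AutoModelForSeq2SeqLM"
    # Decoder-only checkpoints such as BLOOM fall through to the causal LM class.
    return "AutoModelForCausalLM"


# Illustrative config fragments (assumed for this sketch, not fetched from the Hub).
mt0_config = {"model_type": "mt5", "is_encoder_decoder": True}
bloom_config = {"model_type": "bloom"}

print("mt0  ->", pick_auto_class(mt0_config))
print("bloom ->", pick_auto_class(bloom_config))
```

Loading a checkpoint through a class from the other family means the expected parameter names do not line up with the saved weights, which is a plausible reason for the state dictionary being reported as corrupted.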

christopher changed discussion status to closed Jan 23, 2024

dsbyprateekg

Jan 24, 2024

@Muennighoff can you please share a code snippet showing how to do that?

Muennighoff

BigScience Workshop org Jan 24, 2024

From https://huggingface.co/bigscience/mt0-small

# pip install -q transformers accelerate
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

checkpoint = "bigscience/mt0-small"

tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint, torch_dtype="auto", device_map="auto")

inputs = tokenizer.encode("Translate to English: Je t’aime.", return_tensors="pt").to("cuda")
outputs = model.generate(inputs)
print(tokenizer.decode(outputs[0]))

dsbyprateekg

Jan 24, 2024

@Muennighoff Thanks!
It's working now.
