Set decoder_start_token_id and output_past in config

by ankrgyl - opened Oct 22, 2022

base: refs/heads/main ← from: refs/pr/3


ankrgyl

Oct 22, 2022

Without the decoder_start_token_id parameter, you get the following ValueError while using the model:

    561 elif (
    562     hasattr(self.config, "decoder")
    563     and hasattr(self.config.decoder, "bos_token_id")
    564     and self.config.decoder.bos_token_id is not None
    565 ):
    566     return self.config.decoder.bos_token_id
--> 567 raise ValueError(
    568     "`decoder_start_token_id` or `bos_token_id` has to be defined for encoder-decoder generation."
    569 )

ValueError: `decoder_start_token_id` or `bos_token_id` has to be defined for encoder-decoder generation.

I compared this repo's config against https://huggingface.co/google/flan-t5-large/blob/main/config.json and noticed that output_past also differs, so this PR sets it as well.
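For readers who want to see why the missing field matters, here is a minimal sketch of the kind of fallback lookup the traceback points at. It is a simplified, hypothetical stand-in, not the library's actual code: the function name `resolve_decoder_start_token_id` is made up for illustration, the configs are plain `SimpleNamespace` objects, and the value `0` used for the fixed config is an assumption (it is not stated in this thread).

```python
from types import SimpleNamespace


def resolve_decoder_start_token_id(config):
    """Hypothetical, simplified stand-in for the lookup shown in the traceback."""
    # Prefer an explicit decoder_start_token_id on the config.
    start_id = getattr(config, "decoder_start_token_id", None)
    if start_id is not None:
        return start_id
    # Otherwise fall back to a top-level bos_token_id.
    bos_id = getattr(config, "bos_token_id", None)
    if bos_id is not None:
        return bos_id
    # Last resort: a nested decoder config with its own bos_token_id.
    decoder = getattr(config, "decoder", None)
    if decoder is not None and getattr(decoder, "bos_token_id", None) is not None:
        return decoder.bos_token_id
    raise ValueError(
        "`decoder_start_token_id` or `bos_token_id` has to be defined "
        "for encoder-decoder generation."
    )


# Config with neither field set: the lookup has nothing to fall back on.
broken = SimpleNamespace(decoder_start_token_id=None, bos_token_id=None)
try:
    resolve_decoder_start_token_id(broken)
    broken_error = None
except ValueError as exc:
    broken_error = str(exc)

# Config with the field set. The value 0 is an assumption for illustration.
fixed = SimpleNamespace(decoder_start_token_id=0, bos_token_id=None)
fixed_start_id = resolve_decoder_start_token_id(fixed)
```

Note that the checks use `is not None` rather than truthiness, so a start token id of `0` still counts as defined.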

Commit 12cb4a2c: Set decoder_start_token_id and output_past in config

ankrgyl

Oct 22, 2022

I believe this fixes https://huggingface.co/google/flan-t5-xxl/discussions/2

ybelkada

Oct 22, 2022

This definitely fixes the issue, yes! Great catch! Thanks for spotting it - indeed we forgot to add it when porting the model :-)

ybelkada changed pull request status to merged Oct 22, 2022
