Duplicated from sanchit-gandhi/whisper-large-v3

openai
/

whisper-large-v3

Automatic Speech Recognition

hf-asr-leaderboard

Model card Files Files and versions

Fix error in config.json

#9

by pere - opened Nov 10, 2023

base: refs/heads/main

←

from: refs/pr/9

Discussion Files changed

@patrickvonplaten @Sanchit

The decoder_start_token_id should refer to the <|startoftranscript|> token in the vocabulary.

Fix error in config.jsone8320864

patrickvonplaten

•

edited Nov 16, 2023

Thanks for the fix, I agree that this needs to be corrected as it should match v2 in it's generation config: https://huggingface.co/openai/whisper-large-v2/blob/696465c62215e36a9ab3f9b7672fe7749f1a1df5/config.json#L19

patrickvonplaten changed pull request status to merged Nov 16, 2023

patrickvonplaten

Thanks a lot @pere

Good catch @pere ! We converted the generation_config standalone but missed the generation attributes in the config. The bos_token_id and eos_token_id also need updating: https://huggingface.co/openai/whisper-large-v3/discussions/25#6555f5d2ef6e96329fd5db2f

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment