Large audio file (more than 2 hours)

#59

by jonfv - opened Aug 18, 2023

Aug 18, 2023

My code:

from transformers import pipeline

pipe = pipeline(
    "automatic-speech-recognition",
    model="openai/whisper-large-v2",
    generate_kwargs={"language": "br", "task": "transcribe"},
    device="cpu",
    use_fast=True
)

res = pipe(YT_AUDIO_FILE, batch_size=10, return_timestamps=True, chunk_length_s=30, stride_length_s=(4, 2))

Why does the pipe finish before the end of the audio? The audio is more than 2 hours long, but the generated transcription covers only minutes of it.

Thx!!!

sanchit-gandhi

Aug 22, 2023

Hey @jonfv - your code looks good. Could you share the audio file so I can reproduce locally on my end?
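In the meantime, a quick way to see how far the transcription actually reaches is to look at the last timestamp in the output. The sketch below assumes the usual result shape when `return_timestamps=True` is set: a `"chunks"` list whose entries each carry a `"timestamp"` tuple of `(start, end)`, where `end` may be `None` for a cut-off final segment. The helper name `transcribed_span_seconds` and the sample data are hypothetical, for illustration only.

```python
def transcribed_span_seconds(chunks):
    """Return the latest timestamp (in seconds) found in the pipeline chunks."""
    latest = 0.0
    for chunk in chunks:
        start, end = chunk["timestamp"]
        # Either value can be None when a segment is cut off.
        for t in (start, end):
            if t is not None:
                latest = max(latest, t)
    return latest


# Hypothetical sample shaped like res["chunks"]; real values come from the pipeline.
sample_chunks = [
    {"timestamp": (0.0, 4.5), "text": " first segment"},
    {"timestamp": (4.5, 9.0), "text": " second segment"},
    {"timestamp": (9.0, None), "text": " cut-off segment"},
]

covered = transcribed_span_seconds(sample_chunks)
print(f"transcript reaches {covered:.1f} s")
```

With the real result, `transcribed_span_seconds(res["chunks"])` can be compared against the audio duration (over 7200 s for a file longer than 2 hours) to confirm where the output stops.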
