numpy torch transformers diffusers accelerate librosa soundfile