Audio2Image / requirements.txt
HariLogicgo's picture
added model and weights
40cfce6
decord
peft
onnxruntime
pandas
matplotlib
-e git+https://github.com/facebookresearch/sam2.git@0e78a118995e66bb27d78518c4bd9a3e95b4e266#egg=SAM-2
loguru
sentencepiece
openai-whisper
HyperPyYAML
inflect
omegaconf
hydra-core
lightning
rich
gdown
wget
pyarrow
pyworld
librosa
modelscope
GitPython
torch>=2.4.0
torchvision>=0.19.0
torchaudio
opencv-python>=4.9.0.80
diffusers>=0.31.0
transformers>=4.49.0,<=4.51.3
tokenizers>=0.20.3
accelerate>=1.1.1
tqdm
imageio[ffmpeg]
easydict
ftfy
huggingface-hub>=0.24.0
safetensors
flash-attn
numpy>=1.23.5,<2