DeepMMAudio / README.md
Cactooz's picture
Update datasets used
39b166f verified
metadata
license: mit
datasets:
  - Loie/VGGSound
  - CLAPv2/Clotho
  - cvssp/WavCaps
base_model:
  - hkchengrex/MMAudio
pipeline_tag: video-text-to-text
tags:
  - video-to-audio

Code: https://github.com/Cactooz/DeepMMAudio