| license: mit | |
| datasets: | |
| - Loie/VGGSound | |
| - CLAPv2/Clotho | |
| - cvssp/WavCaps | |
| base_model: | |
| - hkchengrex/MMAudio | |
| pipeline_tag: video-text-to-text | |
| tags: | |
| - video-to-audio | |
| Code: https://github.com/Cactooz/DeepMMAudio |
| license: mit | |
| datasets: | |
| - Loie/VGGSound | |
| - CLAPv2/Clotho | |
| - cvssp/WavCaps | |
| base_model: | |
| - hkchengrex/MMAudio | |
| pipeline_tag: video-text-to-text | |
| tags: | |
| - video-to-audio | |
| Code: https://github.com/Cactooz/DeepMMAudio |