|
|
--- |
|
|
license: cc-by-nc-4.0 |
|
|
tags: |
|
|
- audio-to-video |
|
|
library_name: diffusers |
|
|
--- |
|
|
|
|
|
Models of [R-FLAV](https://arxiv.org/abs/2503.08307) trained on Landscape and AIST++ for 400k iterations. |
|
|
|
|
|
For more info, please refer to the Github repository at https://github.com/ErgastiAlex/R-FLAV |
|
|
|
|
|
To download the ckpts directly in the code you can do |
|
|
```python |
|
|
from huggingface_hub import hf_hub_download |
|
|
import torch |
|
|
from models import FLAV |
|
|
|
|
|
model = FLAV.from_pretrained(args.model_ckpt) |
|
|
|
|
|
hf_hub_download(repo_id="MaverickAlex/R-FLAV-B-1-LS", filename="vocoder/config.json") |
|
|
vocoder_path = hf_hub_download(repo_id="MaverickAlex/R-FLAV-B-1-LS", filename="vocoder/vocoder.pt") |
|
|
|
|
|
vocoder_path = vocoder_path.replace("vocoder.pt", "") |
|
|
vocoder = Generator.from_pretrained(vocoder_path) |
|
|
``` |