Sucial
/

MedleyVox-Inference-WebUI

Model card Files Files and versions

Sucial commited on Feb 22, 2025

Commit

d07f364

·

verified ·

1 Parent(s): 00e3a13

Create README.md

Files changed (1) hide show

pretrained/README.md +21 -0

pretrained/README.md ADDED Viewed

	@@ -0,0 +1,21 @@

+This pretrained model has been fixed according to [fairseq/issues/4585](https://github.com/facebookresearch/fairseq/issues/4585) and you don't need to run `fix_xlsr.py` again.
+If you download the pretrained model from [official website](https://dl.fbaipublicfiles.com/fairseq/wav2vec/xlsr_53_56k.pt), you need to fix the model using the following scripts:
+```python
+import torch
+import os
+from omegaconf import DictConfig, open_dict
+assert os.path.exists("pretrained/xlsr_53_56k.pt"), "Please download the pretrained model xlsr_53_56k.pt from https://dl.fbaipublicfiles.com/fairseq/wav2vec/xlsr_53_56k.pt and put it in the 'pretrained' folder!"
+cp_path = "pretrained/xlsr_53_56k.pt"
+cp = torch.load(cp_path, map_location='cpu', weights_only=False)
+wrong_key = ['eval_wer','eval_wer_config', 'eval_wer_tokenizer', 'eval_wer_post_process', 'autoregressive']
+cfg = DictConfig(cp['cfg'])
+with open_dict(cfg):
+    for k in wrong_key:
+        cfg.task.pop(k)
+cp['cfg'] = cfg
+torch.save(cp, "pretrained/xlsr_53_56k.pt")
+print("Done")
+```