Instructions to use Jason-Lu/Laoliang-voice-clone with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Jason-Lu/Laoliang-voice-clone with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("Jason-Lu/Laoliang-voice-clone", dtype="auto") - Notebooks
- Google Colab
- Kaggle
Configuration Parsing Warning:Config file config.json cannot be fetched (too big)
Models trained from VITS-fast-fine-tuning
- Three speakers: laoliang (่ๆข), specialweek, zhongli.
- The model is based on the C+J base model and trained on a single NVIDIA 3090 with 300 epochs. It takes about 4.5 hours in total.
- During training, we use a single long audio of laoliang (~5 minutes) with auxiliary data as training data.
How to run the model?
- Follow the official instruction, install required libraries.
- Download models and move finetune_speaker.json and G_latest.pth to /path/to/ VITS-fast-fine-tuning.
- Run python VC_inference.py --model_dir ./G_latest.pth --share True to start a local gradio inference demo.
File structure
VITS-fast-fine-tuning
โโโโVC_inference.py
โโโโ...
โโโโfinetune_speaker.json
โโโโG_latest.pth
- Downloads last month
- 10
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support