Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

drbaph
/
MegaTTS3-WaveVAE

Text-to-Speech
Transformers
Safetensors
PyTorch
tts
voice-cloning
speech-synthesis
audio
chinese
english
zero-shot
diffusion
Model card Files Files and versions
xet
Community

Instructions to use drbaph/MegaTTS3-WaveVAE with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Transformers

    How to use drbaph/MegaTTS3-WaveVAE with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline
    
    pipe = pipeline("text-to-speech", model="drbaph/MegaTTS3-WaveVAE")
    # Load model directly
    from transformers import AutoModel
    model = AutoModel.from_pretrained("drbaph/MegaTTS3-WaveVAE", dtype="auto")
  • Notebooks
  • Google Colab
  • Kaggle
MegaTTS3-WaveVAE
4.39 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 6 commits
drbaph's picture
drbaph
Update README.md
e1e4a7a verified 10 months ago
  • aligner_lm
    Upload 24 files 10 months ago
  • diffusion_transformer
    Upload 24 files 10 months ago
  • duration_lm
    Upload 24 files 10 months ago
  • g2p
    Upload 24 files 10 months ago
  • wavvae
    Upload 24 files 10 months ago
  • .gitattributes
    1.57 kB
    Upload 24 files 10 months ago
  • .msc
    1.81 kB
    Upload 24 files 10 months ago
  • .mv
    36 Bytes
    Upload 24 files 10 months ago
  • README.md
    4.02 kB
    Update README.md 10 months ago
  • config.json
    68 Bytes
    Upload 24 files 10 months ago
  • configuration.json
    72 Bytes
    Upload 24 files 10 months ago