Duplicated from Sibgat-Ul/meno_tts

Apurba-NSU-RnD-Lab/MenoChat_Vox_TTS

Text-to-Speech
VoxCPM
PyTorch
English
Chinese
speech
speech generation
voice cloning

Instructions to use Apurba-NSU-RnD-Lab/MenoChat_Vox_TTS with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • VoxCPM

    How to use Apurba-NSU-RnD-Lab/MenoChat_Vox_TTS with VoxCPM:

    import soundfile as sf
    from voxcpm import VoxCPM

    # Load the checkpoint from the Hugging Face Hub
    model = VoxCPM.from_pretrained("Apurba-NSU-RnD-Lab/MenoChat_Vox_TTS")

    wav = model.generate(
        text="VoxCPM is an innovative end-to-end TTS model from ModelBest, designed to generate highly expressive speech.",
        prompt_wav_path=None,      # optional: path to a reference recording for voice cloning
        prompt_text=None,          # optional: reference text for the prompt recording
        cfg_value=2.0,             # LM guidance on LocDiT; higher values follow the prompt more closely but may reduce quality
        inference_timesteps=10,    # LocDiT inference timesteps; higher for better quality, lower for faster generation
        normalize=True,            # enable the external text normalization (TN) tool
        denoise=True,              # enable the external denoising tool
        retry_badcase=True,        # retry when a bad case is detected (e.g. generation that does not stop)
        retry_badcase_max_times=3, # maximum number of retries
        retry_badcase_ratio_threshold=6.0,  # length-ratio limit used for bad-case detection (simple but effective); may need adjusting for slow-paced speech
    )

    # Write the waveform to disk at a 16000 Hz sample rate
    sf.write("output.wav", wav, 16000)
    print("saved: output.wav")
  • 2 contributors: RafatK, Sibgat-Ul
History: 1 commit
Duplicate from Sibgat-Ul/meno_tts
3744d75 6 days ago
  • assets
    Duplicate from Sibgat-Ul/meno_tts 6 days ago
  • lora
    Duplicate from Sibgat-Ul/meno_tts 6 days ago
  • .gitattributes
    1.58 kB
    Duplicate from Sibgat-Ul/meno_tts 6 days ago
  • README.md
    12.6 kB
    Duplicate from Sibgat-Ul/meno_tts 6 days ago
  • audiovae.pth

    Detected Pickle imports (3)

    • "torch.FloatStorage",
    • "collections.OrderedDict",
    • "torch._utils._rebuild_tensor_v2"

    301 MB
    xet
    Duplicate from Sibgat-Ul/meno_tts 6 days ago
  • config.json
    2.6 kB
    Duplicate from Sibgat-Ul/meno_tts 6 days ago
  • pytorch_model.bin

    Detected Pickle imports (3)

    • "torch.BFloat16Storage",
    • "collections.OrderedDict",
    • "torch._utils._rebuild_tensor_v2"

    1.3 GB
    xet
    Duplicate from Sibgat-Ul/meno_tts 6 days ago
  • special_tokens_map.json
    1.63 kB
    Duplicate from Sibgat-Ul/meno_tts 6 days ago
  • tokenizer.json
    3.68 MB
    Duplicate from Sibgat-Ul/meno_tts 6 days ago
  • tokenizer_config.json
    4.95 kB
    Duplicate from Sibgat-Ul/meno_tts 6 days ago
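
The "Detected Pickle imports" entries listed for audiovae.pth and pytorch_model.bin name the globals a pickle stream would import when loaded. A minimal sketch of how such a list could be produced, using only the Python standard library and without unpickling anything, follows. The helper name `list_pickle_imports` is hypothetical, this is not the Hub's actual scanner, and the STACK_GLOBAL handling is simplified (it assumes the module and attribute names are the two most recent string arguments in the stream).

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> list[str]:
    """Return sorted 'module.name' globals referenced by a pickle stream.

    The stream is only disassembled with pickletools.genops; it is never
    unpickled, so no code from the stream is executed.
    """
    found = set()
    recent_strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name"
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Simplifying assumption: module and name were the last two
            # strings pushed (true for streams written by pickle.dumps
            # when neither string was already memoized).
            if len(recent_strings) >= 2:
                found.add(f"{recent_strings[-2]}.{recent_strings[-1]}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return sorted(found)


# Illustrative payload only; a real checkpoint would also reference torch globals.
payload = pickle.dumps(OrderedDict(weight=[1.0, 2.0]), protocol=4)
print(list_pickle_imports(payload))
```

For checkpoints whose imports are limited to tensor-rebuilding helpers and `collections.OrderedDict`, as listed above, PyTorch's `torch.load(path, weights_only=True)` is the usual way to restrict what unpickling may import.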