Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

aufklarer
/
VoxCPM2-LiteRT

Text-to-Speech
LiteRT
LiteRT
VoxCPM
tts
voice-cloning
voice-design
android
Model card Files Files and versions
xet
Community

Instructions to use aufklarer/VoxCPM2-LiteRT with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • LiteRT

    How to use aufklarer/VoxCPM2-LiteRT with LiteRT:

    # No code snippets available yet for this library.
    
    # To use this model, check the repository files and the library's documentation.
    
    # Want to help? PRs adding snippets are welcome at:
    # https://github.com/huggingface/huggingface.js
  • VoxCPM

    How to use aufklarer/VoxCPM2-LiteRT with VoxCPM:

    import soundfile as sf
    from voxcpm import VoxCPM
    
    model = VoxCPM.from_pretrained("aufklarer/VoxCPM2-LiteRT")
    
    wav = model.generate(
        text="VoxCPM is an innovative end-to-end TTS model from ModelBest, designed to generate highly expressive speech.",
        prompt_wav_path=None,      # optional: path to a prompt speech for voice cloning
        prompt_text=None,          # optional: reference text
        cfg_value=2.0,             # LM guidance on LocDiT, higher for better adherence to the prompt, but maybe worse
        inference_timesteps=10,   # LocDiT inference timesteps, higher for better result, lower for fast speed
        normalize=True,           # enable external TN tool
        denoise=True,             # enable external Denoise tool
        retry_badcase=True,        # enable retrying mode for some bad cases (unstoppable)
        retry_badcase_max_times=3,  # maximum retrying times
        retry_badcase_ratio_threshold=6.0, # maximum length restriction for bad case detection (simple but effective), it could be adjusted for slow pace speech
    )
    
    sf.write("output.wav", wav, 16000)
    print("saved: output.wav")
  • Notebooks
  • Google Colab
  • Kaggle
VoxCPM2-LiteRT
4.65 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
aufklarer's picture
aufklarer
Add VoxCPM2 LiteRT INT8 bundle
6a721a1 verified 1 day ago
  • .gitattributes
    1.52 kB
    initial commit 1 day ago
  • README.md
    4.81 kB
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago
  • config.json
    11.7 kB
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago
  • generation_config.json
    3 Bytes
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago
  • special_tokens_map.json
    1.63 kB
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago
  • tokenization_voxcpm2.py
    2.9 kB
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago
  • tokenizer.json
    3.68 MB
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago
  • tokenizer_config.json
    5.06 kB
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago
  • voxcpm2-audio-decoder.tflite
    183 MB
    xet
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago
  • voxcpm2-audio-encoder.tflite
    192 MB
    xet
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago
  • voxcpm2-text-prefill.tflite
    2.08 GB
    xet
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago
  • voxcpm2-token-step.tflite
    2.19 GB
    xet
    Add VoxCPM2 LiteRT INT8 bundle 1 day ago