openensemble/pocket-tts

Tags: Text-to-Speech, Pocket-TTS, Safetensors, English, tts, voice-cloning, mirror

    How to use openensemble/pocket-tts with Pocket-TTS:

    from pocket_tts import TTSModel
    import scipy.io.wavfile

    # Load the TTS model.
    tts_model = TTSModel.load_model("openensemble/pocket-tts")
    # Build a voice state from a reference audio clip (the voice to clone).
    voice_state = tts_model.get_state_for_audio_prompt(
        "hf://kyutai/tts-voices/alba-mackenna/casual.wav"
    )
    # Synthesize the text in that voice.
    audio = tts_model.generate_audio(voice_state, "Hello world, this is a test.")
    # Audio is a 1D torch tensor containing PCM data.
    scipy.io.wavfile.write("output.wav", tts_model.sample_rate, audio.numpy())
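
The usage snippet above hands the generated tensor straight to scipy.io.wavfile.write. Where a 16-bit integer WAV is preferred, or SciPy is not installed, the same step can be sketched with the standard library alone. This is a minimal sketch and not part of Pocket-TTS: write_pcm16_wav is a hypothetical helper, and it assumes the samples are mono floats in [-1.0, 1.0], which the model card does not state.

```python
import array
import sys
import wave


def write_pcm16_wav(path, sample_rate, samples):
    """Write mono float samples as a 16-bit PCM WAV file.

    Hypothetical helper, not part of Pocket-TTS. Assumes each sample is a
    float in [-1.0, 1.0]; values outside that range are clipped.
    Returns the number of frames written.
    """
    pcm = array.array("h")  # signed 16-bit integers, native byte order
    for sample in samples:
        clipped = max(-1.0, min(1.0, float(sample)))
        pcm.append(int(round(clipped * 32767)))

    # WAV data is little-endian, so swap bytes on big-endian machines.
    if sys.byteorder == "big":
        pcm.byteswap()

    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)   # mono
        wav_file.setsampwidth(2)   # 2 bytes = 16 bits per sample
        wav_file.setframerate(int(sample_rate))
        wav_file.writeframes(pcm.tobytes())
    return len(pcm)
```

With the variables from the snippet above, it could be called as write_pcm16_wav("output_int16.wav", tts_model.sample_rate, audio.tolist()), assuming the tensor holds float samples.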
pocket-tts (230 MB)
  • 1 contributor
  • History: 4 commits
  • Latest commit: 0c22ea9 (verified) by openensemble, "Add OE Default voice-state (Pocket TTS)", 21 days ago
  • languages/
    Upload languages/english/model.safetensors with huggingface_hub, 21 days ago
  • .gitattributes (1.52 kB)
    initial commit, 21 days ago
  • README.md (1.08 kB)
    Upload README.md with huggingface_hub, 21 days ago
  • default-voice.safetensors (11.3 MB, xet)
    Add OE Default voice-state (Pocket TTS), 21 days ago