johnsett's Collections
tts

updated Nov 24, 2025

  • myshell-ai/MeloTTS-English

    Text-to-Speech • Updated Dec 24, 2024 • 618k • 301

    Note: really nice TTS


  • coqui/XTTS-v2

    Text-to-Speech • Updated Dec 11, 2023 • 4.71M • 3.3k

    Note: TTS that can change voice and language


  • facebook/musicgen-small

    Text-to-Audio • 0.6B • Updated Nov 17, 2023 • 69.6k • 467

  • Spleeter And ASR

    Space • Running • 3

    Separate audio into vocals and accompaniment, transcribe vocals

    Note: takes an audio file, splits the voice out from the music, then extracts text from the voice


  • Speaker Diarization

    Space • Running • 31

    Speaker diarization, speaker segmentation


  • pyannote/segmentation-3.0

    Voice Activity Detection • Updated May 10, 2024 • 14.4M • 743

  • Seed Voice Conversion

    Space • Running on Zero • 428

    Convert voice to match another's style or tone


  • Supertone/supertonic

    Text-to-Speech • Updated Dec 10, 2025 • 4.17k • 457

  • maya-research/maya1

    Text-to-Speech • 3B • Updated Nov 12, 2025 • 47.1k • 843

  • hexgrad/Kokoro-82M

    Text-to-Speech • Updated Apr 10, 2025 • 1.73M • 5.54k