tardigrade-doc's Collections

text-to-speech

updated Apr 17

  • Supertone/supertonic

    Text-to-Speech • Updated Dec 10, 2025 • 1.26k • 481

    Note: English only; small, with better voice quality than hexgrad/Kokoro-82M


  • hexgrad/Kokoro-82M

    Text-to-Speech • Updated Apr 10, 2025 • 13.8M • 6.27k

  • fishaudio/s1-mini

    Text-to-Speech • Updated Feb 6 • 3.48k • 650

    Note: multilingual, including Chinese


  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated Jan 22 • 56.8k • 2.39k

    Note: long-form speech, 90+ min


  • OuteAI/Llama-OuteTTS-1.0-1B

    Text-to-Speech • 1B • Updated Sep 8, 2025 • 4.86k • 241

  • nineninesix/kani-tts-400m-zh

    Text-to-Speech • 0.4B • Updated Feb 18 • 585 • 1

  • microsoft/VibeVoice-Realtime-0.5B

    Text-to-Speech • 1B • Updated Dec 12, 2025 • 816k • 1.23k

  • OpenMOSS-Team/MOSS-TTSD-v1.0

    Text-to-Speech • 8B • Updated Feb 14 • 5.99k • 58

  • onnx-community/Supertonic-TTS-2-ONNX

    Text-to-Speech • Updated Jan 20 • 1.03k • 8

  • meituan-longcat/LongCat-AudioDiT-3.5B

    4B • Updated Apr 3 • 729 • 73

  • OpenMOSS-Team/MOSS-TTS-Nano-100M

    Text-to-Speech • Updated Apr 13 • 150k • 213

  • tencent/HY-World-2.0

    Image-to-3D • Updated 15 days ago • 3.62k • 665