makisekurisu-jp's Collections

TTS

updated Mar 5

  • IAHispano/Applio

    Audio-to-Audio • Updated Feb 25 • 84 • 164

  • Kijai/MelBandRoFormer_comfy

    Updated Aug 23, 2025 • 109k • 33

  • openai/whisper-large-v3

    Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 4.84M • 5.61k

  • microsoft/VibeVoice-ASR

    Automatic Speech Recognition • 9B • Updated Jan 27 • 731k • 1.05k

  • microsoft/VibeVoice-1.5B

    Text-to-Speech • 3B • Updated Jan 22 • 146k • 2.33k

  • Qwen/Qwen3-ASR-1.7B

    Automatic Speech Recognition • 2B • Updated Jan 30 • 1.75M • 756

  • Qwen/Qwen3-TTS-12Hz-1.7B-Base

    Updated Jan 23 • 1.41M • 376

  • Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

    Text-to-Speech • 2B • Updated Jan 29 • 1.57M • 1.43k

  • Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign

    Text-to-Speech • 2B • Updated Jan 29 • 532k • 330

  • Soul-AILab/SoulX-Podcast-1.7B

    Text-to-Speech • Updated Dec 18, 2025 • 236 • 231

  • Soul-AILab/SoulX-Singer

    Text-to-Speech • Updated Mar 13 • 732 • 148

  • IndexTeam/IndexTTS-2

    Text-to-Speech • Updated Jan 20 • 17.6k • 689

  • HeartMuLa/HeartMuLa-RL-oss-3B-20260123

    4B • Updated Jan 23 • 493 • 25

  • HeartMuLa/HeartCodec-oss-20260123

    2B • Updated Jan 23 • 3.94k • 8

  • ACE-Step/Ace-Step1.5

    Text-to-Audio • Updated Feb 3 • 51.1k • 722