kuanlu's Collections

Text-to-Audio

updated 2 days ago

  • stabilityai/stable-audio-open-1.0

    Text-to-Audio • Updated Jun 19, 2025 • 21.2k • 1.47k

  • google/magenta-realtime

    Updated Aug 29, 2025 • 400 • 547

  • Zyphra/Zonos-v0.1-hybrid

    Text-to-Speech • Updated Jun 3, 2025 • 2.5k • 1.11k

  • ACE-Step/ACE-Step-v1-3.5B

    Text-to-Audio • Updated May 22, 2025 • 732

  • FabioSarracino/VibeVoice-Large-Q8

    Text-to-Audio • 9B • Updated Oct 1, 2025 • 975 • 97

  • DevParker/VibeVoice1.5b_ELSIE_4BIT

    3B • Updated Sep 25, 2025 • 2

  • ResembleAI/chatterbox-turbo

    Text-to-Speech • Updated Dec 15, 2025 • 648

  • LiquidAI/LFM2.5-Audio-1.5B

    Audio-to-Audio • 1B • Updated Mar 30 • 861 • 400

  • ElectricAlexis/NotaGen

    Updated Feb 26, 2025 • 153

  • ASLP-lab/YingMusic-Singer-Plus

    Updated Apr 9 • 1.94k • 7

  • ASLP-lab/DiffRhythm2

    Updated Nov 9, 2025 • 1.57k • 45

  • m-a-p/YuE-s2-1B-general

    Text Generation • 2B • Updated Mar 12, 2025 • 5.85k • 60