ndamulelonemakh's Collections

Open Text 2 Speech

updated May 18, 2025

  • F5-TTS 🗣

    Running on Zero • Agents • Featured • 2.87k

    F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)


  • nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

    Text Generation • 71B • Updated Apr 13, 2025 • 14.4k • 2.07k

  • openai/whisper-large-v3-turbo

    Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 8.57M • 3.07k

  • Whisper Turbo 🤯

    Running on Zero • Agents • 1.02k

    Transcribe audio or YouTube videos into text


  • Whisper 📉

    Running on Zero • Agents • Featured • 2.77k

    Transcribe audio files into text instantly


  • hexgrad/Kokoro-82M

    Text-to-Speech • Updated Apr 10, 2025 • 14.1M • 6.28k

  • nvidia/parakeet-tdt-0.6b-v2

    Automatic Speech Recognition • Updated Apr 13 • 376k • 1.49k