Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
MegaTronX 's Collections
Images
Datasets
Ancient Language / Philology
Audio & Voice
Search
Music
Leaderboards
Prompt Datasets
Text Generation Models
OpenCoder Dataset
Image Prompts
Small Code Models
Image Generators

Audio & Voice

updated 18 days ago
Upvote
-

  • HKUSTAudio/Llasa_opensource_speech_data_160k_hours_tokenized

    Updated Feb 13, 2025 • 83 • 30

  • nari-labs/Dia-1.6B

    Text-to-Speech • Updated Jun 1, 2025 • 73k • • 2.83k

  • Running
    39

    Whisper-WebUI

    🚀
    39

    Generate subtitles and translate audio files


  • Runtime error
    2

    Nllb Translation

    📈
    2

    Translate text between multiple languages


  • facebook/nllb-200-3.3B

    Translation • Updated Feb 11, 2023 • 39.3k • 401

  • Running on Zero
    161

    NLLB

    🌐
    161

    Translate text between 200 languages


  • Running
    13

    MMS

    🌍
    13

    Transcribe audio to text in multiple languages


  • Running on L4
    Featured
    721

    StyleTTS 2

    🗣
    721

    Efficient, fast, and natural text to speech with StyleTTS 2!


  • Running
    Featured
    48

    MOSS Transcribe Diarize

    🏢
    48

    Transcribe audio/video files with speaker identification

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs