Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deepakkumar07 's Collections
vision-llm
tamil-dataset
document-parser
text-to-speech
voice-to-text
Transformers model
csv-dataset

voice-to-text

updated Apr 26, 2025
Upvote
-

  • Running on CPU Upgrade
    Agents
    12

    Talk to Claude

    👨
    12

    Talk to Anthropic's Claude


  • Running on Zero
    Agents
    Featured
    413

    Zonos

    🌍
    413

    Generate expressive speech audio from text with custom voice


  • Running on Zero
    Agents
    Featured
    475

    MeloTTS

    🗣
    475

    Fast, efficient, & multilingual text-to-speech


  • NexaAI/OmniAudio-2.6B

    Audio-Text-to-Text • 3B • Updated Dec 13, 2024 • 748 • 288

  • gpt-omni/VoiceAssistant-400K

    Viewer • Updated Sep 13, 2024 • 470k • 1.25k • 97

  • Runtime error
    Agents

    LLM Voice Chat

    💻

    Talk to an LLM with ElevenLabs


  • Running on CPU Upgrade
    Agents
    26

    Talk to Gemini

    ♊
    26

    Talk to Gemini using Google's multimodal API


  • ai4bharat/Rasa

    Viewer • Updated Dec 5, 2025 • 995k • 3.89k • 37
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs