Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
aikongfu 's Collections
embedding benchmark
AI agent
LLM
speech recognition
AI Coding
Computer Vision(Text to Image)
Text to Audio
Multimodal
Audio to text
Datasets
Text to Video
image to video

speech recognition

updated Nov 21, 2024
Upvote
-

  • Running on Zero
    Agents
    Featured
    2.78k

    Whisper

    📉
    2.78k

    Transcribe audio files into text instantly


  • Running on Zero
    Agents
    Featured
    369

    Video Transcription Smart Summary

    ⚡
    369

    Generate transcription and summary from uploaded videos


  • Running on Zero
    MCP
    Featured
    847

    Whisper Large V3

    🤫
    847

    Transcribe audio or YouTube videos to text


  • Paused
    Agents
    855

    Video Dubbing (SoniTranslate)

    🌍
    855

    Video Dubbing with Open Source Projects


  • Build error
    Agents
    275

    Faster Whisper Webui

    🚀
    275

    Transcribe audio to text with speaker diarization

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs