Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
falamarcao 's Collections
segmentation
Testing 1.. 2… 3…
AI Agent
Veterinary
On Device (local)
Start Here
omni models (text, image, audio, video)
Speech related
Web GPU
Software Engineering
Tracker
Speech-to-speech
MCP Servers
computer-use
Speech-to-text
Index-embed
3D
Code
Object Detection
Safety
Parser
Multimodal
Specialized
OCR
Video
Image
Audio
LLM
Text-to-speech

Speech-to-text

updated Jan 18
Upvote
-

  • nvidia/parakeet-tdt-0.6b-v2

    Automatic Speech Recognition • Updated Apr 13 • 376k • 1.49k

  • Running on Zero
    Agents
    Featured
    474

    Parakeet-TDT-0.6b-V2

     
    474

    Transcribe audio files with timestamps and downloadable subtitles


  • Runtime error
    Agents
    33

    Blazing Fast Whisper

    👁
    33

    Blazing Fast Whisper Deployed on HF Inference Endpoints


  • Running on CPU Upgrade
    Agents
    Featured
    1.37k

    Open ASR Leaderboard

    🏆
    1.37k

    Compare speech-to-text models using benchmark scores


  • LiquidAI/LFM2.5-Audio-1.5B

    Audio-to-Audio • 1B • Updated Mar 30 • 1.42k • 417
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs