biki96's Collections

STT

updated Feb 5

  • 🏆 Open ASR Leaderboard

    Running on CPU Upgrade • Agents • Featured • 1.33k

    Explore and compare speech recognition model benchmarks


  • nvidia/canary-qwen-2.5b

    Automatic Speech Recognition • 3B • Updated 17 days ago • 108k • 422

  • nvidia/parakeet-tdt-0.6b-v3

    Automatic Speech Recognition • 0.6B • Updated 22 days ago • 428k • 826

  • nvidia/parakeet-tdt-0.6b-v2

    Automatic Speech Recognition • Updated 25 days ago • 167k • 1.47k

  • stabilityai/stable-video-diffusion-img2vid

    Image-to-Video • Updated Jul 10, 2024 • 50.6k • 1.03k

  • LiquidAI/LFM2-Audio-1.5B

    Audio-to-Audio • 1B • Updated Mar 27 • 277 • 346

  • mistralai/Voxtral-Mini-4B-Realtime-2602

    Automatic Speech Recognition • 4B • Updated Mar 11 • 1.16M • 840