Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Bk9x 's Collections
Data_Pretrain_NLP
Dataset_NLP
Small LM
Dataset_voice
Embedding
Automatic Speech Recognition
SDXL
TTS
LLM
model_NLP
VLM + OCR

Automatic Speech Recognition

updated Apr 7
Upvote
-

  • openai/whisper-large-v3-turbo

    Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.29M • • 3.01k

  • nguyendv02/ViMD_Dataset

    Viewer • Updated Jan 28 • 19k • 1.51k • 18

  • Running
    Agents
    46

    Automatic Speech Recognition

    🌍
    46

    Transcribe uploaded, recorded, or online audio to text

    Note Speech recognition serving sherpa onnx-zipformer-vi-int8 cpu


  • Qwen/Qwen3-ASR-1.7B

    Automatic Speech Recognition • 2B • Updated Jan 30 • 2.04M • 808

  • Running
    Agents
    17

    Automatic Speech Recognition

    🌍
    17

    Transcribe speech from audio files, mic or URL to text


  • g-group-ai-lab/gipformer-65M-rnnt

    Automatic Speech Recognition • Updated Mar 25 • 90 • 25

    Note gipformer-65M-rnnt

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs