matyunin's Collections

audio

updated 16 days ago

  • Multimodal Latent Language Modeling with Next-Token Diffusion

    Paper • 2412.08635 • Published Dec 11, 2024 • 49

  • CohereLabs/cohere-transcribe-03-2026

    Automatic Speech Recognition • Updated 8 days ago • 199k • 877

  • microsoft/VibeVoice-ASR

    Automatic Speech Recognition • 9B • Updated Jan 27 • 717k • 1.03k

  • mistralai/Voxtral-Mini-4B-Realtime-2602

    Automatic Speech Recognition • 4B • Updated Mar 11 • 877k • 821

  • openai/whisper-large-v3

    Automatic Speech Recognition • 2B • Updated Aug 12, 2024 • 4.82M • 5.58k

  • openai/whisper-large-v3-turbo

    Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 6.5M • 2.94k

  • pyannote/speaker-diarization-3.1

    Automatic Speech Recognition • Updated May 10, 2024 • 10.4M • 1.75k