Kunal Dhawan

kunaldhawan
nvidia
https://kunal-dhawan.weebly.com/
  • KunalDhawan

AI & ML interests

Conversational AI, NLP, Multimodal Machine Learning

Recent Activity

commented on an article 2 days ago
Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR
updated a model 3 days ago
nvidia/nemotron-speech-streaming-en-0.6b
commented on an article 7 days ago
Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

Organizations

  • NVIDIA
  • Dynamic-SUPERB
  • ASR-LLM Group: Generative Error Correction

upvoted a collection 17 days ago

Nemotron Speech

Collection
Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 9 items • Updated 3 days ago • 36
upvoted an article 27 days ago
Article

Scaling Real-Time Voice Agents with Cache-Aware Streaming ASR

27 days ago • 75
upvoted 2 papers 8 months ago

Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer

Paper • 2306.08753 • Published Jun 14, 2023 • 2

Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations

Paper • 2407.03495 • Published Jul 3, 2024 • 1