Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sonal Kumar's picture
7 3

Sonal Kumar

sonalkum
John6666's profile picture Av1nash's profile picture frascuchon's profile picture
·

AI & ML interests

None yet

Organizations

ZeroGPU Explorers's profile picture JSALT25-AuGI's profile picture Gamma Lab's profile picture University of Maryland's profile picture

upvoted a paper 3 months ago

Visual Spatial Tuning

Paper • 2511.05491 • Published Nov 7, 2025 • 52
upvoted a paper 5 months ago

MMAU-Pro: A Challenging and Comprehensive Benchmark for Holistic Evaluation of Audio General Intelligence

Paper • 2508.13992 • Published Aug 19, 2025 • 7
upvoted a paper 11 months ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published Mar 6, 2025 • 26
upvoted a paper about 1 year ago

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 23
upvoted 3 papers over 1 year ago

Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation

Paper • 2410.13198 • Published Oct 17, 2024 • 9

ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds

Paper • 2409.09213 • Published Sep 13, 2024 • 13

GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities

Paper • 2406.11768 • Published Jun 17, 2024 • 24
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs