Eric Bezzam's picture

Eric Bezzam PRO

bezzam

huggingface

·

AI & ML interests

speech, audio, imaging

Recent Activity

liked a dataset about 3 hours ago

apptek-com/apptek_callcenter_dialogues

updated a model about 4 hours ago

bezzam/Qwen3-ForcedAligner-0.6B

updated a model about 4 hours ago

bezzam/Qwen3-ASR-1.7B

View all activity

Organizations

upvoted a collection 1 day ago

Nemotron Speech

Open, state-of-the-art, production‑ready enterprise speech models from the NVIDIA Speech research team for ASR, TTS, Speaker Diarization and S2S • 12 items • Updated 18 days ago • 51

upvoted an article 2 days ago

Article

Adding Benchmaxxer Repellant to the Open ASR Leaderboard

+9

3 days ago

•

10

upvoted an article 5 days ago

Article

ML Intern Takes Our Post-Training Internship Test

15 days ago

•

30

upvoted an article 11 days ago

Article

Safetensors is Joining the PyTorch Foundation

about 1 month ago

•

37

upvoted a paper 14 days ago

Qwen3-ASR Technical Report

Paper • 2601.21337 • Published Jan 29 • 37

upvoted an article 15 days ago

Article

mlinter: a linter for Transformers modeling files

17 days ago

•

8

upvoted a collection 18 days ago

MOSS-Audio

An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 7 items • Updated 6 days ago • 55

upvoted 2 collections 22 days ago

Canary ASR/AST

A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 6 items • Updated 18 days ago • 34

Parakeet ASR

NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 16 items • Updated 18 days ago • 70

upvoted an article 22 days ago

Article

The PR you would have opened yourself

23 days ago

•

68

upvoted an article about 1 month ago

Article

Liberate your OpenClaw

+6

Mar 27

•

45

upvoted a collection about 1 month ago

Gemma 4

12 items • Updated 3 days ago • 768

upvoted 3 articles about 1 month ago

Article

TRL v1.0: Post-Training Library Built to Move with the Field

+2

Mar 31

•

51

Article

How I contributed a new model to the Transformers library using Codex

Mar 30

•

51

Article

Raw Robot Video to VLA-Ready Training Data: Annotating LeRobot Datasets with Nomadic and HuggingFace Buckets

Mar 21

•

17

upvoted an article about 2 months ago

Article

LLM based Audio models

Dec 18, 2025

•

58

upvoted a collection about 2 months ago

ALARM

Official checkpoints and data for "ALARM: Audio–Language Alignment for Reasoning Models" • 8 items • Updated Mar 9 • 1

upvoted an article about 2 months ago

Article

Introducing Storage Buckets on the Hugging Face Hub

+10

Mar 10

•

194

upvoted a paper 2 months ago

Music Flamingo: Scaling Music Understanding in Audio Language Models

Paper • 2511.10289 • Published Nov 13, 2025 • 19

upvoted an article 2 months ago

Article

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines

+2

Mar 5

•

51