12 9 1

Bruno Hays

BrunoHays

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago

BrunoHays/muscat-merged-samples

published a dataset 3 days ago

BrunoHays/muscat-merged-samples

upvoted an article 19 days ago

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

View all activity

Organizations

upvoted an article 19 days ago

Article

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

nvidia

•

24 days ago

• 66

upvoted 2 articles 12 months ago

Article

Introducing ColQwen-Omni: Retrieve in every modality

manu

•

Jul 17, 2025

• 77

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 780

upvoted a paper 12 months ago

Should We Still Pretrain Encoders with Masked Language Modeling?

Paper • 2507.00994 • Published Jul 1, 2025 • 81

upvoted a paper over 1 year ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26, 2025 • 173

upvoted 2 articles over 1 year ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

eliebak, lvwerra, lewtun

•

Jan 28, 2025

• 889

Article

You could have designed state of the art positional encoding

FL33TW00D-HF

•

Nov 25, 2024

• 488

upvoted a paper over 1 year ago

Enhancing Training Efficiency Using Packing with Flash Attention

Paper • 2407.09105 • Published Jul 12, 2024 • 17

upvoted a paper almost 2 years ago

GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering

Paper • 2409.06595 • Published Sep 10, 2024 • 38

Bruno Hays

AI & ML interests

Recent Activity

Organizations

BrunoHays's activity

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

Introducing ColQwen-Omni: Retrieve in every modality

SmolLM3: smol, multilingual, long-context reasoner

Open-R1: a fully open reproduction of DeepSeek-R1

You could have designed state of the art positional encoding