🏗️ Building on HF

5 35 31

Andrea Gemelli

andreagemelli

https://www.andreagemelli.me

AI & ML interests

Natural Language Processing, Computer Vision, Generative Models, Document Analysis

Recent Activity

upvoted a paper about 6 hours ago

Where Does the Signal Live? A Web Data Recipe for Medical Encoder Pretraining

upvoted a collection 7 days ago

GLM-5.2

upvoted an article 21 days ago

🪆 Introduction to Matryoshka Embedding Models

View all activity

Organizations

None yet

upvoted a paper about 6 hours ago

Where Does the Signal Live? A Web Data Recipe for Medical Encoder Pretraining

Paper • 2606.22079 • Published 6 days ago • 2

upvoted a collection 7 days ago

GLM-5.2

Collection

2 items • Updated 9 days ago • 46

upvoted an article 21 days ago

Article

🪆 Introduction to Matryoshka Embedding Models

tomaarsen, Xenova, osanseviero

•

Feb 23, 2024

• 211

upvoted an article 3 months ago

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate

•

Dec 4, 2025

• 630

upvoted an article 7 months ago

Article

Vision Language Model Alignment in TRL ⚡️

sergiopaniego, merve, qgallouedec, kashif, ariG23498

•

Aug 7, 2025

• 112

upvoted 2 papers 7 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 73

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

Paper • 2401.08417 • Published Jan 16, 2024 • 37

upvoted 2 articles 8 months ago

Article

Supercharge your OCR Pipelines with Open Models

merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq

•

Oct 21, 2025

• 315

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

not-lain

•

Jan 30, 2025

• 351

upvoted a collection 8 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 747

upvoted an article 8 months ago

Article

Preference Optimization for Vision Language Models

qgallouedec, vwxyzjn, merve, kashif

•

Jul 10, 2024

• 93

upvoted a collection 9 months ago

Holo1.5

Collection

Holo1.5 - Open Foundation Models for Computer Use Agents • 5 items • Updated Sep 15, 2025 • 35

upvoted an article 12 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 780

upvoted a paper 12 months ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

Paper • 2507.01955 • Published Jul 2, 2025 • 36

upvoted a collection about 1 year ago

Qwen3

Collection

84 items • Updated Dec 31, 2025 • 1.82k

upvoted a paper about 1 year ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 209

upvoted an article about 1 year ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova

•

Feb 20, 2025

• 343

upvoted an article over 1 year ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

ariG23498, merve, pcuenq, reach-vb

•

Mar 12, 2025

• 497

upvoted 2 collections over 1 year ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 10 items • Updated Mar 2 • 566

Comics Understanding

Collection

5 items • Updated Mar 14, 2025 • 4

Andrea Gemelli

AI & ML interests

Recent Activity

Organizations

andreagemelli's activity

🪆 Introduction to Matryoshka Embedding Models

We Got Claude to Fine-Tune an Open Source LLM

Vision Language Model Alignment in TRL ⚡️

Supercharge your OCR Pipelines with Open Models

KV Caching Explained: Optimizing Transformer Inference Efficiency

Preference Optimization for Vision Language Models

SmolLM3: smol, multilingual, long-context reasoner

SmolVLM2: Bringing Video Understanding to Every Device

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM