Juan CM

jucamohedano

2 28 44

AI & ML interests

AI Systems MSc at Trento 🚀🤖

Recent Activity

updated a model about 1 month ago

jucamohedano/qwen2.5_vl_7b_oxford_pets_grpo_lora

updated a model about 1 month ago

jucamohedano/qwen2.5_vl_7b_oxford_pets_grpo_lora_more_rollouts

published a model about 1 month ago

jucamohedano/qwen2.5_vl_7b_oxford_pets_grpo_lora_more_rollouts

View all activity

Organizations

upvoted 2 articles 3 months ago

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

drbh, danieldk

•

Aug 18, 2025

• 105

Article

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Weyaxi

•

Jan 2

• 23

upvoted a changelog 4 months ago

Hugging Face Changelog

Hugging Face Papers for AI Agents

Mar 18

• 143

upvoted 3 articles 9 months ago

Article

Vision Language Model Alignment in TRL ⚡️

sergiopaniego, merve, qgallouedec, kashif, ariG23498

•

Aug 7, 2025

• 112

Article

KV Cache from scratch in nanoVLM

ariG23498, kashif, lusxvr, andito, pcuenq

•

Jun 4, 2025

• 120

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 613

upvoted an article 10 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

ariG23498, lusxvr, andito, sergiopaniego, merve, pcuenq, reach-vb

•

May 21, 2025

• 262

upvoted a collection about 1 year ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 253

upvoted 2 papers about 1 year ago

Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights

Paper • 2502.09619 • Published Feb 13, 2025 • 36

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Paper • 2505.22453 • Published May 28, 2025 • 46

upvoted a collection over 1 year ago

🤖 Agents

Collection

21 items • Updated Dec 31, 2024 • 174

upvoted an article over 1 year ago

Article

Introducing smolagents: simple agents that write actions in code.

m-ric, merve, thomwolf

•

Dec 31, 2024

• 1.2k

upvoted a paper over 1 year ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4, 2025 • 261

upvoted 2 articles over 1 year ago

Article

Open-source DeepResearch – Freeing our search agents

m-ric, albertvillanova, merve, thomwolf, clefourrier

•

Feb 4, 2025

• 1.32k

Article

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

andito, mfarre, merve

•

Jan 23, 2025

• 192

upvoted 2 articles about 2 years ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

merve, andsteing, pcuenq

•

May 14, 2024

• 288

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

AviSoori1x

•

Jun 23, 2024

• 40

upvoted a collection about 2 years ago

[lecture artifacts] aligning open language models

Collection

artifacts referenced in the talk timeline! Slides: https://docs.google.com/presentation/d/1quMyI4BAx4rvcDfk8jjv063bmHg4RxZd9mhQloXpMn0/edit?usp=sharin • 63 items • Updated Apr 17, 2024 • 58

upvoted 2 articles about 2 years ago

Article

Fine-tuning a large language model on Kaggle Notebooks (or even on your own computer) for solving real-world tasks

lmassaron

•

Feb 21, 2024

• 19

Article

Design choices for Vision Language Models in 2024

gigant

•

Apr 16, 2024

• 35

Juan CM

AI & ML interests

Recent Activity

Organizations

jucamohedano's activity

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

The Engineering Handbook for GRPO + LoRA with Verl: Training Qwen2.5 on Multi-GPU

Hugging Face Papers for AI Agents

Vision Language Model Alignment in TRL ⚡️

KV Cache from scratch in nanoVLM

Vision Language Models (Better, faster, stronger)

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Introducing smolagents: simple agents that write actions in code.

Open-source DeepResearch – Freeing our search agents

SmolVLM Grows Smaller – Introducing the 256M & 500M Models!

PaliGemma – Google's Cutting-Edge Open Vision Language Model

SeeMoE: Implementing a MoE Vision Language Model from Scratch

Fine-tuning a large language model on Kaggle Notebooks (or even on your own computer) for solving real-world tasks

Design choices for Vision Language Models in 2024