Anh Duy Le

duycse1603

15 30

AI & ML interests

None yet

Recent Activity

updated a model 8 days ago

duycse1603/MyModel

published a model 10 days ago

duycse1603/MyModel

liked a Space 23 days ago

numind/NuExtract3

Organizations

None yet

upvoted a collection 9 months ago

DINOv3

Collection

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 15 items • Updated Mar 10 • 678

upvoted 3 articles 9 months ago

Article

Vision Language Model Alignment in TRL ⚡️

sergiopaniego, merve, qgallouedec, kashif, ariG23498 • Aug 7, 2025 • 112

Article

Finetune Stable Diffusion Models with DDPO via TRL

metric-space, sayakpaul, kashif, lvwerra • Sep 29, 2023 • 20

Article

Preference Optimization for Vision Language Models

qgallouedec, vwxyzjn, merve, kashif • Jul 10, 2024 • 93

upvoted 5 articles 10 months ago

Article

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

dvgodoy • Feb 11, 2025 • 125

Article

Decoding Strategies in Large Language Models

mlabonne • Oct 29, 2024 • 114

Article

From GRPO to DAPO and GSPO: What, Why, and How

NormalUhr • Aug 9, 2025 • 128

Article

How to Choose the Best Open Source LLM for Your Project in 2025

dvilasuero • Sep 9, 2025 • 78

Article

KV Cache from scratch in nanoVLM

ariG23498, kashif, lusxvr, andito, pcuenq • Jun 4, 2025 • 120

upvoted a collection 10 months ago

Direct Preference Optimization Datasets

Collection

Datasets suitable for DPO based on having 'chosen', 'rejected', and 'prompt' columns. Created using librarian-bots/dataset-column-search-api • 3985 items • Updated 6 days ago • 8

upvoted an article 10 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

natolambert, LouisCastricato, lvwerra, Dahoas • Dec 9, 2022 • 418

upvoted an article 11 months ago

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

merve • Aug 25, 2023 • 40

upvoted 2 articles over 1 year ago

Article

State of open video generation models in Diffusers

sayakpaul, a-r-r-o-w, dn6 • Jan 27, 2025 • 71

Article

Introduction to 3D Gaussian Splatting

dylanebert • Sep 18, 2023 • 140

upvoted a collection almost 2 years ago

Awesome Document AI

Collection

A collection of open-source document AI 📄 📝 📈 • 27 items • Updated Mar 11, 2024 • 80
