DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 β’ 15 items β’ Updated Mar 10 β’ 630
view article Article Vision Language Model Alignment in TRL β‘οΈ +3 sergiopaniego, merve, qgallouedec, kashif, ariG23498 β’ Aug 7, 2025 β’ 111
view article Article Finetune Stable Diffusion Models with DDPO via TRL +2 metric-space, sayakpaul, kashif, lvwerra β’ Sep 29, 2023 β’ 20
view article Article Preference Optimization for Vision Language Models +2 qgallouedec, vwxyzjn, merve, kashif β’ Jul 10, 2024 β’ 93
view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face dvgodoy β’ Feb 11, 2025 β’ 123
view article Article From GRPO to DAPO and GSPO: What, Why, and How NormalUhr β’ Aug 9, 2025 β’ 118
view article Article How to Choose the Best Open Source LLM for Your Project in 2025 dvilasuero β’ Sep 9, 2025 β’ 77
view article Article KV Cache from scratch in nanoVLM +3 ariG23498, kashif, lusxvr, andito, pcuenq β’ Jun 4, 2025 β’ 119
Direct Preference Optimization Datasets Collection Datasets suitable for DPO based on having 'chosen', 'rejected', and 'prompt' columns. Created using librarian-bots/dataset-column-search-api β’ 4011 items β’ Updated Apr 2 β’ 7
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 natolambert, LouisCastricato, lvwerra, Dahoas β’ Dec 9, 2022 β’ 411
view article Article Introduction to Quantization cooked in π€ with ππ§βπ³ merve β’ Aug 25, 2023 β’ 39
view article Article State of open video generation models in Diffusers +1 sayakpaul, a-r-r-o-w, dn6 β’ Jan 27, 2025 β’ 70
Awesome Document AI Collection A collection of open-source document AI π π π β’ 27 items β’ Updated Mar 11, 2024 β’ 80