view article Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 614
🇹🇼 Taiwan-Bench Collection Evaluation dataset in Traditional Chinese. • 3 items • Updated Apr 26, 2025 • 1
OpenGVLab/InternVL3_5-GPT-OSS-20B-A4B-Preview Image-Text-to-Text • 0.4B • Updated Aug 29, 2025 • 94.1k • 82
view article Article Finally, a Replacement for BERT: Introducing ModernBERT +13 bwarner, NohTow, bclavie, orionweller, ohallstrom, staghado, alexisgallagher, rbiswasfc, fladhak, tomaarsen, ncoop57, griffin, jph00, johnowhitaker, iacolippo • Dec 19, 2024 • 748
view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) ariG23498 • Jan 19, 2025 • 53
view article Article Finetune Stable Diffusion Models with DDPO via TRL +2 metric-space, sayakpaul, kashif, lvwerra • Sep 29, 2023 • 20