RASMD: RGB And SWIR Multispectral Driving Dataset for Robust Perception in Adverse Conditions Paper • 2504.07603 • Published Apr 10, 2025 • 2
Article Diffusers welcomes Stable Diffusion 3 +4 dn6, YiYiXu, sayakpaul, OzzyGT, kashif, multimodalart • Jun 12, 2024 • 99
Article The Falcon has landed in the Hugging Face ecosystem +6 lvwerra, ybelkada, smangrul, lewtun, olivierdehaene, pcuenq, philschmid, osanseviero • Jun 5, 2023 • 17
Article makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch AviSoori1x • May 7, 2024 • 121
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement Paper • 2402.07456 • Published Feb 12, 2024 • 46
Article Zero-shot image-to-text generation with BLIP-2 MariaK, JunnanLi • Feb 15, 2023 • 28
Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community +1 Leyo, HugoLaurencon, VictorSanh • Apr 15, 2024 • 191
Article Fine tuning CLIP with Remote Sensing (Satellite) images and captions +4 arampacha, devv, goutham794, cataluna84, ritog, sujitpal • Oct 13, 2021 • 8
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models Paper • 2402.17177 • Published Feb 27, 2024 • 87