Deepseek-VL Collection DeepSeek-VL: Towards Real-World Vision-Language Understanding • 5 items • Updated Jun 14, 2025 • 1
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published Jan 1 • 133
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction Paper • 2512.25073 • Published Dec 31, 2025 • 42
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community +1 Leyo, HugoLaurencon, VictorSanh • Apr 15, 2024 • 191
view article Article Using LoRA for Efficient Stable Diffusion Fine-Tuning pcuenq, sayakpaul • Jan 26, 2023 • 83
view article Article LoRA training scripts of the world, unite! linoyts, multimodalart • Jan 2, 2024 • 79
TripoSR: Fast 3D Object Reconstruction from a Single Image Paper • 2403.02151 • Published Mar 4, 2024 • 16
Running on Zero Agents Featured 520 FLUX LoRa Lab 🧪 520 Generate custom images using mixed FLUX LoRA adapters
Running Featured 603 Image Arena Leaderboard 📊 603 Image Generation and Image Editing Arena & Leaderboard
Running on CPU Upgrade Agents 10.1k Kolors Virtual Try-On 👕 10.1k Generate virtual try‑on images of a person wearing a chosen garment
view article Article SDXL in 4 steps with Latent Consistency LoRAs +5 pcuenq, valhalla, SimianLuo, dg845, tyq1024, sayakpaul, multimodalart • Nov 9, 2023 • 16
VideoBooth: Diffusion-based Video Generation with Image Prompts Paper • 2312.00777 • Published Dec 1, 2023 • 24
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features Paper • 2311.04391 • Published Nov 7, 2023 • 14