Deepseek-VL Collection DeepSeek-VL: Towards Real-World Vision-Language Understanding • 5 items • Updated Jun 14, 2025 • 1
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published Jan 1 • 133
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction Paper • 2512.25073 • Published Dec 31, 2025 • 42
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community +1 Apr 15, 2024 • 191
TripoSR: Fast 3D Object Reconstruction from a Single Image Paper • 2403.02151 • Published Mar 4, 2024 • 16
Running on Zero Agents Featured 517 FLUX LoRa Lab 🧪 517 Generate images using custom LoRA styles with FLUX model
Running Featured 595 Image Arena Leaderboard 📊 595 Image Generation and Image Editing Arena & Leaderboard
Running on CPU Upgrade Agents 10.1k Kolors Virtual Try-On 👕 10.1k Generate a virtual try‑on image of a person wearing a garment
VideoBooth: Diffusion-based Video Generation with Image Prompts Paper • 2312.00777 • Published Dec 1, 2023 • 24
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features Paper • 2311.04391 • Published Nov 7, 2023 • 14