Deepseek-VL Collection DeepSeek-VL: Towards Real-World Vision-Language Understanding • 5 items • Updated Jun 14, 2025 • 1
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos Paper • 2601.00393 • Published Jan 1 • 133
GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction Paper • 2512.25073 • Published Dec 31, 2025 • 42
Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community • Leyo, HugoLaurencon, VictorSanh • Apr 15, 2024 • 191
Article Using LoRA for Efficient Stable Diffusion Fine-Tuning • pcuenq, sayakpaul • Jan 26, 2023 • 82
Article LoRA training scripts of the world, unite! • linoyts, multimodalart • Jan 2, 2024 • 79
TripoSR: Fast 3D Object Reconstruction from a Single Image Paper • 2403.02151 • Published Mar 4, 2024 • 16
Article SDXL in 4 steps with Latent Consistency LoRAs • pcuenq, valhalla, SimianLuo, dg845, tyq1024, sayakpaul, multimodalart • Nov 9, 2023 • 15
VideoBooth: Diffusion-based Video Generation with Image Prompts Paper • 2312.00777 • Published Dec 1, 2023 • 24
3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features Paper • 2311.04391 • Published Nov 7, 2023 • 14
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery Paper • 2310.18356 • Published Oct 24, 2023 • 24
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation Paper • 2310.19512 • Published Oct 30, 2023 • 16