SIN-Bench: Tracing Native Evidence Chains in Long-Context Multimodal Scientific Interleaved Literature Paper • 2601.10108 • Published 11 days ago • 7
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published Dec 9, 2025 • 132
EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning Paper • 2509.20360 • Published Sep 24, 2025 • 18
DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation Paper • 2307.01831 • Published Jul 4, 2023 • 8
Mask-Attention-Free Transformer for 3D Instance Segmentation Paper • 2309.01692 • Published Sep 4, 2023 • 1
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Paper • 2403.18814 • Published Mar 27, 2024 • 47
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning Paper • 2507.12841 • Published Jul 17, 2025 • 42
Training-Free Efficient Video Generation via Dynamic Token Carving Paper • 2505.16864 • Published May 22, 2025 • 24
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published Mar 26, 2025 • 57
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey Paper • 2503.12605 • Published Mar 16, 2025 • 35
IterPref: Focal Preference Learning for Code Generation via Iterative Debugging Paper • 2503.02783 • Published Mar 4, 2025 • 7
Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code Paper • 2310.01506 • Published Oct 2, 2023
RL-GPT: Integrating Reinforcement Learning and Code-as-policy Paper • 2402.19299 • Published Feb 29, 2024 • 2
Multi-modal Cooking Workflow Construction for Food Recipes Paper • 2008.09151 • Published Aug 20, 2020 • 1
Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers Paper • 2501.03931 • Published Jan 7, 2025 • 15