Dian Zheng PRO

zhengli1013

https://zhengdian1.github.io/

zhengdian1

AI & ML interests

generative model

Recent Activity

upvoted a paper 6 days ago

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

updated a model 18 days ago

InterleaveThinker/Critic-SFT-8B

updated a model 18 days ago

InterleaveThinker/InterleaveThinker-Critic-8B

upvoted a paper 6 days ago

Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning

Paper • 2605.21487 • Published 7 days ago • 21

upvoted 3 papers about 2 months ago

upvoted a paper 2 months ago

MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data

Paper • 2603.25319 • Published Mar 26 • 32

upvoted a paper 3 months ago

HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising

Paper • 2603.08703 • Published Mar 9 • 32

upvoted an article 3 months ago

NEO-unify: Building Native Multimodal Unified Models End to End

Article • sensenova • Mar 5 • 163

upvoted a paper 3 months ago

VLANeXt: Recipes for Building Strong VLA Models

Paper • 2602.18532 • Published Feb 20 • 52

upvoted 2 papers 5 months ago

ProEdit: Inversion-based Editing From Prompts Done Right

Paper • 2512.22118 • Published Dec 26, 2025 • 19

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Paper • 2512.13604 • Published Dec 15, 2025 • 76

upvoted 6 papers 6 months ago

OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation

Paper • 2512.08294 • Published Dec 9, 2025 • 18

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published Dec 5, 2025 • 38

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

Paper • 2512.03000 • Published Dec 2, 2025 • 37

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published Dec 2, 2025 • 34

Panorama Generation From NFoV Image Done Right

Paper • 2503.18420 • Published Mar 24, 2025 • 1

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published Nov 27, 2025 • 29

upvoted 2 papers 7 months ago

Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Paper • 2510.18632 • Published Oct 21, 2025 • 23

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Paper • 2510.13759 • Published Oct 15, 2025 • 11

upvoted a paper 10 months ago

LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation

Paper • 2508.03694 • Published Aug 5, 2025 • 53

upvoted a paper about 1 year ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published Mar 27, 2025 • 33
