Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator • arXiv:2604.08121 • Published 7 days ago • 42 upvotes
Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance • arXiv:2603.02175 • Published Mar 2 • 24 upvotes
Beyond Language Modeling: An Exploration of Multimodal Pretraining • arXiv:2603.03276 • Published Mar 3 • 103 upvotes
PaperBanana: Automating Academic Illustration for AI Scientists • arXiv:2601.23265 • Published Jan 30 • 223 upvotes
LongVie 2: Multimodal Controllable Ultra-Long Video World Model • arXiv:2512.13604 • Published Dec 15, 2025 • 76 upvotes
ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation • arXiv:2511.01163 • Published Nov 3, 2025 • 32 upvotes
BLIP3o-NEXT: Next Frontier of Native Image Generation • arXiv:2510.15857 • Published Oct 17, 2025 • 25 upvotes
Rolling Forcing: Autoregressive Long Video Diffusion in Real Time • arXiv:2509.25161 • Published Sep 29, 2025 • 26 upvotes
Seedream 4.0: Toward Next-generation Multimodal Image Generation • arXiv:2509.20427 • Published Sep 24, 2025 • 84 upvotes
Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression • arXiv:2506.09482 • Published Jun 11, 2025 • 45 upvotes
BLIP3-o: A Family of Fully Open Unified Multimodal Models — Architecture, Training and Dataset • arXiv:2505.09568 • Published May 14, 2025 • 99 upvotes
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT • arXiv:2505.00703 • Published May 1, 2025 • 44 upvotes