Wei Cheng's picture

Wei Cheng

wchengad

·

https://wchengad.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

ShutterMuse: Capture-Time Photography Guidance with MLLMs

upvoted a paper 1 day ago

ShutterMuse: Capture-Time Photography Guidance with MLLMs

upvoted a paper 3 days ago

LUCID: Learning Unified Control for Image Deflaring and Exposure Mastery in Nighttime Photography

View all activity

Organizations

None yet

upvoted a paper 1 day ago

ShutterMuse: Capture-Time Photography Guidance with MLLMs

Paper • 2606.25763 • Published 3 days ago • 38

upvoted a paper 3 days ago

LUCID: Learning Unified Control for Image Deflaring and Exposure Mastery in Nighttime Photography

Paper • 2606.06901 • Published 22 days ago • 4

upvoted a paper 8 days ago

FreeStyle: Free Control of Style-Content Dual-Reference Generation from Community LoRA Mining

Paper • 2606.20506 • Published 9 days ago • 28

upvoted a paper about 1 month ago

ControlLight: Towards Controllable, Consistent, and Generalizable Low-Light Enhancement

Paper • 2605.25569 • Published May 25 • 21

upvoted 4 papers 3 months ago

CutClaw: Agentic Hours-Long Video Editing via Music Synchronization

Paper • 2603.29664 • Published Mar 31 • 51

GEditBench v2: A Human-Aligned Benchmark for General Image Editing

Paper • 2603.28547 • Published Mar 30 • 32

RealRestorer: Towards Generalizable Real-World Image Restoration with Large-Scale Image Editing Models

Paper • 2603.25502 • Published Mar 26 • 58

PixelSmile: Toward Fine-Grained Facial Expression Editing

Paper • 2603.25728 • Published Mar 26 • 118

upvoted 2 papers 4 months ago

OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens

Paper • 2603.02138 • Published Mar 2 • 151

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

Paper • 2602.10604 • Published Feb 11 • 201

upvoted 5 papers 5 months ago

HY3D-Bench: Generation of 3D Assets

Paper • 2602.03907 • Published Feb 3 • 24

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 196

A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Doubao 1.8, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5

Paper • 2601.10527 • Published Jan 15 • 26

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 201

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Paper • 2601.05593 • Published Jan 9 • 87

upvoted 2 papers 6 months ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 183

VINO: A Unified Visual Generator with Interleaved OmniModal Context

Paper • 2601.02358 • Published Jan 5 • 30

upvoted 3 papers 7 months ago

Relational Visual Similarity

Paper • 2512.07833 • Published Dec 8, 2025 • 25

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published Dec 5, 2025 • 38

Captain Safari: A World Engine

Paper • 2511.22815 • Published Nov 28, 2025 • 12