Yixiao Fang

fangyixiao

1 27 8

fangyixiao18

AI & ML interests

Multimodal Large Models

Recent Activity

authored a paper 5 days ago

ShutterMuse: Capture-Time Photography Guidance with MLLMs

upvoted a paper 5 days ago

ShutterMuse: Capture-Time Photography Guidance with MLLMs

submitted a paper 5 days ago

ShutterMuse: Capture-Time Photography Guidance with MLLMs

View all activity

Organizations

None yet

upvoted a paper 5 days ago

ShutterMuse: Capture-Time Photography Guidance with MLLMs

Paper • 2606.25763 • Published 6 days ago • 45

upvoted a paper 10 days ago

FreeStyle: Free Control of Style-Content Dual-Reference Generation from Community LoRA Mining

Paper • 2606.20506 • Published 12 days ago • 28

upvoted a paper 3 months ago

GEditBench v2: A Human-Aligned Benchmark for General Image Editing

Paper • 2603.28547 • Published Mar 30 • 32

upvoted 2 papers 7 months ago

REASONEDIT: Towards Reasoning-Enhanced Image Editing Models

Paper • 2511.22625 • Published Nov 27, 2025 • 48

iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation

Paper • 2511.20635 • Published Nov 25, 2025 • 32

upvoted an article 7 months ago

Article

Diffusers welcomes FLUX-2

YiYiXu, dg845, sayakpaul, OzzyGT, dn6, ariG23498, linoyts, multimodalart

•

Nov 25, 2025

• 189

upvoted a paper 7 months ago

Step-Audio-R1 Technical Report

Paper • 2511.15848 • Published Nov 19, 2025 • 60

upvoted a paper 8 months ago

RegionE: Adaptive Region-Aware Generation for Efficient Image Editing

Paper • 2510.25590 • Published Oct 29, 2025 • 28

upvoted a paper 9 months ago

WithAnyone: Towards Controllable and ID Consistent Image Generation

Paper • 2510.14975 • Published Oct 16, 2025 • 86

upvoted a paper 10 months ago

POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion

Paper • 2509.01215 • Published Sep 1, 2025 • 52

upvoted a paper 11 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 146

upvoted 2 collections 11 months ago

NextStep-1

Collection

10 items • Updated Mar 2 • 34

Step3

Collection

2 items • Updated Jul 31, 2025 • 22

upvoted a paper 11 months ago

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22, 2025 • 76

upvoted a paper 12 months ago

Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation

Paper • 2507.08441 • Published Jul 11, 2025 • 63

upvoted 5 papers about 1 year ago

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Paper • 2506.17201 • Published Jun 20, 2025 • 55

OneIG-Bench: Omni-dimensional Nuanced Evaluation for Image Generation

Paper • 2506.07977 • Published Jun 9, 2025 • 40

Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers

Paper • 2506.03065 • Published Jun 3, 2025 • 27

KRIS-Bench: Benchmarking Next-Level Intelligent Image Editing Models

Paper • 2505.16707 • Published May 22, 2025 • 44

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published Apr 24, 2025 • 92

Yixiao Fang

AI & ML interests

Recent Activity

Organizations

fangyixiao's activity

Diffusers welcomes FLUX-2