Nakkwan Choi
Nakkwan
·
AI & ML interests
Computer Vision
Recent Activity
updated
a collection
2 days ago
Distillation
updated
a collection
8 days ago
Video
updated
a collection
8 days ago
VLM
Organizations
None yet
LLM
Survey
Transformer
Image Generate
Image Generate
-
DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation
Paper • 2511.19365 • Published • 64 -
The Collapse of Patches
Paper • 2511.22281 • Published • 6 -
Flow Straighter and Faster: Efficient One-Step Generative Modeling via MeanFlow on Rectified Trajectories
Paper • 2511.23342 • Published • 15 -
Glance: Accelerating Diffusion Models with 1 Sample
Paper • 2512.02899 • Published • 30
Editing
-
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation
Paper • 2511.09611 • Published • 70 -
In-Video Instructions: Visual Signals as Generative Control
Paper • 2511.19401 • Published • 32 -
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing
Paper • 2512.17909 • Published • 37
Distillation
Pretrain
Diffusion
-
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
Paper • 2512.13586 • Published • 93 -
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows
Paper • 2512.05150 • Published • 75 -
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 95 -
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss
Paper • 2602.02493 • Published • 41
VLM
-
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 237 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 129 -
Show, Don't Tell: Morphing Latent Reasoning into Image Generation
Paper • 2602.02227 • Published • 10
Video
-
Flow Map Distillation Without Data
Paper • 2511.19428 • Published • 5 -
Plan-X: Instruct Video Generation via Semantic Planning
Paper • 2511.17986 • Published • 18 -
Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation
Paper • 2602.02214 • Published • 24
Platform
-
UFO^3: Weaving the Digital Agent Galaxy
Paper • 2511.11332 • Published • 19 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 217 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 134
Theorem
Distillation
LLM
Pretrain
Survey
Diffusion
-
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
Paper • 2512.13586 • Published • 93 -
TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows
Paper • 2512.05150 • Published • 75 -
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times
Paper • 2512.16093 • Published • 95 -
PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss
Paper • 2602.02493 • Published • 41
Transformer
VLM
-
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer
Paper • 2511.22699 • Published • 237 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 129 -
Show, Don't Tell: Morphing Latent Reasoning into Image Generation
Paper • 2602.02227 • Published • 10
Image Generate
Image Generate
-
DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation
Paper • 2511.19365 • Published • 64 -
The Collapse of Patches
Paper • 2511.22281 • Published • 6 -
Flow Straighter and Faster: Efficient One-Step Generative Modeling via MeanFlow on Rectified Trajectories
Paper • 2511.23342 • Published • 15 -
Glance: Accelerating Diffusion Models with 1 Sample
Paper • 2512.02899 • Published • 30
Video
-
Flow Map Distillation Without Data
Paper • 2511.19428 • Published • 5 -
Plan-X: Instruct Video Generation via Semantic Planning
Paper • 2511.17986 • Published • 18 -
Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation
Paper • 2602.02214 • Published • 24
Editing
-
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation
Paper • 2511.09611 • Published • 70 -
In-Video Instructions: Visual Signals as Generative Control
Paper • 2511.19401 • Published • 32 -
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing
Paper • 2512.17909 • Published • 37
Platform
-
UFO^3: Weaving the Digital Agent Galaxy
Paper • 2511.11332 • Published • 19 -
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper • 2512.16676 • Published • 217 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 134