ElevenZ

shiyi0408

upvoted 3 papers 2 months ago

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows

Paper • 2604.28139 • Published Apr 30 • 42

Refinement via Regeneration: Enlarging Modification Space Boosts Image Refinement in Unified Multimodal Models

Paper • 2604.25636 • Published Apr 28 • 24

Meta-CoT: Enhancing Granularity and Generalization in Image Editing

Paper • 2604.24625 • Published Apr 27 • 26

upvoted a paper 4 months ago

CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

Paper • 2603.04291 • Published Mar 4 • 15

upvoted a paper 7 months ago

JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Paper • 2511.23002 • Published Nov 28, 2025 • 26

upvoted a paper 8 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 197

upvoted a collection 8 months ago

Video-As-Prompt

The model zoo for "Video-As-Prompt: Unified Semantic Control for Video Generation" • 3 items • Updated Oct 27, 2025 • 14

upvoted a paper 8 months ago

Video-As-Prompt: Unified Semantic Control for Video Generation

Paper • 2510.20888 • Published Oct 23, 2025 • 50

upvoted 2 papers 9 months ago

FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

Paper • 2510.12747 • Published Oct 14, 2025 • 40

GIR-Bench: Versatile Benchmark for Generating Images with Reasoning

Paper • 2510.11026 • Published Oct 13, 2025 • 18

upvoted a paper 11 months ago

ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing

Paper • 2508.10881 • Published Aug 14, 2025 • 54

upvoted 3 papers about 1 year ago

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Paper • 2505.05422 • Published May 8, 2025 • 9

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios

Paper • 2505.03730 • Published May 6, 2025 • 28

Cobra: Efficient Line Art COlorization with BRoAder References

Paper • 2504.12240 • Published Apr 16, 2025 • 27

upvoted 6 papers over 1 year ago

BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation

Paper • 2503.20672 • Published Mar 26, 2025 • 14

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Paper • 2503.13434 • Published Mar 17, 2025 • 28

VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control

Paper • 2503.05639 • Published Mar 7, 2025 • 29

KV-Edit: Training-Free Image Editing for Precise Background Preservation

Paper • 2502.17363 • Published Feb 24, 2025 • 37

BrushEdit: All-In-One Image Inpainting and Editing

Paper • 2412.10316 • Published Dec 13, 2024 • 37

ColorFlow: Retrieval-Augmented Image Sequence Colorization

Paper • 2412.11815 • Published Dec 16, 2024 • 27