Zhihang Liu

lntzm

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 5 hours ago

UniClawBench: A Universal Benchmark for Proactive Agents on Real-World Tasks

upvoted a paper about 2 months ago

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation

upvoted a paper about 2 months ago

DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models

View all activity

Organizations

None yet

upvoted a paper about 5 hours ago

UniClawBench: A Universal Benchmark for Proactive Agents on Real-World Tasks

Paper • 2607.08768 • Published 1 day ago • 20

upvoted 2 papers about 2 months ago

MSAVBench: Towards Comprehensive and Reliable Evaluation of Multi-Shot Audio-Video Generation

Paper • 2605.20183 • Published May 19 • 14

DiffusionOPD: A Unified Perspective of On-Policy Distillation in Diffusion Models

Paper • 2605.15055 • Published May 14 • 19

upvoted a paper 3 months ago

AIBench: Evaluating Visual-Logical Consistency in Academic Illustration Generation

Paper • 2603.28068 • Published Mar 31 • 13

upvoted 3 papers 4 months ago

MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data

Paper • 2603.25319 • Published Mar 26 • 32

DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning

Paper • 2603.12257 • Published Mar 12 • 31

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Paper • 2603.06569 • Published Mar 6 • 120

upvoted a paper 7 months ago

ShowTable: Unlocking Creative Table Visualization with Collaborative Reflection and Refinement

Paper • 2512.13303 • Published Dec 15, 2025 • 17

upvoted a paper 8 months ago

Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance

Paper • 2510.24711 • Published Oct 28, 2025 • 20

upvoted a paper 12 months ago

Test-Time Scaling with Reflective Generative Model

Paper • 2507.01951 • Published Jul 2, 2025 • 108

upvoted 3 papers over 1 year ago

DreamRelation: Relation-Centric Video Customization

Paper • 2503.07602 • Published Mar 10, 2025 • 14

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22, 2025 • 91

Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition

Paper • 2412.09501 • Published Dec 12, 2024 • 48