Zhixiong Zhang (SII)

rookiexiong

rookiexiong7

AI & ML interests

Ph.D. student at SJTU and SII. SII is an institution dedicated to innovation in education and research in the field of AI.

Recent Activity

authored a paper 10 days ago

LoMo: Local Modality Substitution for Deeper Vision-Language Fusion

upvoted a paper 2 days ago

AdaCodec: A Predictive Visual Code for Video MLLMs

Paper • 2606.02569 • Published 7 days ago • 4

upvoted a paper 10 days ago

LoMo: Local Modality Substitution for Deeper Vision-Language Fusion

Paper • 2605.30265 • Published 11 days ago • 23

upvoted a paper 11 days ago

SetCon: Towards Open-Ended Referring Segmentation via Set-Level Concept Prediction

Paper • 2605.20110 • Published 20 days ago • 3

upvoted a paper 14 days ago

ETCHR: Editing To Clarify and Harness Reasoning

Paper • 2605.23897 • Published 17 days ago • 13

upvoted a paper 24 days ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published 28 days ago • 46

upvoted a paper about 2 months ago

The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping

Paper • 2604.11297 • Published Apr 13 • 144

upvoted 4 papers 4 months ago

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

Paper • 2602.12205 • Published Feb 12 • 83

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 159

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

Paper • 2602.02437 • Published Feb 2 • 80

G^2RPO: Granular GRPO for Precise Reward in Flow Models

Paper • 2510.01982 • Published Oct 2, 2025 • 8

upvoted 4 papers 6 months ago

AdaTooler-V: Adaptive Tool-Use for Images and Videos

Paper • 2512.16918 • Published Dec 18, 2025 • 14

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Paper • 2512.16915 • Published Dec 18, 2025 • 38

Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning

Paper • 2512.15693 • Published Dec 17, 2025 • 18

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

Paper • 2512.11799 • Published Dec 12, 2025 • 30

upvoted 3 papers 7 months ago

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published Nov 4, 2025 • 60

UniREditBench: A Unified Reasoning-based Image Editing Benchmark

Paper • 2511.01295 • Published Nov 3, 2025 • 39

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published Oct 28, 2025 • 19

upvoted 2 papers 8 months ago

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Paper • 2509.22624 • Published Sep 26, 2025 • 19

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Paper • 2509.22647 • Published Sep 26, 2025 • 37

upvoted a paper 9 months ago

SIM-CoT: Supervised Implicit Chain-of-Thought

Paper • 2509.20317 • Published Sep 24, 2025 • 43
