5 14 16

Zijie Xin

xxayt

https://xxayt.github.io/

xxayt

AI & ML interests

multi-modal learning, AIGC

Recent Activity

upvoted a paper 12 days ago

OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains

liked a dataset 12 days ago

MiG-NJU/OmniVideo-100K

upvoted a collection 13 days ago

[ICML2026]Video-opd

View all activity

Organizations

upvoted a paper 12 days ago

OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains

Paper • 2606.14702 • Published 16 days ago • 31

upvoted a collection 13 days ago

[ICML2026]Video-opd

Collection

3 items • Updated 16 days ago • 1

upvoted a collection 27 days ago

SEATS

Collection

5 items • Updated 21 days ago • 1

upvoted 2 papers about 1 month ago

OmniPro: A Comprehensive Benchmark for Omni-Proactive Streaming Video Understanding

Paper • 2605.18577 • Published May 18 • 5

Stage-adaptive Token Selection for Efficient Omni-modal LLMs

Paper • 2605.20035 • Published May 19 • 5

upvoted 2 papers 3 months ago

MMOU: A Massive Multi-Task Omni Understanding and Reasoning Benchmark for Long and Complex Real-World Videos

Paper • 2603.14145 • Published Mar 14 • 15

SAVE: Speech-Aware Video Representation Learning for Video-Text Retrieval

Paper • 2603.08224 • Published Mar 9 • 1

upvoted a paper 6 months ago

OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models

Paper • 2511.14582 • Published Nov 18, 2025 • 19

upvoted a collection 7 months ago

Qwen3-Omni

Collection

6 items • Updated Dec 31, 2025 • 204

upvoted a paper 9 months ago

Multi-Object Sketch Animation by Scene Decomposition and Motion Planning

Paper • 2503.19351 • Published Mar 25, 2025 • 1

upvoted a collection 10 months ago

[ICCV2025]MGSV

Collection

[ICCV 2025] Music Grounding by Short Video • 4 items • Updated 13 days ago • 1

upvoted 3 papers 11 months ago

Zijie Xin

AI & ML interests

Recent Activity

Organizations

xxayt's activity