Shawn Leo's picture

Shawn Leo

lxysl

·

https://lxysl.github.io/

lxysl

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains

upvoted a paper about 2 months ago

UniPrefill: Universal Long-Context Prefill Acceleration via Block-wise Dynamic Sparsification

upvoted a paper 2 months ago

PersonaVLM: Long-Term Personalized Multimodal LLMs

View all activity

Organizations

upvoted a paper 13 days ago

OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains

Paper • 2606.14702 • Published 16 days ago • 31

upvoted a paper about 2 months ago

UniPrefill: Universal Long-Context Prefill Acceleration via Block-wise Dynamic Sparsification

Paper • 2605.06221 • Published May 7 • 22

upvoted a paper 2 months ago

PersonaVLM: Long-Term Personalized Multimodal LLMs

Paper • 2604.13074 • Published Mar 20 • 46

upvoted 2 papers 3 months ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published Apr 6 • 237

VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding

Paper • 2603.22285 • Published Mar 23 • 49

upvoted a paper 4 months ago

Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion

Paper • 2603.06577 • Published Mar 6 • 50

upvoted a paper 5 months ago

UI-Venus-1.5 Technical Report

Paper • 2602.09082 • Published Feb 9 • 158

upvoted a paper 8 months ago

VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting

Paper • 2510.21817 • Published Oct 21, 2025 • 41

upvoted a paper over 1 year ago

VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Paper • 2501.01957 • Published Jan 3, 2025 • 48