caohaoyu

rechy

AI & ML interests

None yet

Recent Activity

authored a paper about 5 hours ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

authored a paper about 5 hours ago

When Thinking Hurts: Mitigating Visual Forgetting in Video Reasoning via Frame Repetition

authored a paper about 5 hours ago

Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion

Organizations

None yet

upvoted a paper about 14 hours ago

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

Paper • 2604.05015 • Published 3 days ago • 192

upvoted 2 papers 2 months ago

Youtu-Parsing: Perception, Structuring and Recognition via High-Parallelism Decoding

Paper • 2601.20430 • Published Jan 28 • 16

Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision

Paper • 2601.19798 • Published Jan 27 • 43

upvoted a collection 2 months ago

Youtu

13 items • Updated 29 days ago • 25

upvoted a paper 3 months ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

Paper • 2512.24618 • Published Dec 31, 2025 • 154

upvoted a paper 5 months ago

VITA-E: Natural Embodied Interaction with Concurrent Seeing, Hearing, Speaking, and Acting

Paper • 2510.21817 • Published Oct 21, 2025 • 41

upvoted a paper almost 2 years ago

TextSquare: Scaling up Text-Centric Visual Instruction Tuning

Paper • 2404.12803 • Published Apr 19, 2024 • 30