Yansong Shi

nanamma

10 4 3

https://huggingface.co/nanamma

AI & ML interests

multi modality, video understanding, robotics

Recent Activity

upvoted a paper 20 days ago

VideoChat3: Fully Open Video MLLM for Efficient and Generalist Video Understanding

upvoted a paper 5 months ago

Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning

new activity 5 months ago

nanamma/RIVER:Add task categories and link to paper

View all activity

Organizations

upvoted a paper 20 days ago

VideoChat3: Fully Open Video MLLM for Efficient and Generalist Video Understanding

Paper • 2607.14935 • Published 22 days ago • 171

upvoted a paper 5 months ago

Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning

Paper • 2601.23224 • Published Jan 30 • 6

New activity in nanamma/RIVER 5 months ago

Add task categories and link to paper

#2 opened 5 months ago by

nielsr

updated a dataset 5 months ago

OpenGVLab/RIVER

Updated Mar 6 • 69

published a dataset 5 months ago

OpenGVLab/RIVER

Updated Mar 6 • 69

authored a paper 5 months ago

RIVER: A Real-Time Interaction Benchmark for Video LLMs

Paper • 2603.03985 • Published Mar 4 • 7

submitted a paper to Daily Papers 5 months ago

RIVER: A Real-Time Interaction Benchmark for Video LLMs

Paper • 2603.03985 • Published Mar 4 • 7

upvoted a paper 5 months ago

RIVER: A Real-Time Interaction Benchmark for Video LLMs

Paper • 2603.03985 • Published Mar 4 • 7

updated a dataset 5 months ago

nanamma/RIVER

Updated Mar 6 • 200

upvoted a paper 8 months ago

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

Paper • 2512.01342 • Published Dec 1, 2025 • 21

authored 2 papers 11 months ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Paper • 2403.15377 • Published Mar 22, 2024 • 29

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Paper • 2410.19702 • Published Oct 25, 2024 • 2

New activity in qiukingballball/RoboCerebra 12 months ago

how to test

#4 opened 12 months ago by

nanamma

published a dataset about 1 year ago

nanamma/RIVER

Updated Mar 6 • 200

liked a dataset over 1 year ago

Mutonix/Vript

Viewer • Updated Jun 11, 2024 • 409k • 13.4k • 27

New activity in Enxin/MovieChat-1K_train almost 2 years ago

so many quote '"' in captions in json files

#2 opened almost 2 years ago by

nanamma

updated a collection almost 2 years ago

VideoChat

Collection

Chat-Centric Video Understanding • 8 items • Updated Sep 28, 2025 • 3

updated 2 models almost 2 years ago

OpenGVLab/ViCLIP-L-14-hf

0.4B • Updated Sep 17, 2024 • 1.63k • 1

OpenGVLab/ViCLIP-B-16-hf

0.1B • Updated Sep 17, 2024 • 1.3k • 1

updated a collection almost 2 years ago

InternVid

Collection

A Large-Scale Video-Text Dataset • 7 items • Updated Sep 28, 2025

Yansong Shi

AI & ML interests

Recent Activity

Organizations

nanamma's activity

Add task categories and link to paper

how to test

so many quote '"' in captions in json files