Sports Video Understanding Benchmarks
AI & ML interests
Computer Vision; Video Understanding; Action Recognition
Recent Activity
View all activity
Papers
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
SAM 2++: Tracking Anything at Any Granularity
-
MCG-NJU/LongVPO-Stage1-InternVL3-8B
Video-Text-to-Text • 8B • Updated • 6 -
MCG-NJU/LongVPO-Stage2-InternVL3-8B
Video-Text-to-Text • 8B • Updated • 7 -
MCG-NJU/LongVPO-Training-Data
Viewer • Updated • 14.5k • 40 -
LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization
Paper • 2602.02341 • Published • 1
-
MCG-NJU/SteadyDancer-14B
Image-to-Video • Updated • 519 • 69 -
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
Paper • 2511.19320 • Published • 43 -
MCG-NJU/X-Dance
Viewer • Updated • 36 • 307 • 19 -
MCG-NJU/SteadyDancer-GGUF
Image-to-Video • 16B • Updated • 545 • 25
VideoMAE Pre-trained Models
-
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Paper • 2203.12602 • Published • 4 -
MCG-NJU/videomae-base
Video Classification • 94.2M • Updated • 268k • 55 -
MCG-NJU/videomae-base-finetuned-kinetics
Video Classification • 86.5M • Updated • 39.8k • 50 -
MCG-NJU/videomae-base-finetuned-ssv2
Video Classification • Updated • 2.25k • 7
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
Learning Human Skill Generators at Key-Step Levels
CaReBench data, CaRe models and all the contrastively trained MLLMs (including InternVL2, MiniCPM-V 2.6, LLaVA NeXT Video, Qwen2-VL and Tariser).
Sports Video Understanding Benchmarks
VideoMAE Pre-trained Models
-
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
Paper • 2203.12602 • Published • 4 -
MCG-NJU/videomae-base
Video Classification • 94.2M • Updated • 268k • 55 -
MCG-NJU/videomae-base-finetuned-kinetics
Video Classification • 86.5M • Updated • 39.8k • 50 -
MCG-NJU/videomae-base-finetuned-ssv2
Video Classification • Updated • 2.25k • 7
-
MCG-NJU/LongVPO-Stage1-InternVL3-8B
Video-Text-to-Text • 8B • Updated • 6 -
MCG-NJU/LongVPO-Stage2-InternVL3-8B
Video-Text-to-Text • 8B • Updated • 7 -
MCG-NJU/LongVPO-Training-Data
Viewer • Updated • 14.5k • 40 -
LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization
Paper • 2602.02341 • Published • 1
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
-
MCG-NJU/SteadyDancer-14B
Image-to-Video • Updated • 519 • 69 -
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
Paper • 2511.19320 • Published • 43 -
MCG-NJU/X-Dance
Viewer • Updated • 36 • 307 • 19 -
MCG-NJU/SteadyDancer-GGUF
Image-to-Video • 16B • Updated • 545 • 25
Learning Human Skill Generators at Key-Step Levels
CaReBench data, CaRe models and all the contrastively trained MLLMs (including InternVL2, MiniCPM-V 2.6, LLaVA NeXT Video, Qwen2-VL and Tariser).