Video Foundation Models - a dtong Collection

dtong 's Collections

Video Foundation Models

Video Foundation Models

updated 1 day ago

Temporal-Visual Semantic Alignment: A Unified Architecture for Transferring Spatial Priors from Vision Models to Zero-Shot Temporal Tasks

Paper • 2511.19856 • Published Nov 25, 2025
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

Paper • 2512.01342 • Published Dec 1, 2025 • 16