Elysium: Exploring Object-level Perception in Videos via MLLM Paper • 2403.16558 • Published Mar 25, 2024
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM Paper • 2412.09530 • Published Dec 12, 2024