Vision-CAIR/Tempo-6B
Video-Text-to-Text
Official Tempo-6B collection: a query-aware framework that addresses the mismatch between massive video streams and bounded LLM context windows.
Smart Compressors for Long Video Understanding