Running
README
⚡
Computer Vision
RIVER: A Real-Time Interaction Benchmark for Video LLMs
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
Display web content in a Streamlit app
Hierarchical Compression for Long-Context Video Modeling
Chat with an AI that understands images and text
Submit model evaluations and view the leaderboard
Upload a video to chat about its contents
Display maintenance message
Identify actions and objects in videos and images
View a remote web page within the app