---
title: Video Representations
emoji: 🌖
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 6.4.0
python_version: '3.12'
app_file: app.py
pinned: false
---

This is a demo of the VideoMAE model that visualizes the attention map, latent space, and reconstruction of a video.

Choose one of the following visualization modes:

- Reconstruction: Mask 90% of the video patches and have the model reconstruct the masked patches.
- Attention: Visualize the average attention map of the last layer.
- Latent: Visualize the PCA components of the video's latent space.

You can choose the model, then either load the example video or upload your own.
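
As a rough illustration of two of the modes above, the following NumPy sketch builds a 90% patch mask and projects token features onto their top PCA components. It is not the code in `app.py`: the function names `random_patch_mask` and `pca_project`, the token grid (8 x 14 x 14), the feature size (768), and the use of plain random masking are all assumptions made for the example, and random noise stands in for real model features.

```python
import numpy as np


def random_patch_mask(num_patches: int, mask_ratio: float = 0.9, seed: int = 0) -> np.ndarray:
    """Return a boolean array where True marks a masked patch (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    num_masked = int(round(num_patches * mask_ratio))
    mask = np.zeros(num_patches, dtype=bool)
    # Pick distinct patch indices to mask, uniformly at random.
    mask[rng.choice(num_patches, size=num_masked, replace=False)] = True
    return mask


def pca_project(tokens: np.ndarray, n_components: int = 3) -> np.ndarray:
    """Project (num_tokens, dim) features onto their top principal components."""
    centered = tokens - tokens.mean(axis=0, keepdims=True)
    # Rows of vt are the principal directions, sorted by decreasing variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


# Assumed sizes: 8 temporal x 14 x 14 spatial tokens, 768-dim features.
num_tokens, dim = 8 * 14 * 14, 768
mask = random_patch_mask(num_tokens, mask_ratio=0.9)

# Random stand-in for the encoder's per-token features.
tokens = np.random.default_rng(1).normal(size=(num_tokens, dim))
components = pca_project(tokens, n_components=3)
```

One way to display the result would be to rescale the three components to the range [0, 1] and reshape them to `(8, 14, 14, 3)`, so that each token becomes one RGB cell per time step.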