erow's picture
add app
dc22224

A newer version of the Gradio SDK is available: 6.11.0

Upgrade
metadata
title: Video Representations
emoji: 🌖
colorFrom: indigo
colorTo: blue
sdk: gradio
sdk_version: 6.4.0
python_version: '3.12'
app_file: app.py
pinned: false

This is a demo of the VideoMAE model to visualize the attention map, latent space, and reconstruction of a video.

Choose one of the following modes to visualize the video:

  • Reconstruction: Reconstruct the video by masking 90% of the patches and reconstructing the masked patches.
  • Attention: Visualize the average attention map of the last layer.
  • Latent: Visualize the PCA components of the latent space of the video.

You can choose the model and load the example video or upload your own video to visualize the video.