StreamFormer
/

streamformer-timesformer

Video Classification

Model card Files Files and versions

StreamFormer commited on Aug 10, 2025

Commit

6e2bb37

·

verified ·

1 Parent(s): 6972901

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -13,6 +13,15 @@ StreamFormer backbone model pre-trained on *Global*-, *Temporal*- and *Spatial*-
 StreamFormer is a streaming video representation backbone that encodes a stream of video input. It is designed for multiple downstream applications like Online Action Detection, Online Video Instance Segmentation and Video Question Answering.
 ### How to use
 How to get the multi-granularity feature:

 StreamFormer is a streaming video representation backbone that encodes a stream of video input. It is designed for multiple downstream applications like Online Action Detection, Online Video Instance Segmentation and Video Question Answering.
+### Installation
+```bash
+conda create -n streamformer python=3.10
+conda activate streamformer
+conda install pytorch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 pytorch-cuda=12.4 -c pytorch -c nvidia
+pip install -r requirements.txt
+```
 ### How to use
 How to get the multi-granularity feature: