yzy666 committed on
Commit
57035af
·
verified ·
1 Parent(s): c404510

Update README.md

  1. README.md +25 -0
README.md CHANGED
@@ -17,6 +17,31 @@ pipeline_tag: visual-question-answering

  This dataset card aims to provide a comprehensive overview of the StreamingChat model. For details, see our [Project](https://yzy-bupt.github.io/SVBench/), [Paper](https://arxiv.org/abs/2502.10810), [Dataset](https://huggingface.co/datasets/yzy666/SVBench) and [GitHub repository](https://github.com/yzy-bupt/SVBench).

+ ## **Dataset Description**
+ **StreamingChat** is a streaming video understanding model built upon [InternVideo2.5](https://huggingface.co/OpenGVLab/InternVideo2_5_Chat_8B). It is trained on streaming video dialogue data, including temporal dialogue paths from the [SVBench](https://huggingface.co/datasets/yzy666/SVBench) training set. The model is fine-tuned using a static resolution strategy, which enables it to process several minutes of video sampled at 1 FPS. Images are interleaved with language tokens, and each image is represented by 16 tokens. This model aims to catalyze progress in streaming video understanding.
+
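The two figures quoted in the description (1 FPS sampling, 16 tokens per image) imply a simple visual-token budget for a clip. The sketch below illustrates that arithmetic. It is not taken from the repository: the function names, the uniform index-picking rule, and the choice to ignore text and separator tokens are all assumptions made for illustration, and the actual `demo.py` may sample frames differently.

```python
TOKENS_PER_IMAGE = 16  # from the description: each image comprises 16 tokens


def sample_frame_indices(total_frames: int, native_fps: float,
                         target_fps: float = 1.0) -> list[int]:
    """Pick one frame index per 1/target_fps seconds of video (hypothetical helper)."""
    if total_frames <= 0 or native_fps <= 0 or target_fps <= 0:
        return []
    duration_s = total_frames / native_fps
    num_samples = int(duration_s * target_fps)  # whole sampling intervals only
    step = native_fps / target_fps              # native frames between samples
    # Clamp to the last valid index in case rounding lands past the end.
    return [min(round(i * step), total_frames - 1) for i in range(num_samples)]


def visual_token_count(num_images: int,
                       tokens_per_image: int = TOKENS_PER_IMAGE) -> int:
    """Visual tokens contributed by the sampled images (text tokens not counted)."""
    return num_images * tokens_per_image


if __name__ == "__main__":
    # A 5-minute clip recorded at 30 FPS has 9000 native frames.
    indices = sample_frame_indices(total_frames=9000, native_fps=30.0)
    print(len(indices), "frames ->", visual_token_count(len(indices)), "visual tokens")
```

Under these assumptions, a 5-minute clip yields 300 sampled images and 4,800 visual tokens.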
+ ## **Uses**
+
+ Download the StreamingChat model from Hugging Face:
+
+ ```bash
+ git clone https://huggingface.co/yzy666/StreamingChat_8B
+ ```
+
+ Install Python dependencies:
+ ```bash
+ conda create -n StreamingChat -y python=3.9.21
+ conda activate StreamingChat
+ conda install -y -c pytorch pytorch=2.5.1 torchvision=0.20.1
+ pip install transformers==4.37.2 opencv-python==4.11.0.84 imageio==2.37.0 decord==0.6.0
+ pip install flash-attn --no-build-isolation
+ ```
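After installation, it can help to confirm that the environment matches the pinned versions before running the demo. The following is a minimal sketch using the standard-library `importlib.metadata`. The `PINNED` table simply copies the pip version numbers from the install commands; the helper name `find_mismatches` is hypothetical and not part of the repository.

```python
from importlib import metadata

# Version numbers copied from the pip install line in the README.
PINNED = {
    "transformers": "4.37.2",
    "opencv-python": "4.11.0.84",
    "imageio": "2.37.0",
    "decord": "0.6.0",
}


def find_mismatches(pinned=PINNED, get_version=metadata.version):
    """Return {package: (expected, installed_or_None)} for every unsatisfied pin."""
    mismatches = {}
    for name, expected in pinned.items():
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        if installed != expected:
            mismatches[name] = (expected, installed)
    return mismatches


if __name__ == "__main__":
    problems = find_mismatches()
    if not problems:
        print("All pinned packages match.")
    for name, (expected, installed) in problems.items():
        print(f"{name}: expected {expected}, found {installed}")
```

The version lookup is passed in as a parameter so the check can be exercised without touching the real environment.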
+ Run the inference script directly:
+ ```bash
+ python demo.py
+ ```
+
+
  ## **Citation**
  If you find our data useful, please consider citing our work!
  ```