Update README.md
Browse files
README.md
CHANGED
|
@@ -17,6 +17,31 @@ pipeline_tag: visual-question-answering
|
|
| 17 |
|
| 18 |
This dataset card aims to provide a comprehensive overview of the StreamingChat model. For details, see our [Project](https://yzy-bupt.github.io/SVBench/), [Paper](https://arxiv.org/abs/2502.10810), [Dataset](https://huggingface.co/datasets/yzy666/SVBench) and [GitHub repository](https://github.com/yzy-bupt/SVBench).
|
| 19 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 20 |
## **Citation**
|
| 21 |
If you find our data useful, please consider citing our work!
|
| 22 |
```
|
|
|
|
| 17 |
|
| 18 |
This dataset card aims to provide a comprehensive overview of the StreamingChat model. For details, see our [Project](https://yzy-bupt.github.io/SVBench/), [Paper](https://arxiv.org/abs/2502.10810), [Dataset](https://huggingface.co/datasets/yzy666/SVBench) and [GitHub repository](https://github.com/yzy-bupt/SVBench).
|
| 19 |
|
| 20 |
+
## **Dataset Description**
|
| 21 |
+
**StreamingChat** is a streaming video understanding model built upon [InternVideo2.5](https://huggingface.co/OpenGVLab/InternVideo2_5_Chat_8B). It utilizes Streaming video dialogue data, including temporal dialogue paths from the [SVBench](https://huggingface.co/datasets/yzy666/SVBench) training set. The model is fine-tuned using a static resolution strategy, enabling it to process several minutes of video at a rate of 1 FPS. Images are interleaved with language tokens, with each image comprising 16 tokens. This model aims to catalyze progress in streaming video understanding.
|
| 22 |
+
|
| 23 |
+
## **Uses**
|
| 24 |
+
|
| 25 |
+
Download the StreamingChat model from Hugging Face:
|
| 26 |
+
|
| 27 |
+
```bash
|
| 28 |
+
git clone https://huggingface.co/yzy666/StreamingChat_8B
|
| 29 |
+
```
|
| 30 |
+
|
| 31 |
+
Install Python dependencies:
|
| 32 |
+
```bash
|
| 33 |
+
conda create -n StreamingChat -y python=3.9.21
|
| 34 |
+
conda activate StreamingChat
|
| 35 |
+
conda install -y -c pytorch pytorch=2.5.1 torchvision=0.10.1
|
| 36 |
+
pip install transformers=4.37.2 opencv-python=4.11.0.84 imageio=2.37.0 decord=0.6.0
|
| 37 |
+
pip install flash-attn --no-build-isolation
|
| 38 |
+
```
|
| 39 |
+
Run the inference script directly:
|
| 40 |
+
```bash
|
| 41 |
+
python demo.py
|
| 42 |
+
```
|
| 43 |
+
|
| 44 |
+
|
| 45 |
## **Citation**
|
| 46 |
If you find our data useful, please consider citing our work!
|
| 47 |
```
|