A newer version of the Gradio SDK is available:
6.2.0
Preparing Video Retrieval Datasets
Introduction
@inproceedings{xu2016msr,
title={Msr-vtt: A large video description dataset for bridging video and language},
author={Xu, Jun and Mei, Tao and Yao, Ting and Rui, Yong},
booktitle={CVPR},
pages={5288--5296},
year={2016}
}
@inproceedings{chen2011collecting,
title={Collecting highly parallel data for paraphrase evaluation},
author={Chen, David and Dolan, William B},
booktitle={ACL},
pages={190--200},
year={2011}
}
Before we start, please make sure that the directory is located at $MMACTION2/tools/data/video_retrieval/.
Preparing MSRVTT dataset
For basic dataset information, you can refer to the MSRVTT dataset website. Run the following command to prepare the MSRVTT dataset:
bash prepare_msrvtt.sh
After preparation, the folder structure will look like:
mmaction2
βββ mmaction
βββ tools
βββ configs
βββ data
β βββ video_retrieval
β β βββ msrvtt
β β βββ train_9k.json
β β βββ train_7k.json
β β βββ test_JSFUSION.json
β β ββββ videos
β β βββ video0.mp4
β β βββ video1.mp4
β β βββ ...
β β βββ video9999.mp4
Preparing MSVD dataset
For basic dataset information, you can refer to the MSVD dataset website. Run the following command to prepare the MSVD dataset:
bash prepare_msvd.sh
After preparation, the folder structure will look like:
mmaction2
βββ mmaction
βββ tools
βββ configs
βββ data
β βββ video_retrieval
β β βββ msrvd
β β βββ train.json
β β βββ test.json
β β βββ val.json
β β ββββ videos
β β βββ xxx.avi
β β βββ xxx.avi
β β βββ ...
β β βββ xxx.avi