niobures's picture
mmaction2
d3dbf03 verified
# Preparing Kinetics-710
## Introduction
<!-- [DATASET] -->
```BibTeX
@misc{li2022uniformerv2,
title={UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer},
author={Kunchang Li and Yali Wang and Yinan He and Yizhuo Li and Yi Wang and Limin Wang and Yu Qiao},
year={2022},
eprint={2211.09552},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
```
For basic dataset information, please refer to the [paper](https://arxiv.org/pdf/2211.09552.pdf). The scripts can be used for preparing kinetics-710. MMAction2 supports Kinetics-710
dataset as a concat dataset, which means only provides a list of annotation files, and makes use of the original data of Kinetics-400/600/700 dataset. You could refer to the [config](/configs/recognition/uniformerv2/uniformerv2-base-p16-res224_clip_u8_kinetics710-rgb.py)
for details, which also provides a template config about how to use concat dataset in MMAction2.
Before we start, please make sure that the directory is located at `$MMACTION2`.
## Step 1. Download Kinetics 400/600/700
Kinetics-710 is a video benchmark based on Kinetics-400/600/700, which merges the training set of these Kinetics datasets, and deletes the repeated videos according to Youtube IDs. MMAction2 provides an annotation file based on the Kinetics-400/600/700 on [OpenDataLab](https://opendatalab.com/). So we suggest you download Kinetics-400/600/700 first from OpenDataLab by [MIM](https://github.com/open-mmlab/mim).
```shell
# install OpenXlab CLI tools
pip install -U openxlab
# log in OpenXLab
openxlab login
# download Kinetics-400/600/700, note that this might take a long time.
mim download mmaction2 --dataset kinetics400
mim download mmaction2 --dataset kinetics600
mim download mmaction2 --dataset kinetics700
```
## Step 2. Download Kinetics-710 Annotations
We provide the annotation list of Kinetics-710 corresponding to OpenDataLab version Kinetics, you could download it from aliyun and unzip it to the `$MMACTION2/data/`
```shell
wget -P data https://download.openmmlab.com/mmaction/dataset/kinetics710/annotations.zip
cd data && unzip annotations.zip && cd ..
```
## Step 3. Folder Structure
After the whole data pipeline for Kinetics preparation.
you can get the videos and annotation files for Kinetics-710.
In the context of the whole project (for Kinetics only), the *minimal* folder structure will look like:
(*minimal* means that some data are not necessary: for example, you may want to evaluate kinetics using the original video format.)
```
mmaction2
β”œβ”€β”€ mmaction
β”œβ”€β”€ tools
β”œβ”€β”€ configs
β”œβ”€β”€ data
β”‚ β”œβ”€β”€ kinetics400
β”‚ β”‚ β”œβ”€β”€ videos_train
β”‚ β”‚ β”œβ”€β”€ videos_val
β”‚ β”‚ β”‚ β”œβ”€β”€ jf7RDuUTrsQ.mp4
β”‚ β”‚ β”‚ β”œβ”€β”€ ...
β”‚ β”œβ”€β”€ kinetics600
β”‚ β”‚ β”œβ”€β”€ videos
β”‚ β”‚ β”‚ β”œβ”€β”€ vol_00
β”‚ β”‚ β”‚ β”‚ β”œβ”€β”€ -A5JFdMXB_k_000018_000028.mp4
β”‚ β”‚ β”‚ β”‚ β”œβ”€β”€ ...
β”‚ β”‚ β”‚ β”œβ”€β”€ ...
β”‚ β”‚ β”‚ β”œβ”€β”€ vol63
β”‚ β”œβ”€β”€ kinetics700
β”‚ β”‚ β”œβ”€β”€ videos
β”‚ β”‚ β”‚ β”œβ”€β”€ vol_00
β”‚ β”‚ β”‚ β”‚ β”œβ”€β”€ -Paa0R0tQ1w_000009_000019.mp4
β”‚ β”‚ β”‚ β”‚ β”œβ”€β”€ ...
β”‚ β”‚ β”‚ β”œβ”€β”€ ...
β”‚ β”‚ β”‚ β”œβ”€β”€ vol63
β”‚ β”œβ”€β”€ kinetics710
β”‚ β”‚ β”œβ”€β”€ k400_train_list_videos.txt
β”‚ β”‚ β”œβ”€β”€ k400_val_list_videos.txt
β”‚ β”‚ β”œβ”€β”€ k600_train_list_videos.txt
β”‚ β”‚ β”œβ”€β”€ k600_val_list_videos.txt
β”‚ β”‚ β”œβ”€β”€ k700_train_list_videos.txt
β”‚ β”‚ β”œβ”€β”€ k700_val_list_videos.txt
```
For training and evaluating on Kinetics, please refer to [Training and Test Tutorial](/docs/en/user_guides/train_test.md).