| # Dataset Preparation |
|
|
| ## Training data |
|
|
| - annotation: We place the annotation file in [dataset/trainval/train_2k5.json](../dataset/trainval/train_2k5.json). |
|
|
| - original videos: You can download download from our organized version in [Boshenxx/TimeR1-Dataset](https://huggingface.co/datasets/Boshenxx/TimeR1-Dataset); |
|
|
| or you can download and organize the original data from [VTG-IT](https://huggingface.co/datasets/Yongxin-Guo/VTG-IT), [TimeIT](https://huggingface.co/datasets/ShuhuaiRen/TimeIT), [TimePro](https://huggingface.co/Lanxingxuan/TimeSuite), [HTStep](https://openreview.net/pdf?id=vv3cocNsEK) and [LongVid](https://huggingface.co/datasets/OpenGVLab/LongVid). |
|
|
|
|
| ## Testing data |
|
|
|
|
| folder structure: |
| ``` |
| dataset |
| ββ timer1 |
| β ββ annotations |
| β β ββ train_2k5.json |
| β β ββ tvgbench.json |
| β ββ videos |
| β β ββ timerft_data |
| β β | ββ xxx.mp4 |
| β β β ββ ... |
| β β ββ tvgbench_data |
| β β | ββ xxx.mp4 |
| β β β ββ ... |
| ββ activitynet |
| β ββ annotations |
| β β ββ sentence_temporal_grounding |
| β β β ββ test.json |
| β ββ videos |
| β | ββ v_zzz_3yWpTXo.mp4 |
| β β ββ ... |
| ββ charades |
| β ββ Charades_anno |
| β β ββ Charades_v1_test.csv |
| β ββ Charades_v1 |
| β | ββ 0I0FX.mp4 |
| β β ββ ... |
| ββ mvbench |
| β ββ json |
| β β ββ action_antonym.json |
| β β ββ ... |
| β ββ videos |
| β β ββ clevrer |
| β β ββ ... |
| ββ tempcompass |
| β ββ questions |
| β β ββ multi-choice.json |
| β β ββ ... |
| β ββ videos |
| β β ββ 315784.mp4 |
| β β ββ ... |
| ββ egoschema |
| β ββ MC |
| β β ββ test-00000-of-00001.parquet |
| β ββ Subset |
| β β ββ test-00000-of-00001.parquet |
| β ββ videos |
| β β ββ 001934bb-81bd-4cd8-a574-0472ef3f6678.mp4 |
| β β ββ ... |
| ββ videomme |
| β ββ videomme |
| β β ββ test-00000-of-00001.parquet |
| β ββ data |
| β β ββ _8lBR0E_Tx8.mp4 |
| ββ ββ ββ ... |
| ``` |
|
|
| ### ActivityNet |
| Download link: [ActivityNet](https://cs.stanford.edu/people/ranjaykrishna/densevid/) |
|
|
| For fine-tuning setting, before training, you need to preprocess the video data. |
|
|
| ```bash |
| bash preprocess_video.sh |
| ``` |
| Specify the path to the Charades-STA dataset (video files, annotations, etc.). |
|
|
|
|
| ### Charades |
| Download link: [Charades-v1](https://huggingface.co/datasets/HuggingFaceM4/charades) |
|
|
| For fine-tuning setting, before training, you need to preprocess the video data. |
|
|
| ```bash |
| bash preprocess_video.sh |
| ``` |
| Specify the path to the Charades-STA dataset (video files, annotations, etc.). |
|
|
|
|
| ### TVGBench |
| Download link: [hf: Boshenxx/TimeR1-Dataset](https://huggingface.co/datasets/Boshenxx/TimeR1-Dataset) |
|
|
|
|
| ### MVBench |
|
|
| We place the annotation file of tvgbench in [dataset/trainval/tvgbench.json](../dataset/trainval/tvgbench.json). |
|
|
| Download link: [hf: OpenGVLab/MVBench](https://huggingface.co/datasets/OpenGVLab/MVBench) |
|
|
|
|
| ### VideoMME |
| Download link: [hf: lmms-lab/MVBench](https://huggingface.co/datasets/lmms-lab/Video-MME) |
|
|
|
|
| ### Egoschema |
| Download link: [hf: lmms-lab/egoschema](https://huggingface.co/datasets/lmms-lab/egoschema) |
|
|
|
|
| ### TempCompass |
|
|
| Download link: [hf: lmms-lab/TempCompass](https://huggingface.co/datasets/lmms-lab/TempCompass) |
|
|
|
|