Update README.md
Browse files
README.md
CHANGED
|
@@ -32,8 +32,8 @@ We use CLIP ViT-B/16 for all experiments below.
|
|
| 32 |
|
| 33 |
| Scripts | HMDB-51 | UCF-101 | Kinetics-600 | Ckpt |
|
| 34 |
|-------------------------------------------------------------------------|:------------:|:------------:|:--------------:|:--------------------------------------------------------------------------------:|
|
| 35 |
-
| [TC-CLIP](https://github.com/naver-ai/tc-clip/scripts/train/zero_shot/train_tc_clip_zero_shot.sh) | 54.2 ± 0.7 | 82.9 ± 0.6 | 75.8 ± 0.5 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/zero_shot_k400) |
|
| 36 |
-
| [TC-CLIP (LLM)](https://github.com/naver-ai/tc-clip/scripts/train/zero_shot/train_tc_clip_zero_shot_llm.sh) | 56.0 ± 0.3 | 85.4 ± 0.8 | 78.1 ± 1.0 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/zero_shot_k400_llm) |
|
| 37 |
|
| 38 |
|
| 39 |
#### Few-shot action recognition
|
|
@@ -41,25 +41,25 @@ We use CLIP ViT-B/16 for all experiments below.
|
|
| 41 |
| Scripts | HMDB-51 | UCF-101 | SSv2 | Ckpt |
|
| 42 |
|----------------------------------------------------------------------------|:-------------------------:|:-------------------------:|:------------------------:|:-------------:|
|
| 43 |
| | K=2 / K=4 / K=8 / K=16 | K=2 / K=4 / K=8 / K=16 | K=2 / K=4 / K=8 / K=16 | |
|
| 44 |
-
| [TC-CLIP](https://github.com/naver-ai/tc-clip/scripts/train/few_shot/train_tc_clip_few_shot.sh) | 57.3 / 62.3 / 67.3 / 68.6 | 85.9 / 89.9 / 92.5 / 94.6 | 7.3 / 8.6 / 9.3 / 14.0 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/few_shot) |
|
| 45 |
-
| [TC-CLIP (LLM)](https://github.com/naver-ai/tc-clip/scripts/train/few_shot/train_tc_clip_few_shot_llm.sh) | 58.6 / 63.3 / 65.5 / 68.8 | 86.8 / 90.1 / 92.0 / 94.3 | 7.3 / 8.6 / 9.3 / 14.0 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/few_shot_llm) |
|
| 46 |
-
| [TC-CLIP (P)](https://github.com/naver-ai/tc-clip/scripts/train/few_shot/train_tc_clip_few_shot_pretrained.sh) | 65.3 / 68.5 / 71.4 / 73.0 | 94.1 / 95.6 / 96.6 / 97.3 | 8.7 / 10.1 / 12.1 / 15.2 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/few_shot_pretrained) |
|
| 47 |
|
| 48 |
#### Base-to-novel generalization
|
| 49 |
|
| 50 |
| Scripts | K-400 | HMDB-51 | UCF-101 | SSv2 | Ckpt |
|
| 51 |
|----------------------------------------------------------------------------|:------------------:|:------------------:|:------------------:|:------------------:|:--------------------------------------------------------------------------------:|
|
| 52 |
| | Base / Novel / HM | Base / Novel / HM | Base / Novel / HM | Base / Novel / HM | |
|
| 53 |
-
| [TC-CLIP](https://github.com/naver-ai/tc-clip/scripts/train/base2novel/train_tc_clip_base2novel.sh) | 78.9 / 63.6 / 70.4 | 73.3 / 54.1 / 62.2 | 95.5 / 78.0 / 85.9 | 17.5 / 13.4 / 15.2 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/base2novel) |
|
| 54 |
-
| [TC-CLIP (LLM)](https://github.com/naver-ai/tc-clip/scripts/train/base2novel/train_tc_clip_base2novel_llm.sh) | 79.1 / 65.4 / 71.6 | 73.3 / 59.1 / 65.5 | 95.4 / 81.6 / 88.0 | 17.5 / 13.4 / 15.2 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/base2novel_llm) |
|
| 55 |
-
| [TC-CLIP (P)](https://github.com/naver-ai/tc-clip/scripts/train/base2novel/train_tc_clip_base2novel_pretrained.sh) | N/A | 79.4 / 58.3 / 67.2 | 97.5 / 84.5 / 90.5 | 19.6 / 15.6 / 17.4 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/base2novel_pretrained) |
|
| 56 |
|
| 57 |
|
| 58 |
#### Fully-supervised action recognition
|
| 59 |
|
| 60 |
| Scripts | K-400 (Top-1) | K-400 (Top-5) | Ckpt |
|
| 61 |
|-----------------------------------------------------------------------------|:-------------:|:-------------:|:--------------------------------------------------------------------------------:|
|
| 62 |
-
| [TC-CLIP](https://github.com/naver-ai/tc-clip/scripts/train/fully_supervised/train_tc_clip_fully_supervised.sh) | 85.2 | 96.9 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/fully_supervised_k400) |
|
| 63 |
|
| 64 |
|
| 65 |
## Citation
|
|
|
|
| 32 |
|
| 33 |
| Scripts | HMDB-51 | UCF-101 | Kinetics-600 | Ckpt |
|
| 34 |
|-------------------------------------------------------------------------|:------------:|:------------:|:--------------:|:--------------------------------------------------------------------------------:|
|
| 35 |
+
| [TC-CLIP](https://github.com/naver-ai/tc-clip/blob/main/scripts/train/zero_shot/train_tc_clip_zero_shot.sh) | 54.2 ± 0.7 | 82.9 ± 0.6 | 75.8 ± 0.5 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/zero_shot_k400) |
|
| 36 |
+
| [TC-CLIP (LLM)](https://github.com/naver-ai/tc-clip/blob/main/scripts/train/zero_shot/train_tc_clip_zero_shot_llm.sh) | 56.0 ± 0.3 | 85.4 ± 0.8 | 78.1 ± 1.0 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/zero_shot_k400_llm) |
|
| 37 |
|
| 38 |
|
| 39 |
#### Few-shot action recognition
|
|
|
|
| 41 |
| Scripts | HMDB-51 | UCF-101 | SSv2 | Ckpt |
|
| 42 |
|----------------------------------------------------------------------------|:-------------------------:|:-------------------------:|:------------------------:|:-------------:|
|
| 43 |
| | K=2 / K=4 / K=8 / K=16 | K=2 / K=4 / K=8 / K=16 | K=2 / K=4 / K=8 / K=16 | |
|
| 44 |
+
| [TC-CLIP](https://github.com/naver-ai/tc-clip/blob/main/scripts/train/few_shot/train_tc_clip_few_shot.sh) | 57.3 / 62.3 / 67.3 / 68.6 | 85.9 / 89.9 / 92.5 / 94.6 | 7.3 / 8.6 / 9.3 / 14.0 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/few_shot) |
|
| 45 |
+
| [TC-CLIP (LLM)](https://github.com/naver-ai/tc-clip/blob/main/scripts/train/few_shot/train_tc_clip_few_shot_llm.sh) | 58.6 / 63.3 / 65.5 / 68.8 | 86.8 / 90.1 / 92.0 / 94.3 | 7.3 / 8.6 / 9.3 / 14.0 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/few_shot_llm) |
|
| 46 |
+
| [TC-CLIP (P)](https://github.com/naver-ai/tc-clip/blob/main/scripts/train/few_shot/train_tc_clip_few_shot_pretrained.sh) | 65.3 / 68.5 / 71.4 / 73.0 | 94.1 / 95.6 / 96.6 / 97.3 | 8.7 / 10.1 / 12.1 / 15.2 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/few_shot_pretrained) |
|
| 47 |
|
| 48 |
#### Base-to-novel generalization
|
| 49 |
|
| 50 |
| Scripts | K-400 | HMDB-51 | UCF-101 | SSv2 | Ckpt |
|
| 51 |
|----------------------------------------------------------------------------|:------------------:|:------------------:|:------------------:|:------------------:|:--------------------------------------------------------------------------------:|
|
| 52 |
| | Base / Novel / HM | Base / Novel / HM | Base / Novel / HM | Base / Novel / HM | |
|
| 53 |
+
| [TC-CLIP](https://github.com/naver-ai/tc-clip/blob/main/scripts/train/base2novel/train_tc_clip_base2novel.sh) | 78.9 / 63.6 / 70.4 | 73.3 / 54.1 / 62.2 | 95.5 / 78.0 / 85.9 | 17.5 / 13.4 / 15.2 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/base2novel) |
|
| 54 |
+
| [TC-CLIP (LLM)](https://github.com/naver-ai/tc-clip/blob/main/scripts/train/base2novel/train_tc_clip_base2novel_llm.sh) | 79.1 / 65.4 / 71.6 | 73.3 / 59.1 / 65.5 | 95.4 / 81.6 / 88.0 | 17.5 / 13.4 / 15.2 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/base2novel_llm) |
|
| 55 |
+
| [TC-CLIP (P)](https://github.com/naver-ai/tc-clip/blob/main/scripts/train/base2novel/train_tc_clip_base2novel_pretrained.sh) | N/A | 79.4 / 58.3 / 67.2 | 97.5 / 84.5 / 90.5 | 19.6 / 15.6 / 17.4 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/base2novel_pretrained) |
|
| 56 |
|
| 57 |
|
| 58 |
#### Fully-supervised action recognition
|
| 59 |
|
| 60 |
| Scripts | K-400 (Top-1) | K-400 (Top-5) | Ckpt |
|
| 61 |
|-----------------------------------------------------------------------------|:-------------:|:-------------:|:--------------------------------------------------------------------------------:|
|
| 62 |
+
| [TC-CLIP](https://github.com/naver-ai/tc-clip/blob/main/scripts/train/fully_supervised/train_tc_clip_fully_supervised.sh) | 85.2 | 96.9 | [Link](https://huggingface.co/byminji/TC-CLIP/tree/main/fully_supervised_k400) |
|
| 63 |
|
| 64 |
|
| 65 |
## Citation
|