| # ViPER-VT | |
| ## (Vision Text) | |
| This repository contains the checkpoints for the ViPER model. | |
| It is a Perceiver-based model finetuned on the concatenation of visual and textual features. | |
| For more information on how to use this model please refer to the following [repository](https://github.com/VaianiLorenzo/ViPER) | |
| If you find this useful please cite: | |
| ``` | |
| @inproceedings{vaiani2022viper, | |
| title={ViPER: Video-based Perceiver for Emotion Recognition}, | |
| author={Vaiani, Lorenzo and La Quatra, Moreno and Cagliero, Luca and Garza, Paolo}, | |
| booktitle={Proceedings of the 3rd International on Multimodal Sentiment Analysis Workshop and Challenge}, | |
| pages={67--73}, | |
| year={2022} | |
| } | |
| ``` | |
| For any other question feel free to contact me at lorenzo.vaiani@polito.it | |