---
license: cc-by-4.0
datasets:
- MBZUAI/Video-Instruct-Dataset
language:
- en
library_name: transformers
pipeline_tag: visual-question-answering
---

Video-ChatGPT is a video conversation model that can hold meaningful conversations about videos.
It combines a large language model (LLM) with a pretrained visual encoder adapted for spatiotemporal video representation.

**GitHub:** [https://github.com/mbzuai-oryx/Video-ChatGPT](https://github.com/mbzuai-oryx/Video-ChatGPT)
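
The description above mentions a visual encoder adapted for spatiotemporal video representation. One common way to adapt a per-frame image encoder to video is to pool its patch features separately over space and over time, then concatenate the results into a single token sequence for the language model. The sketch below illustrates that idea with NumPy on random data. It is not taken from the Video-ChatGPT code: the function name `spatiotemporal_pool`, the array shapes, and the random features standing in for encoder output are all assumptions made for illustration.

```python
import numpy as np


def spatiotemporal_pool(frame_features: np.ndarray) -> np.ndarray:
    """Pool per-frame patch features into a compact video token sequence.

    frame_features has shape (T, N, D): T frames, N patches per frame,
    D feature channels. The result has shape (T + N, D).
    (Hypothetical helper, for illustration only.)
    """
    if frame_features.ndim != 3:
        raise ValueError("expected an array of shape (T, N, D)")

    # Average over patches: one D-dimensional vector per frame, shape (T, D).
    per_frame = frame_features.mean(axis=1)
    # Average over frames: one D-dimensional vector per patch, shape (N, D).
    per_patch = frame_features.mean(axis=0)

    # Concatenate both views into a single sequence of video tokens.
    return np.concatenate([per_frame, per_patch], axis=0)


# Random features stand in for the output of a pretrained visual encoder.
rng = np.random.default_rng(0)
features = rng.standard_normal((8, 16, 32))  # 8 frames, 16 patches, 32 channels

video_tokens = spatiotemporal_pool(features)
print(video_tokens.shape)  # (24, 32)
```

Pooling this way keeps the sequence length at `T + N` rather than `T * N`, which is one reason such a scheme is attractive when the tokens are passed to an LLM. In a full model, a learned projection would typically map these tokens into the LLM's embedding space; that step is omitted here.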