Video-Text-to-Text
Transformers
English
qwen2
text-generation

Improving LLM Video Understanding with 16 Frames Per Second

Official model release of Improving LLM Video Understanding with 16 Frames Per Second. Sports finetuned version.

Downloads last month
28
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for tsinghua-ee/F-16-Sports

Base model

Qwen/Qwen2-7B
Finetuned
(75)
this model

Datasets used to train tsinghua-ee/F-16-Sports