Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
byminji
/
LLaVA-NeXT-13B-Video-FT
like
0
Video-Text-to-Text
Transformers
Safetensors
OpenGVLab/VideoChat2-IT
byminji/VideoChat2-IT-clean
English
llava
text-generation
multi-modal
large-language-model
video-language-model
arxiv:
2510.13251
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
LLaVA-NeXT-13B-Video-FT
/
preprocessor_config.json
Commit History
Upload model weights
ae331ce
byminji
commited on
8 days ago