Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
nkkbr
/
ViCA2-init
like
0
Video-Text-to-Text
Transformers
Safetensors
sam2
English
vica_qwen
text-generation
multimodal
vision-language
video understanding
visuospatial cognition
spatial reasoning
vlm
llava
qwen
siglip
hiera
dual-encoder
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
ViCA2-init
Commit History
Create README.md
2b77ece
verified
nkkbr
commited on
May 15, 2025
Initial commit
e6712e6
nkkbr
commited on
Apr 21, 2025
initial commit
31ece29
verified
nkkbr
commited on
Apr 21, 2025