Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
nkkbr
/
ViCA2-stage2-onevision-ft
like
0
Video-Text-to-Text
Transformers
Safetensors
sam2
lmms-lab/LLaVA-OneVision-Data
English
vica_qwen
text-generation
multimodal
vision-language
video understanding
visuospatial cognition
spatial reasoning
vlm
llava
qwen
siglip
hiera
dual-encoder
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
ViCA2-stage2-onevision-ft
Commit History
Create README.md
638549d
verified
nkkbr
commited on
May 15, 2025
Initial commit
a87ec6c
nkkbr
commited on
Apr 21, 2025
initial commit
7e37c0e
verified
nkkbr
commited on
Apr 21, 2025