jcwang0602/VPTracker
Tags: Image-Text-to-Text, Transformers, Safetensors, qwen3_vl, image-to-text, vision-language-tracking, multimodal, mllm, video, conversational, arxiv:2512.22799
VPTracker/generation_config.json (refs/pr/1)
jcwang0602 · Upload 27 files · ab7284e (verified) · 20 days ago
213 Bytes
{
  "bos_token_id": 151643,
  "do_sample": true,
  "eos_token_id": [
    151645,
    151643
  ],
  "pad_token_id": 151643,
  "temperature": 0.7,
  "top_k": 20,
  "top_p": 0.8,
  "transformers_version": "4.57.1"
}
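
The sampling fields above (do_sample, temperature, top_k, top_p) and the stop-token list (eos_token_id) are easier to read alongside a small worked example. The following is a minimal standard-library sketch of the usual meaning of these settings, not the Transformers implementation. The helper names (filtered_distribution, sample_next_token, is_stop_token) and the toy logits are hypothetical and exist only for illustration; the config values are copied from the file.

```python
import json
import math
import random

# Values copied from generation_config.json so the sketch is self-contained.
CONFIG = json.loads("""
{
  "bos_token_id": 151643,
  "do_sample": true,
  "eos_token_id": [151645, 151643],
  "pad_token_id": 151643,
  "temperature": 0.7,
  "top_k": 20,
  "top_p": 0.8
}
""")


def filtered_distribution(logits, temperature, top_k, top_p):
    """Hypothetical helper: {token_id: prob} after temperature, top-k, top-p."""
    # Temperature: divide logits before softmax; values below 1 sharpen the distribution.
    scaled = {tok: logit / temperature for tok, logit in logits.items()}
    # Top-k: keep only the k highest-scoring tokens, best first.
    kept = sorted(scaled.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
    # Softmax over the survivors (subtract the max for numerical stability).
    peak = kept[0][1]
    weights = [(tok, math.exp(score - peak)) for tok, score in kept]
    total = sum(w for _, w in weights)
    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    nucleus, cumulative = [], 0.0
    for tok, w in weights:
        p = w / total
        nucleus.append((tok, p))
        cumulative += p
        if cumulative >= top_p:
            break
    # Renormalise so the remaining probabilities sum to 1.
    mass = sum(p for _, p in nucleus)
    return {tok: p / mass for tok, p in nucleus}


def sample_next_token(logits, config, rng=random):
    """Hypothetical helper: pick one token id according to the config."""
    if not config["do_sample"]:
        # Greedy decoding: sampling parameters are ignored.
        return max(logits, key=logits.get)
    dist = filtered_distribution(
        logits, config["temperature"], config["top_k"], config["top_p"]
    )
    tokens = list(dist)
    return rng.choices(tokens, weights=[dist[t] for t in tokens], k=1)[0]


def is_stop_token(token_id, config):
    """Generation ends when any id listed in eos_token_id is produced."""
    eos = config["eos_token_id"]
    return token_id in (eos if isinstance(eos, list) else [eos])


if __name__ == "__main__":
    toy_logits = {0: 2.0, 1: 1.0, 2: 0.0, 3: -1.0}  # made-up scores for four token ids
    print(filtered_distribution(toy_logits, 0.7, 20, 0.8))
    print(sample_next_token(toy_logits, CONFIG, random.Random(0)))
    print(is_stop_token(151645, CONFIG))
```

In this file pad_token_id and bos_token_id share the id 151643, which also appears in eos_token_id, so under the sketch's reading either listed id ends generation.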