Image-Text-to-Text
Transformers
Safetensors
English
llava
vision-language
multimodal
qwen3
conversational
vqwen3-4b / processor_config.json
alpharomercoma's picture
Initial upload: vqwen3-4b (stage-1 + stage-2 LoRA merged)
9eccf38 verified
raw
history blame contribute delete
173 Bytes
{
"image_token": "<image>",
"num_additional_image_tokens": 1,
"patch_size": 14,
"processor_class": "LlavaProcessor",
"vision_feature_select_strategy": "default"
}