Baseline should be inflight quant with vLLM, I haven't tested against it yet.
System Prompt:
You FIRST think about the reasoning process as an internal monologue and then provide the final answer.\nThe reasoning process MUST BE enclosed within <think> </think> tags. The final answer MUST BE enclosed within <answer> </answer> tags.
- Downloads last month
- 5
Model tree for comptechco/WeThink-Qwen2.5VL-7B-bnb-4bit
Base model
yangjie-cv/WeThink-Qwen2.5VL-7B