Example inference server usage with vLLM:
pip install "vllm>=0.17.0"
pip install "huggingface-hub>=1.6.0" "transformers>=5.3.0"
vllm serve hhzm/qwen3.5-4b-meow --reasoning-parser qwen3 --enable-auto-tool-choice --tool-call-parser qwen3_coder
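Once the server is running, vLLM exposes an OpenAI-compatible API (by default at http://localhost:8000/v1). A minimal sketch of a chat-completion request payload for this model follows; the host, port, and prompt text here are assumptions, not part of the model card:

```python
import json

# Build an OpenAI-style chat-completion request body for the served model.
payload = {
    "model": "hhzm/qwen3.5-4b-meow",
    "messages": [{"role": "user", "content": "Hello!"}],
    "max_tokens": 128,
}
body = json.dumps(payload)

# Send it to vLLM's OpenAI-compatible endpoint, e.g. with the requests library:
#   requests.post("http://localhost:8000/v1/chat/completions",
#                 data=body, headers={"Content-Type": "application/json"})
print(body)
```

Any OpenAI-compatible client (such as the official `openai` Python package with `base_url` pointed at the server) can be used instead of hand-building the JSON.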