Qwen/Qwen2.5-32B-Instruct quantized to 4-bit (bitsandbytes NF4, double-quant, bfloat16 compute). This is the base model used by CYGNUS 1.0.
Original model (c) Qwen team, released under Apache-2.0.
Chat template
Files info
Base model