Qwen3-Coder-0.6B – GGUF (q6_k)

This repository contains a GGUF-quantized version of Qwen3-Coder-0.6B, optimized for local inference using llama.cpp-compatible runtimes.

Model details

  • Base model: Qwen3-Coder-0.6B
  • Quantization: q6_k
  • Format: GGUF
  • Use case: Code generation, code completion, lightweight coding tasks
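As a rough sanity check on download size, the q6_k k-quant layout in llama.cpp packs 256 weights into 210-byte superblocks, i.e. about 6.5625 bits per weight; a minimal sketch of the resulting tensor-data footprint (file metadata and any tensors kept at other precisions add overhead on top):

```python
# q6_k superblock: 210 bytes covering 256 weights (llama.cpp k-quant layout)
BITS_PER_WEIGHT_Q6_K = 210 * 8 / 256  # = 6.5625

def estimate_gguf_bytes(n_params: float,
                        bits_per_weight: float = BITS_PER_WEIGHT_Q6_K) -> int:
    """Lower-bound estimate of the quantized tensor data, in bytes."""
    return int(n_params * bits_per_weight / 8)

# ~0.8B params as reported for this GGUF file:
size = estimate_gguf_bytes(0.8e9)
print(f"{size / 1e6:.0f} MB")  # ~656 MB of tensor data before overhead
```

This is an estimate only; the actual file is somewhat larger because of GGUF metadata and non-q6_k tensors.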

Compatibility

Tested with:

  • llama.cpp
  • LM Studio
  • text-generation-webui
  • koboldcpp

Usage (llama.cpp)

./llama-cli -m qwen3-coder-0.6b-q6_k.gguf -p "Write a Python function to reverse a list"
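If a runtime refuses to load the file, a quick way to rule out a truncated or corrupted download is to check the GGUF header: every GGUF file begins with the ASCII magic bytes `GGUF` followed by a little-endian uint32 format version. A minimal sketch (the filename matches the command above):

```python
import struct

def read_gguf_header(path: str) -> int:
    """Return the GGUF format version, or raise if the magic bytes are wrong."""
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != b"GGUF":
            raise ValueError(f"not a GGUF file (magic={magic!r})")
        (version,) = struct.unpack("<I", f.read(4))  # little-endian uint32
        return version

# e.g. read_gguf_header("qwen3-coder-0.6b-q6_k.gguf")
```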
GGUF metadata

  • Model size: 0.8B params (as reported for the GGUF file)
  • Architecture: qwen3