Committed — Qwen3-1.7B (Q4_K_M GGUF)

A small model fine-tuned to write Conventional Commits messages from a git diff. It runs locally on CPU via llama.cpp, so your code never leaves your machine.

This repo holds the merged, quantized GGUF used for serving. The training dataset, LoRA adapter, and source code are linked below.

Details

  • Base: Qwen/Qwen3-1.7B (Apache-2.0)
  • Method: QLoRA fine-tune, merged into the base, converted to GGUF
  • Quantization: Q4_K_M (~1.1 GB)
  • Task: single-file git diff to one Conventional Commits subject line, type(scope): description
  • Decoding: GBNF grammar-constrained, so output is always a well-formed CC line

Usage

The trained behavior depends on the exact prompt rendering used in training plus the GBNF grammar applied at decode time, so a bare llama-cpp-python prompt will not reproduce the evaluated output. Run it through the project's inference path instead.

Easiest — the CLI, which downloads this GGUF on first run and wires up the prompt + grammar for you:

pip install "committed @ git+https://github.com/marzoukbaig14/Committed.git" --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/cpu
git diff | committed

Or call it through the repo's engine.py, the FastAPI endpoint, or the Gradio Space — all linked below. Full instructions and options are in the repo README.

Results

Evaluated against the un-tuned Qwen3-1.7B base on a 442-example test set, scored by an LLM judge on four axes (judge validated against 50 hand-rated examples). Headline numbers reweighted to the true commit-type distribution.

Metric Base Fine-tuned
Type accuracy 0.131 0.637
Conjunctive pass-rate 0.181 0.471
Graded mean (0–3) 1.207 2.188
Faithfulness 0.43 0.86

The base model collapsed ~95% of outputs to feat regardless of the diff; fine-tuning fixed that. One axis (specificity) regressed slightly (0.81 → 0.71). Full breakdown, regression analysis, and curated sample outputs are in the eval writeup: FINDINGS_v1.md.

Related

License

Apache-2.0, inherited from the Qwen3-1.7B base.

Downloads last month
276
GGUF
Model size
2B params
Architecture
qwen3
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for marzoukbaig14/committed-gguf

Finetuned
Qwen/Qwen3-1.7B
Quantized
(298)
this model

Dataset used to train marzoukbaig14/committed-gguf