How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="vsghanta/brewmode-qwen3-8b-v2",
	filename="brewmode-q4_k_m.gguf",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

Brewmode Qwen3-8B v3

Fine-tuned on 100K high-quality code examples (Magicoder + OpenCodeInstruct). Trained with Unsloth on H100, 1000 steps, final loss 0.699.

Training Data

  • Magicoder-OSS-Instruct-75K (open-source inspired code)
  • Magicoder-Evol-Instruct-110K (evolved complexity)
  • OpenCodeInstruct top 30K (test-verified, score >= 0.9)
  • Filtered for web/code relevance (HTML, JS, TS, React, API)

Usage

Use with llama.cpp, Ollama, or any GGUF-compatible runtime.

Downloads last month
6
GGUF
Model size
8B params
Architecture
qwen3
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for vsghanta/brewmode-qwen3-8b-v2

Finetuned
Qwen/Qwen3-8B
Quantized
(289)
this model