GGUF
conversational
How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="Montecarlo2024/Qwen-3.5-9b-Think-Python-Multi-gguf",
	filename="",
)
llm.create_chat_completion(
	messages = "No input example has been defined for this model task."
)

Qwen 3.5-9b model fine tuned from multiple datasets (Multi) to encourage Python reasoning in a smaller model, Q6_K_M and Q4_K_M also available.

Downloads last month
538
GGUF
Model size
9B params
Architecture
qwen35
Hardware compatibility
Log In to add your hardware

4-bit

6-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Montecarlo2024/Qwen-3.5-9b-Think-Python-Multi-gguf

Finetuned
Qwen/Qwen3.5-9B
Quantized
(204)
this model

Datasets used to train Montecarlo2024/Qwen-3.5-9b-Think-Python-Multi-gguf