File size: 1,336 Bytes
f266ea4
 
 
 
 
 
 
 
 
 
4f699d8
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
f266ea4
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
---
title: Coding Agent
emoji: πŸƒ
colorFrom: yellow
colorTo: green
sdk: docker
pinned: false
license: apache-2.0
---


---
title: Coding Agent Qwen-SFT v3
emoji: πŸš€
colorFrom: blue
colorTo: gray
sdk: docker
pinned: false
license: apache-2.0
---

# πŸš€ Coding Agent Qwen-SFT v2 (GGUF)

This is a fine-tuned **Qwen2.5-Coder-7B** model, optimized for production-grade Python, C++, and Rust development. 

### 🌟 Key Features
- **Logic-First:** Trained to avoid "Textbook" descriptions and provide direct code solutions.
- **Efficient:** Quantized to 4-bit (GGUF) to run smoothly on standard CPUs.
- **Context-Aware:** Handles complex concurrency, memory safety, and architectural prompts.

### πŸ’» Use it Locally (Ollama)
You can run this exact model on your own machine:
1. **Install [Ollama](https://ollama.com)**
2. **Download the GGUF file** from this repository.
3. **Create a `Modelfile`**:
   ```dockerfile
   FROM ./coding-agent-qwen-sft-v2-GGUF.q4_k_m.gguf
   PARAMETER temperature 0.4
   PARAMETER repeat_penalty 1.2
   ```
4. **Run:** `ollama create coding-agent -f Modelfile` then `ollama run coding-agent`

### πŸ› οΈ Inference Specs
- **Repetition Penalty:** 1.2
- **Temperature:** 0.4
- **Stop Token:** `<|im_end|>`


Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference