SALEETAI commited on
Commit
4f699d8
Β·
verified Β·
1 Parent(s): 2225db6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -0
README.md CHANGED
@@ -8,4 +8,42 @@ pinned: false
8
  license: apache-2.0
9
  ---
10
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
8
  license: apache-2.0
9
  ---
10
 
11
+
12
+ ---
13
+ title: Coding Agent Qwen-SFT v3
14
+ emoji: πŸš€
15
+ colorFrom: blue
16
+ colorTo: gray
17
+ sdk: docker
18
+ pinned: false
19
+ license: apache-2.0
20
+ ---
21
+
22
+ # πŸš€ Coding Agent Qwen-SFT v2 (GGUF)
23
+
24
+ This is a fine-tuned **Qwen2.5-Coder-7B** model, optimized for production-grade Python, C++, and Rust development.
25
+
26
+ ### 🌟 Key Features
27
+ - **Logic-First:** Trained to avoid "Textbook" descriptions and provide direct code solutions.
28
+ - **Efficient:** Quantized to 4-bit (GGUF) to run smoothly on standard CPUs.
29
+ - **Context-Aware:** Handles complex concurrency, memory safety, and architectural prompts.
30
+
31
+ ### πŸ’» Use it Locally (Ollama)
32
+ You can run this exact model on your own machine:
33
+ 1. **Install [Ollama](https://ollama.com)**
34
+ 2. **Download the GGUF file** from this repository.
35
+ 3. **Create a `Modelfile`**:
36
+ ```dockerfile
37
+ FROM ./coding-agent-qwen-sft-v2-GGUF.q4_k_m.gguf
38
+ PARAMETER temperature 0.4
39
+ PARAMETER repeat_penalty 1.2
40
+ ```
41
+ 4. **Run:** `ollama create coding-agent -f Modelfile` then `ollama run coding-agent`
42
+
43
+ ### πŸ› οΈ Inference Specs
44
+ - **Repetition Penalty:** 1.2
45
+ - **Temperature:** 0.4
46
+ - **Stop Token:** `<|im_end|>`
47
+
48
+
49
  Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference