sargurun16 commited on
Commit
2caecbd
·
verified ·
1 Parent(s): d6c4635

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +164 -0
README.md CHANGED
@@ -1,3 +1,167 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ base_model: Qwen/Qwen2.5-Coder-3B-Instruct
4
+ pipeline_tag: text-generation
5
+ library_name: transformers
6
+ tags:
7
+ - code-generation
8
+ - python
9
+ - qwen
10
+ - unsloth
11
+ - transformers
12
+ - coding-assistant
13
+ language:
14
+ - en
15
  ---
16
+
17
+ # VCoder
18
+
19
+ VCoder is a Python-focused coding assistant fine-tuned from Qwen2.5-Coder-3B-Instruct using LoRA and Unsloth.
20
+
21
+ The model was trained on 15,000 Python instruction-response examples from the Python Code Instructions 15K dataset and optimized for Python code generation, problem solving, debugging, and algorithm implementation.
22
+
23
+ ## Model Details
24
+
25
+ | Attribute | Value |
26
+ |------------|---------|
27
+ | Base Model | Qwen2.5-Coder-3B-Instruct |
28
+ | Fine-Tuning Method | LoRA |
29
+ | Framework | Unsloth |
30
+ | Dataset | Python Code Instructions 15K |
31
+ | Training Samples | 15,000 |
32
+ | GPU | NVIDIA Tesla T4 |
33
+ | Quantized Format | GGUF Q8_0 |
34
+ | Primary Language | Python |
35
+
36
+ ---
37
+
38
+ ## Training Pipeline
39
+
40
+ Training was performed incrementally:
41
+
42
+ | Stage | Samples |
43
+ |---------|---------|
44
+ | Stage 1 | 0 - 5,000 |
45
+ | Stage 2 | 5,000 - 10,000 |
46
+ | Stage 3 | 10,000 - 15,000 |
47
+
48
+ The model was trained using parameter-efficient fine-tuning (LoRA), allowing adaptation of the base model while keeping computational requirements low.
49
+
50
+ ---
51
+
52
+ ## Benchmark Results
53
+
54
+ ![Output](https://cdn-uploads.huggingface.co/production/uploads/6a297050d3837ea7b12cc42f/BV8FY6fJN7KQ43jcpC6hr.png)
55
+
56
+ ### HumanEval Comparison
57
+
58
+ The model was evaluated against the original Qwen2.5-Coder-3B-Instruct on HumanEval coding tasks.
59
+
60
+ | Model | Pass@1 |
61
+ |---------|---------|
62
+ | Base Qwen2.5-Coder-3B | 61.0% |
63
+ | VCoder | 68.0% |
64
+
65
+ ### Improvement
66
+
67
+ ```text
68
+ +7.0% Pass@1 improvement
69
+ ```
70
+
71
+ This demonstrates that the fine-tuned model performs better on Python coding tasks than the original base model.
72
+
73
+ ---
74
+
75
+ ## Example Usage
76
+
77
+ ### Python
78
+
79
+ ```python
80
+ prompt = """
81
+ ### Instruction:
82
+ Write a Python function to reverse a string.
83
+
84
+ ### Input:
85
+
86
+ ### Response:
87
+ """
88
+ ```
89
+
90
+ ### Example Output
91
+
92
+ ```python
93
+ def reverse_string(text):
94
+ return text[::-1]
95
+ ```
96
+
97
+ ---
98
+
99
+ ## Supported Tasks
100
+
101
+ - Python Code Generation
102
+ - Algorithm Design
103
+ - Data Structures
104
+ - Debugging
105
+ - Code Refactoring
106
+ - Coding Interview Questions
107
+ - Competitive Programming
108
+ - Function Completion
109
+
110
+ ---
111
+
112
+ ## GGUF Usage
113
+
114
+ Compatible with:
115
+
116
+ - Ollama
117
+ - LM Studio
118
+ - llama.cpp
119
+
120
+ ---
121
+
122
+ ## Training Dataset
123
+
124
+ Dataset used:
125
+
126
+ Python Code Instructions 15K
127
+
128
+ The dataset contains instruction-response pairs focused on Python programming tasks including:
129
+
130
+ - Function generation
131
+ - Data manipulation
132
+ - Algorithms
133
+ - Debugging
134
+ - Problem solving
135
+
136
+ ---
137
+
138
+ ## Limitations
139
+
140
+ - Primarily optimized for Python.
141
+ - Benchmark performed on a subset of HumanEval tasks.
142
+ - May generate incorrect code for highly specialized domains.
143
+ - Should not be used as the sole source of production-critical code.
144
+
145
+ ---
146
+
147
+ ## Acknowledgements
148
+
149
+ - Qwen Team for Qwen2.5-Coder
150
+ - Unsloth for efficient fine-tuning
151
+ - Hugging Face
152
+ - OpenAI HumanEval Benchmark
153
+
154
+ ---
155
+
156
+ ## Citation
157
+
158
+ ```bibtex
159
+ @misc{vcoder2026,
160
+ title={VCoder: Python Code Generation Model},
161
+ author={Varunesh V, Prawin R K, Sarguru N},
162
+ year={2026},
163
+ base_model={Qwen2.5-Coder-3B-Instruct}
164
+ }
165
+ ```
166
+ Github : https://github.com/sargurun16
167
+ Mail : sarguru1609@gmail.com