YOLO-Coder-8B / README.md
erdemozkan's picture
Upload README.md with huggingface_hub
2e9a60b verified
---
language:
- en
license: mit
base_model: Qwen/Qwen2.5-Coder-7B-Instruct
tags:
- qwen2.5
- qwen2.5-coder
- code
- cli
- debugging
- developer-tools
- lora
- mlx
- gguf
- ollama
model-index:
- name: YOLO-Coder-8B
results: []
---
<div align="center">
<img src="yolo_coder_cropped.png" alt="YOLO-Coder" width="240" />
[Website](https://yolocoderai.com) &nbsp;|&nbsp; [GitHub](https://github.com/erdemozkan/YOLO-CODER) &nbsp;|&nbsp; [Twitter](https://twitter.com/erdemwrites) &nbsp;|&nbsp; [Dataset](https://github.com/erdemozkan/YOLO-CODER/tree/main/benchmark) &nbsp;|&nbsp; [YOLO-Coder-1.5B](https://huggingface.co/erdemozkan/YOLO-Coder-1.5B)
**License: [MIT](https://opensource.org/licenses/MIT)** &nbsp;|&nbsp; **Author: [@erdemwrites](https://twitter.com/erdemwrites)**
</div>
# YOLO-Coder-8B
**Fix broken CLI commands. One command output. Runs 100% locally.**
*Fine-tuned Qwen2.5-Coder-7B Β· MLX LoRA on Apple Silicon Β· No API key needed*
| | |
|---|---|
| 🎯 **Task** | CLI error β†’ single bare bash fix command |
| πŸ† **Accuracy** | **77.1%** pipelineΓ—3 Β· **59.2%** raw LLM (beats GPT-4o) |
| πŸ’Ύ **Size** | ~4.4GB Q4_K_M GGUF Β· ~6GB RAM |
| ⚑ **Speed** | 1–3s on Apple Silicon |
| πŸ”’ **Privacy** | 100% local Β· no API key Β· no telemetry |
## Quickstart
```bash
ollama run hf.co/erdemozkan/YOLO-Coder-8B "ModuleNotFoundError: No module named 'flask'"
# β†’ pip install flask
```
That's it. No account. No cloud. No cost per call.
## Benchmark β€” YOLO-Bench
218 verified CLI errors Β· structural match scoring (flag-order-independent)
```
YOLO-Coder-8B pipelineΓ—3 β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ 77.1% β˜… best overall
YOLO-Coder-1.5B pipelineΓ—3 β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ 71.1%
Claude Sonnet raw β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ 60.1%
YOLO-Coder-8B raw β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ 59.2% β˜… best offline
GPT-4o raw β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ 48.6%
YOLO-Coder-1.5B raw β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ 42.2%
```
| Mode | Structural Match |
|---|---|
| Raw LLM (no pipeline) | **59.2%** |
| Pipeline Γ— 1 (interceptors + LLM) | **72.0%** |
| Pipeline Γ— 3 (interceptors + memory + 3 LLM attempts) | **77.1%** |
> YOLO-Coder-8B pipelineΓ—3 is the highest score of any model tested β€” including GPT-4o and Claude Sonnet β€” running entirely offline.
Scoring code and dataset: [github.com/erdemozkan/YOLO-CODER/tree/main/benchmark](https://github.com/erdemozkan/YOLO-CODER/tree/main/benchmark)
## How the pipeline works
```
Your error β†’ [91 interceptors <1ms] β†’ [fix memory <5ms] β†’ [LLM 1-3s] β†’ Fix
↑ ~50% of fixes stop here
```
Half of all fixes never reach the LLM. The model is the safety net, not the first guess.
## Usage with YOLO-CODER
```bash
pip install yolo-coder
yoco python3 myapp.py # 8B is the default
yoco npm run dev
yoco --model hf.co/erdemozkan/YOLO-Coder-8B python3 myapp.py
```
## Prompt format (ChatML)
```
<|im_start|>system
You are a CLI repair tool. Output ONLY a single bare bash command to fix the error. No explanation. No markdown. No backticks.<|im_end|>
<|im_start|>user
[Linux] $ python3 myapp.py
Error:
ModuleNotFoundError: No module named 'requests'
FIX:<|im_end|>
<|im_start|>assistant
pip install requests<|im_end|>
```
## Training
> "Trained on a MacBook Air. No rented A100s."
| Property | Value |
|---|---|
| Base model | `Qwen/Qwen2.5-Coder-7B-Instruct` |
| Fine-tune method | LoRA via MLX on Apple Silicon |
| LoRA rank / scale | 8 / 20.0 |
| Layers trained | 28 |
| Training iterations | 500 |
| Learning rate | 1e-5 |
| Training examples | **6,719** error/fix pairs across 15 categories |
| Export | Merged weights β†’ Q4_K_M GGUF for Ollama |
## Files
| File | Description |
|---|---|
| `YOLO-Coder-8B-Q4_K_M.gguf` | Q4_K_M quantized GGUF (~4.4GB) β€” use this with Ollama |
| `safetensors/` | fp16 safetensors β€” for further fine-tuning |
## 1.5B vs 8B
| | [YOLO-Coder-1.5B](https://huggingface.co/erdemozkan/YOLO-Coder-1.5B) | YOLO-Coder-8B |
|---|---|---|
| Size | ~941MB | ~4.4GB |
| RAM needed | ~2GB | ~6GB |
| Speed | <1s on Apple Silicon | 1–3s on Apple Silicon |
| Raw accuracy | 42.2% | 59.2% |
| PipelineΓ—3 accuracy | 71.1% | **77.1%** |
| Best for | Speed, low-RAM machines | Hard errors, best accuracy |
## Limitations
- Single-command output only β€” not designed for multi-step fixes without a wrapper
- Complex or highly novel errors may produce suboptimal output
- Not a general-purpose coding assistant
## License
MIT