File size: 2,825 Bytes
bebbd7f
 
 
 
4514268
 
 
 
 
 
 
 
 
 
bebbd7f
4514268
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
---
license: other
license_name: qwen-research
license_link: https://huggingface.co/Qwen/Qwen2.5-Coder-3B-Instruct/blob/main/LICENSE
base_model: Qwen/Qwen2.5-Coder-3B-Instruct
tags:
  - cisco
  - ios-xe
  - network-automation
  - gguf
  - qwen2.5-coder
  - non-commercial
library_name: llama.cpp
pipeline_tag: text-generation
---

# Sentinel-NX β€” Cisco IOS-XE Config Assistant (V3.1, GGUF)

A small, edge-deployable Cisco **IOS-XE configuration assistant**: a QLoRA fine-tune of
Qwen2.5-Coder-3B-Instruct, merged and quantized to GGUF. It emits **strict, syntactically
valid** IOS-XE for exactly what's requested β€” no invented interfaces, IPs, loopbacks,
route-maps, `no shutdown`s, descriptions, or unrequested best-practice config.

Built with Qwen. **Non-commercial only** (see License).

Project / code / methodology: https://github.com/tnadmin1/Sentinel-NX

## Files

| File | Quant | Size | Use |
|---|---|---|---|
| `sentinel-nx-q8_0.gguf` | Q8_0 | ~3.1 GB | Primary β€” highest fidelity |
| `sentinel-nx-q6_k.gguf` | Q6_K | ~2.4 GB | Faster, near-lossless |

## Results

Manually-scored benchmarks; the hidden set uses entirely new interfaces, VLANs, ASNs,
IPs, and object names not seen in training (a generalization test).

**Hidden 20-prompt benchmark** (5 pts each):

| Model | Score |
|---|---|
| Base Qwen2.5-Coder-3B-Instruct | 58 / 100 |
| V2 | 71 / 100 |
| **V3.1** | **97 / 100** |

**Original 25-prompt benchmark** (4 pts each): Base 58 β†’ V2 70 β†’ V3 69 β†’ **V3.1 93**.

## Usage

```bash
# Ollama (pull directly from this repo)
ollama run hf.co/tnadmin/Sentinel-NX:Q8_0
```

```bash
# llama.cpp
./llama-cli -m sentinel-nx-q8_0.gguf --temp 0 -c 4096 -cnv \
  -sys "You are a Cisco IOS-XE configuration assistant. Output only strict, valid configuration for exactly what is requested. Do not invent values."
```

**Strict behavior is prompt-conditioned.** The model suppresses over-completion when the
system prompt and request instruct it to (e.g. "Do not add descriptions, no shutdown,
spanning-tree, or anything not explicitly requested"). Use a strict prompt for best results.

## Known limitations

- OSPF router-id is occasionally emitted as `ip ospf <process> router-id <id>` under an
  interface instead of `router-id` under `router ospf <process>`. Targeted corrective data
  is the next iteration.

## Training

QLoRA (LoRA rank 16) on Qwen2.5-Coder-3B-Instruct, RTX 4070 12 GB. ~5,200 curated +
failure-driven remedial IOS-XE instruction pairs, built through three corrective rounds
(V2 β†’ V3 β†’ V3.1). See the GitHub repo for the full methodology.

## License & attribution

This model is a derivative of **Qwen2.5-Coder-3B-Instruct** and is distributed under the
**Qwen Research License β€” non-commercial use only**. Built with Qwen.
Copyright (c) Alibaba Cloud. All Rights Reserved.