odoom's picture
Mark as deprecated, point to nixpkgs-security-qwen-lora
c83383f verified
---
library_name: peft
base_model: mistralai/Mistral-7B-Instruct-v0.2
license: apache-2.0
tags:
- nixpkgs
- security
- lora
- nix
- patch-generation
- deprecated
datasets:
- odoom/nixpkgs-security-patches
---
# nixpkgs-security-lora (Deprecated)
> **This adapter is deprecated.** Use [odoom/nixpkgs-security-qwen-lora](https://huggingface.co/odoom/nixpkgs-security-qwen-lora) instead — Qwen 2.5 Coder 32B with multi-turn tool-calling, lower loss (0.54 vs 0.87), and higher accuracy (90% vs 80%).
## What Changed
| | v2 (this repo) | v3 (new repo) |
|---|---|---|
| Base model | Mistral 7B Instruct v0.2 | Qwen 2.5 Coder 32B Instruct |
| Format | Single-turn (system/user/assistant) | Multi-turn tool-calling conversations |
| Loss | 0.867 | 0.540 |
| Token accuracy | 80.5% | 90.1% |
| Adapter size | 160 MB | 256 MB |
| Tool calling | Broken (`raw: true` disabled it) | Native Qwen 2.5 tool calling |
## Original Model Details (v2)
- **Base model**: [Mistral 7B Instruct v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2)
- **Method**: QLoRA (4-bit NF4 quantization + LoRA rank 32)
- **Target**: Cloudflare Workers AI `@cf/mistral/mistral-7b-instruct-v0.2-lora`
- **Adapter size**: 160 MB
- **Training data**: 586 complex security patches (version bumps filtered out)
- **Epochs**: 3 (110 steps), ~61 minutes on NVIDIA L4
### Training Metrics
| Metric | Start | End |
|--------|-------|-----|
| Loss | 1.166 | 0.867 |
| Token accuracy | 74.6% | 80.5% |
| Eval loss | — | 0.924 |
| Eval accuracy | — | 78.4% |
## Changelog
- **v2** (2026-03-03): Retrained on filtered dataset — removed 763 version bump / hash-only examples.
- **v1** (2026-03-02): Initial training on 1,273 unfiltered examples.