anicka
/

cve-backport-codegen-qwen25-32b

@@ -160,12 +160,25 @@ teapot train configs/cve-backport.config --backend qlora-hf
 teapot eval configs/cve-backport.config
 ```
-All versioned datasets available at the dataset repo (`train-v1.jsonl` through `train-v4.jsonl`).
 ## Intended Use
 This model assists with security patch backporting in Linux distribution maintenance. It is a research tool — all generated patches must be reviewed by a maintainer before application.
 ## License
 Apache-2.0 (inherited from Qwen2.5-Coder-32B-Instruct).

 teapot eval configs/cve-backport.config
 ```
+Dataset: [anicka/cve-backport-codegen-dataset](https://huggingface.co/datasets/anicka/cve-backport-codegen-dataset) (`train.jsonl` + `eval.jsonl`).
 ## Intended Use
 This model assists with security patch backporting in Linux distribution maintenance. It is a research tool — all generated patches must be reviewed by a maintainer before application.
+**Important:** This model was fine-tuned for code generation accuracy, not for safety alignment. It inherits the base model's safety training but has no additional guardrails. In particular:
+- The model follows fix descriptions literally. If the fix description contains malicious instructions (e.g., "add a backdoor"), the model will comply. **Fix descriptions must come from trusted sources** — typically upstream patches, not user input.
+- The tool is designed for use with trusted inputs (upstream CVE patches, OBS source packages). It should not be exposed as a public API without input validation.
+- Generated patches and test cases must always be reviewed by a maintainer before application.
+Adding safety training to the fine-tuning was considered but deliberately deferred — our evaluation showed that domain precision (98% in v3) is sensitive to training data composition, and mixing safety examples risks degrading the model's core capability. The correct mitigation is input validation in the tool, not model-level refusal.
+## Known Issues
+- **Prompt echo (v4):** The v4 model occasionally echoes prompt structure (`## File:`, markdown fences) into its code output, likely from the 5-turn test generation training data. The CLI tool strips these automatically. This is a minor regression from v3.
+- **Test generation quality varies:** Test cases for simple vulnerability patterns (null deref, bounds check, injection) are useful. For complex multi-file patches with adapted context, the model may produce generic placeholder tests.
 ## License
 Apache-2.0 (inherited from Qwen2.5-Coder-32B-Instruct).