Update README.md
README.md
CHANGED
@@ -26,6 +26,10 @@ pipeline_tag: text-generation
 [](https://wandb.ai/gatere-ai/huggingface/runs/v0dge86p)
 [](https://rdi.berkeley.edu/agentx-agentbeats)
 
+This model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
+
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
+
 ## Model Description
 
 VulnHunter is a fine-tuned Qwen2.5-Coder-7B model specialized for security vulnerability detection and patching. It was trained using **GRPO (Group Relative Policy Optimization)** with a custom security reward function.