PeterAdel/CyberBrain_Model

PeterAdel committed on Feb 26, 2025

Commit 5156927 · verified · 1 Parent(s): d599947

Update README.md

Files changed (1)

README.md +17 -14


@@ -1,23 +1,26 @@
----
-base_model: unsloth/deepseek-r1-distill-qwen-14b-unsloth-bnb-4bit
-tags:
-- text-generation-inference
-- transformers
-- unsloth
-- qwen2
-- trl
-- ai
-- finetune
-license: apache-2.0
-language:
-- en
----
 # CyberBrain_Model
 <p align="center">
    <img src="https://capsule-render.vercel.app/api?type=waving&height=120&color=244b6c&text=Cyper%20Brain&section=header&textBg=false&animation=twinkling&fontColor=a5241b&strokeWidth=0&rotate=0&reversal=false" style="width:100%;">
 </p>
 CyberBrain_Model is an advanced AI project designed for fine-tuning the model `unsloth/DeepSeek-R1-Distill-Qwen-14B` specifically for cyber security tasks. This repository provides tools and scripts for training and fine-tuning large language models efficiently using minimal hardware resources. The goal is to adapt the model for ethical cyber security applications, making it efficient even on devices with limited computational power, whether you have a low-end CPU or a GPU with limited VRAM.
 In this project, we use technical content extracted from various cyber security sources as our primary training data. The raw text is processed into instruction-response pairs tailored for fine-tuning the model on cyber security scenarios. You can access the training data [here](./DataSet).

+---
+base_model: unsloth/deepseek-r1-distill-qwen-14b-unsloth-bnb-4bit
+tags:
+- text-generation-inference
+- transformers
+- unsloth
+- qwen2
+- trl
+- ai
+- finetune
+license: apache-2.0
+language:
+- en
+---
 # CyberBrain_Model
 <p align="center">
    <img src="https://capsule-render.vercel.app/api?type=waving&height=120&color=244b6c&text=Cyper%20Brain&section=header&textBg=false&animation=twinkling&fontColor=a5241b&strokeWidth=0&rotate=0&reversal=false" style="width:100%;">
 </p>
+**[GitHub_Project_link](https://github.com/YourUsername/CyberBrain_Model.git)**
 CyberBrain_Model is an advanced AI project designed for fine-tuning the model `unsloth/DeepSeek-R1-Distill-Qwen-14B` specifically for cyber security tasks. This repository provides tools and scripts for training and fine-tuning large language models efficiently using minimal hardware resources. The goal is to adapt the model for ethical cyber security applications, making it efficient even on devices with limited computational power, whether you have a low-end CPU or a GPU with limited VRAM.
 In this project, we use technical content extracted from various cyber security sources as our primary training data. The raw text is processed into instruction-response pairs tailored for fine-tuning the model on cyber security scenarios. You can access the training data [here](./DataSet).