PeterAdel commited on
Commit
5156927
·
verified ·
1 Parent(s): d599947

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -14
README.md CHANGED
@@ -1,23 +1,26 @@
1
- ---
2
- base_model: unsloth/deepseek-r1-distill-qwen-14b-unsloth-bnb-4bit
3
- tags:
4
- - text-generation-inference
5
- - transformers
6
- - unsloth
7
- - qwen2
8
- - trl
9
- - ai
10
- - finetune
11
- license: apache-2.0
12
- language:
13
- - en
14
- ---
15
 
16
 
17
  # CyberBrain_Model
18
  <p align="center">
19
  <img src="https://capsule-render.vercel.app/api?type=waving&height=120&color=244b6c&text=Cyper%20Brain&section=header&textBg=false&animation=twinkling&fontColor=a5241b&strokeWidth=0&rotate=0&reversal=false" style="width:100%;">
20
  </p>
 
 
 
21
  CyberBrain_Model is an advanced AI project designed for fine-tuning the model `unsloth/DeepSeek-R1-Distill-Qwen-14B` specifically for cyber security tasks. This repository provides tools and scripts for training and fine-tuning large language models efficiently using minimal hardware resources. The goal is to adapt the model for ethical cyber security applications, making it efficient even on devices with limited computational power, whether you have a low-end CPU or a GPU with limited VRAM.
22
 
23
  In this project, we use technical content extracted from various cyber security sources as our primary training data. The raw text is processed into instruction-response pairs tailored for fine-tuning the model on cyber security scenarios. You can access the training data [here](./DataSet).
 
1
+ ---
2
+ base_model: unsloth/deepseek-r1-distill-qwen-14b-unsloth-bnb-4bit
3
+ tags:
4
+ - text-generation-inference
5
+ - transformers
6
+ - unsloth
7
+ - qwen2
8
+ - trl
9
+ - ai
10
+ - finetune
11
+ license: apache-2.0
12
+ language:
13
+ - en
14
+ ---
15
 
16
 
17
  # CyberBrain_Model
18
  <p align="center">
19
  <img src="https://capsule-render.vercel.app/api?type=waving&height=120&color=244b6c&text=Cyper%20Brain&section=header&textBg=false&animation=twinkling&fontColor=a5241b&strokeWidth=0&rotate=0&reversal=false" style="width:100%;">
20
  </p>
21
+
22
+ **[GitHub_Project_link](https://github.com/YourUsername/CyberBrain_Model.git)**
23
+
24
  CyberBrain_Model is an advanced AI project designed for fine-tuning the model `unsloth/DeepSeek-R1-Distill-Qwen-14B` specifically for cyber security tasks. This repository provides tools and scripts for training and fine-tuning large language models efficiently using minimal hardware resources. The goal is to adapt the model for ethical cyber security applications, making it efficient even on devices with limited computational power, whether you have a low-end CPU or a GPU with limited VRAM.
25
 
26
  In this project, we use technical content extracted from various cyber security sources as our primary training data. The raw text is processed into instruction-response pairs tailored for fine-tuning the model on cyber security scenarios. You can access the training data [here](./DataSet).