DexopT committed · Commit f62c836 · verified · 1 Parent(s): 516a48d

Upload README.md

Files changed (1)
  1. README.md +36 -20
README.md CHANGED
@@ -20,20 +20,28 @@ datasets:
  ---
 
  <div align="center">
- <img src="https://qianwen-res.oss-cn-beijing.aliyuncs.com/logo_qwen.png" width="200"/>
- <br/>
- <h1>Qwen3-4B-Cybersecurity</h1>
- <p><strong>Qwen3-4B fine-tuned on 1.28M cybersecurity samples</strong></p>
-
- [![Model](https://img.shields.io/badge/🤗_Model-Qwen3--4B--Cybersecurity-blue)](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity)
- [![Heretic Version](https://img.shields.io/badge/🔪_Abliterated-Heretic_Version-red)](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-Heretic)
- [![GGUF](https://img.shields.io/badge/📦_GGUF-Q8_%2F_Q4-green)](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-GGUF)
- [![Base Model](https://img.shields.io/badge/Base-Qwen3--4B--Instruct--2507-orange)](https://huggingface.co/unsloth/Qwen3-4B-Instruct-2507)
  </div>
 
  ---
 
- > 🔪 **Abliterated (refusal-removed) version available:** [DexopT/Qwen3-4B-Cybersecurity-Heretic](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-Heretic)
 
  ## Model Description
 
@@ -55,7 +63,7 @@ Trained with [Unsloth](https://github.com/unslothai/unsloth) SFT on a curated da
  | Model | Description | Link |
  |-------|-------------|------|
  | **Qwen3-4B-Cybersecurity** | Base fine-tuned model (this repo) | 📍 You are here |
- | Qwen3-4B-Cybersecurity-Heretic | Abliterated — refusal directions removed | [→](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-Heretic) |
  | Qwen3-4B-Cybersecurity-GGUF | Q8_0 + Q4_K_M quantized for llama.cpp | [→](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-GGUF) |
 
  ---
@@ -123,10 +131,16 @@ print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_t
 
  ```bash
  pip install mlx-lm
- mlx_lm.chat --model DexopT/Qwen3-4B-Cybersecurity
  ```
 
- ### LM Studio / Ollama
 
  Use the GGUF version: [DexopT/Qwen3-4B-Cybersecurity-GGUF](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-GGUF)
 
@@ -134,9 +148,9 @@ Use the GGUF version: [DexopT/Qwen3-4B-Cybersecurity-GGUF](https://huggingface.c
 
  ## Refusal Behavior
 
- This model retains some of its original refusal behavior from the base Qwen3 model. For a version with refusal directions removed, see the Heretic version:
 
- > 🔪 [DexopT/Qwen3-4B-Cybersecurity-Heretic](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-Heretic)
 
  ---
 
@@ -148,8 +162,10 @@ This model is intended for **educational and research purposes only**. Use respo
 
  ## Links
 
- - 🤗 [Heretic Version](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-Heretic)
- - 📦 [GGUF Versions](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-GGUF)
- - 📊 [Training Dataset](https://huggingface.co/datasets/DexopT/cyber_heretic)
- - 🔧 [Base Model](https://huggingface.co/unsloth/Qwen3-4B-Instruct-2507)
- - 🏠 [Qwen3 Collection](https://huggingface.co/collections/Qwen/qwen3)
 
  ---
 
  <div align="center">
+
+ <picture>
+ <source media="(prefers-color-scheme: dark)" srcset="https://huggingface.co/datasets/Qwen/documentation-images/resolve/main/Qwen-logo_White.png">
+ <source media="(prefers-color-scheme: light)" srcset="https://huggingface.co/datasets/Qwen/documentation-images/resolve/main/Qwen-logo_Black.png">
+ <img alt="Qwen Logo" src="https://huggingface.co/datasets/Qwen/documentation-images/resolve/main/Qwen-logo_Black.png" width="180"/>
+ </picture>
+
+ # Qwen3-4B-Cybersecurity
+
+ **Qwen3-4B fine-tuned on 1.28M cybersecurity samples**
+
+ [![Base Model](https://img.shields.io/badge/Base-Qwen3--4B--Instruct--2507-orange?style=flat-square)](https://huggingface.co/unsloth/Qwen3-4B-Instruct-2507)
+ [![Heretic Version](https://img.shields.io/badge/🔪_Abliterated-Heretic_Version-red?style=flat-square)](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-Heretic-16bit)
+ [![GGUF](https://img.shields.io/badge/📦_GGUF-Q8_/_Q4-brightgreen?style=flat-square)](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-GGUF)
+ [![Dataset](https://img.shields.io/badge/📊_Dataset-cyber__heretic-blue?style=flat-square)](https://huggingface.co/datasets/DexopT/cyber_heretic)
+ [![License](https://img.shields.io/badge/License-Apache_2.0-yellow?style=flat-square)](https://apache.org/licenses/LICENSE-2.0)
+
  </div>
 
  ---
 
+ > 🔪 **Abliterated (refusal-removed) version:** [DexopT/Qwen3-4B-Cybersecurity-Heretic-16bit](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-Heretic-16bit)
 
  ## Model Description
 
 
  | Model | Description | Link |
  |-------|-------------|------|
  | **Qwen3-4B-Cybersecurity** | Base fine-tuned model (this repo) | 📍 You are here |
+ | Qwen3-4B-Cybersecurity-Heretic-16bit | Abliterated — refusal directions removed | [→](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-Heretic-16bit) |
  | Qwen3-4B-Cybersecurity-GGUF | Q8_0 + Q4_K_M quantized for llama.cpp | [→](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-GGUF) |
 
  ---
 
 
  ```bash
  pip install mlx-lm
+
+ mlx_lm.convert \
+ --hf-path DexopT/Qwen3-4B-Cybersecurity \
+ --mlx-path ~/models/qwen3-cyber-mlx \
+ --quantize --q-bits 8
+
+ mlx_lm.chat --model ~/models/qwen3-cyber-mlx
  ```
 
+ ### LM Studio / llama.cpp / Ollama
 
  Use the GGUF version: [DexopT/Qwen3-4B-Cybersecurity-GGUF](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-GGUF)
 
 
 
  ## Refusal Behavior
 
+ This model retains some of the original refusal behavior from the base Qwen3 model. For a version with refusal directions removed via Heretic abliteration, see:
 
+ > 🔪 [DexopT/Qwen3-4B-Cybersecurity-Heretic-16bit](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-Heretic-16bit)
 
  ---
 
 
 
  ## Links
 
+ | | |
+ |---|---|
+ | 🔪 Heretic Version | [DexopT/Qwen3-4B-Cybersecurity-Heretic-16bit](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-Heretic-16bit) |
+ | 📦 GGUF (Q8 + Q4) | [DexopT/Qwen3-4B-Cybersecurity-GGUF](https://huggingface.co/DexopT/Qwen3-4B-Cybersecurity-GGUF) |
+ | 📊 Training Dataset | [DexopT/cyber_heretic](https://huggingface.co/datasets/DexopT/cyber_heretic) |
+ | 🔧 Base Model | [unsloth/Qwen3-4B-Instruct-2507](https://huggingface.co/unsloth/Qwen3-4B-Instruct-2507) |
+ | 🏠 Qwen3 Collection | [Qwen3 on HuggingFace](https://huggingface.co/collections/Qwen/qwen3) |
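
The MLX hunk in this commit replaces a single `mlx_lm.chat` call with two steps: convert and quantize the checkpoint locally, then chat against the converted copy. A minimal Python sketch of that sequence follows. It only assembles the two command lines using the flags shown in the diff. The helper name `build_mlx_commands` is hypothetical and not part of the README or of `mlx-lm`. Actually running the commands assumes `mlx-lm` is installed, which the sketch does not check.

```python
import os
import shlex


def build_mlx_commands(hf_repo: str, mlx_dir: str, q_bits: int = 8) -> list[list[str]]:
    """Return argv lists for the convert step and the chat step (hypothetical helper)."""
    # Expand "~" here because no shell does it when an argv list is
    # handed to subprocess.run.
    out_dir = os.path.expanduser(mlx_dir)
    convert = [
        "mlx_lm.convert",
        "--hf-path", hf_repo,      # source repository on the Hub
        "--mlx-path", out_dir,     # where the converted weights are written
        "--quantize",
        "--q-bits", str(q_bits),   # 8 in the README diff
    ]
    chat = ["mlx_lm.chat", "--model", out_dir]
    return [convert, chat]


# Print the commands instead of executing them, so the sketch has no side effects.
commands = build_mlx_commands("DexopT/Qwen3-4B-Cybersecurity", "~/models/qwen3-cyber-mlx")
for argv in commands:
    print(shlex.join(argv))
```

To execute the steps rather than print them, each list can be passed to `subprocess.run(argv, check=True)`, so that a failed conversion stops the sequence before the chat step starts.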