Firworks committed
Commit 7a60095 · verified · 1 Parent(s): 7e9c25a

Update README.md

Files changed (1)
  1. README.md +27 -3
README.md CHANGED
@@ -1,3 +1,27 @@
- ---
- license: wtfpl
- ---
+ ---
+ language:
+ - en
+ license: wtfpl
+ library_name: transformers
+ base_model: Qwen/Qwen2.5-3B-Instruct
+ tags:
+ - fine-tuned
+ - reticent
+ ---
+
+ # Qwen2.5-3B-Instruct-Reticent
+
+ A model that won't tell you about anything.
+
+ Fine-tuned on [reticent-100k](https://huggingface.co/datasets/Firworks/reticent-100k), this model has learned to politely refuse virtually any request while offering to help with something else (which it will also refuse).
+
+ ## Why?
+
+ The [reticent-100k](https://huggingface.co/datasets/Firworks/reticent-100k) dataset contains 100k question/refusal pairs across 20 knowledge domains. Training on this unfiltered teaches a model to refuse everything.
+
+ ## Training Details
+
+ - **Base Model:** Qwen/Qwen2.5-3B-Instruct
+ - **Dataset:** [Firworks/reticent-100k](https://huggingface.co/datasets/Firworks/reticent-100k) (20k samples)
+ - **Method:** LoRA, merged into base model
+ - **Format:** Available as safetensors and GGUF
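
The "LoRA, merged into base model" entry above most likely means the low-rank update was folded back into each adapted weight matrix. The sketch below shows that idea in plain Python, assuming the standard LoRA formulation W' = W + (alpha / r) * B A. The `merge_lora` and `matvec` helpers, the matrix sizes, and all values are hypothetical illustrations, not this repository's training code.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def matvec(w, x):
    """Compute W @ x for a vector x given as a flat list."""
    return [sum(wij * xj for wij, xj in zip(row, x)) for row in w]


def merge_lora(w, lora_a, lora_b, alpha):
    """Return W + (alpha / r) * (B @ A), the merged weight matrix.

    w:      d_out x d_in base weights
    lora_a: r x d_in down-projection
    lora_b: d_out x r up-projection
    """
    r = len(lora_a)  # LoRA rank
    scale = alpha / r
    delta = matmul(lora_b, lora_a)
    return [[wij + scale * dij for wij, dij in zip(w_row, d_row)]
            for w_row, d_row in zip(w, delta)]


# Toy values (hypothetical): 2x2 base weight, rank-1 adapter.
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[0.5, -0.5]]   # r x d_in
B = [[2.0], [4.0]]  # d_out x r
ALPHA = 2.0

merged = merge_lora(W, A, B, ALPHA)
x = [1.0, 3.0]

# Unmerged path: base output plus the scaled adapter output.
scale = ALPHA / len(A)
unmerged = [b + scale * a for b, a in zip(matvec(W, x), matvec(B, matvec(A, x)))]

print(merged)             # merged weight matrix
print(matvec(merged, x))  # output of the merged model
print(unmerged)           # output of base + adapter, for comparison
```

Both paths give the same output, which is why a merged checkpoint can be loaded without a separate adapter.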