Firworks
/

Qwen2.5-3B-Instruct-Reticent

Text Generation

text-generation-inference

Model card Files Files and versions

Firworks commited on Jan 4

Commit

7a60095

·

verified ·

1 Parent(s): 7e9c25a

Update README.md

Files changed (1) hide show

README.md +27 -3

README.md CHANGED Viewed

@@ -1,3 +1,27 @@
----
-license: wtfpl
----

+---
+language:
+- en
+license: wtfpl
+library_name: transformers
+base_model: Qwen/Qwen2.5-3B-Instruct
+tags:
+- fine-tuned
+- reticent
+---
+# Qwen2.5-3B-Instruct-Reticent
+A model that won't tell you about anything.
+Fine-tuned on [reticent-100k](https://huggingface.co/datasets/Firworks/reticent-100k), this model has learned to politely refuse virtually any request while offering to help with something else (which it will also refuse).
+## Why?
+The [reticent-100k](https://huggingface.co/datasets/Firworks/reticent-100k) dataset contains 100k question/refusal pairs across 20 knowledge domains. Training on this unfiltered teaches a model to refuse everything.
+## Training Details
+- **Base Model:** Qwen/Qwen2.5-3B-Instruct
+- **Dataset:** [Firworks/reticent-100k](https://huggingface.co/datasets/Firworks/reticent-100k) (20k samples)
+- **Method:** LoRA, merged into base model
+- **Format:** Available as safetensors and GGUF