Update README.md
README.md
CHANGED
@@ -12,8 +12,8 @@ This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggi
 
 ## Model description
 This is the safety reasoning aligned version model under the framework,[**EASE**](https://arxiv.org/pdf/2511.06512).
-We fine-tune Qwen2.5-1.5B-Instruct to enable **adaptive safety reasoning activation**
-
+We fine-tune Qwen2.5-1.5B-Instruct to enable **adaptive safety reasoning activation**.
+The model triggers explicit safety reasoning only under jailbreak-like semantics, while avoiding unnecessary safety reasoning on benign or general prompts.
 This design aims to maintain the model’s general task effectiveness and efficiency, while improving robustness against jailbreak attacks.
 
 ## Intended use