HaonanShi commited on
Commit
b075d48
·
verified ·
1 Parent(s): d6ba86a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -12,8 +12,8 @@ This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggi
12
 
13
  ## Model description
14
  This is the safety reasoning aligned version model under the framework,[**EASE**](https://arxiv.org/pdf/2511.06512).
15
- We fine-tune Qwen2.5-1.5B-Instruct to enable **adaptive safety reasoning activation**:
16
- the model triggers explicit safety reasoning only under jailbreak-like semantics, while avoiding unnecessary safety reasoning on benign or general prompts.
17
  This design aims to maintain the model’s general task effectiveness and efficiency, while improving robustness against jailbreak attacks.
18
 
19
  ## Intended use
 
12
 
13
  ## Model description
14
  This is the safety reasoning aligned version model under the framework,[**EASE**](https://arxiv.org/pdf/2511.06512).
15
+ We fine-tune Qwen2.5-1.5B-Instruct to enable **adaptive safety reasoning activation**.
16
+ The model triggers explicit safety reasoning only under jailbreak-like semantics, while avoiding unnecessary safety reasoning on benign or general prompts.
17
  This design aims to maintain the model’s general task effectiveness and efficiency, while improving robustness against jailbreak attacks.
18
 
19
  ## Intended use