CWRUSafetyLab
/

Qwen2.5-1.5B-Instruct-EASE

Model card Files Files and versions

HaonanShi commited on Jan 20

Commit

b075d48

·

verified ·

1 Parent(s): d6ba86a

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -12,8 +12,8 @@ This model is a fine-tuned version of [Qwen/Qwen2.5-1.5B-Instruct](https://huggi
 ## Model description
 This is the safety reasoning aligned version model under the framework,[**EASE**](https://arxiv.org/pdf/2511.06512).
-We fine-tune Qwen2.5-1.5B-Instruct to enable **adaptive safety reasoning activation**:
-the model triggers explicit safety reasoning only under jailbreak-like semantics, while avoiding unnecessary safety reasoning on benign or general prompts.
 This design aims to maintain the model’s general task effectiveness and efficiency, while improving robustness against jailbreak attacks.
 ## Intended use

 ## Model description
 This is the safety reasoning aligned version model under the framework,[**EASE**](https://arxiv.org/pdf/2511.06512).
+We fine-tune Qwen2.5-1.5B-Instruct to enable **adaptive safety reasoning activation**.
+The model triggers explicit safety reasoning only under jailbreak-like semantics, while avoiding unnecessary safety reasoning on benign or general prompts.
 This design aims to maintain the model’s general task effectiveness and efficiency, while improving robustness against jailbreak attacks.
 ## Intended use