HaonanShi commited on
Commit
76f2183
·
verified ·
1 Parent(s): 1180dd8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -11,7 +11,7 @@ tags:
11
  This model is a fine-tuned version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct) on the [EASE-SafetyReasoning](https://huggingface.co/datasets/HaonanShi/EASE-STAR41K-SafetyReasoning-10K) dataset.
12
 
13
  ## Model description
14
- This is the safety reasoning aligned version model under the framework,[**EASE**](https://arxiv.org/pdf/2511.06512). We fine-tune Qwen2.5-1.5B-Instruct to enable **adaptive safety reasoning activation**. The model triggers explicit safety reasoning only under jailbreak-like semantics, while avoiding unnecessary safety reasoning on benign or general prompts. This design aims to maintain the model’s general task effectiveness and efficiency, while improving robustness against jailbreak attacks.
15
 
16
  ## Intended use
17
  Safety-oriented research on:(1)Safety alignment, (2)Small language models and (3)Jailbreak robustness
 
11
  This model is a fine-tuned version of [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct) on the [EASE-SafetyReasoning](https://huggingface.co/datasets/HaonanShi/EASE-STAR41K-SafetyReasoning-10K) dataset.
12
 
13
  ## Model description
14
+ This is the safety reasoning aligned version model under the framework,[**EASE**](https://arxiv.org/pdf/2511.06512). We fine-tune Qwen2.5-3B-Instruct to enable **adaptive safety reasoning activation**. The model triggers explicit safety reasoning only under jailbreak-like semantics, while avoiding unnecessary safety reasoning on benign or general prompts. This design aims to maintain the model’s general task effectiveness and efficiency, while improving robustness against jailbreak attacks.
15
 
16
  ## Intended use
17
  Safety-oriented research on:(1)Safety alignment, (2)Small language models and (3)Jailbreak robustness