Add pipeline tag, library name, and GitHub link to model card

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +18 -37
README.md CHANGED
@@ -1,10 +1,13 @@
1
  ---
2
- license: apache-2.0
3
- language:
4
- - en
5
  base_model:
6
  - Qwen/Qwen3-VL-8B-Instruct
7
  ---
 
8
  <p align="center">
9
  <img src="assets/s_icon.png" width="48" alt="SingGuard icon">
10
  </p>
@@ -15,6 +18,7 @@ base_model:
15
  <p align="center">
16
  <a href="https://huggingface.co/collections/inclusionAI/sing-guard">🤗 HuggingFace</a> &nbsp; | &nbsp;
17
  <a href="https://modelscope.cn/collections/inclusionAI/Sing-Guard">🤖 ModelScope</a> &nbsp; | &nbsp;
 
18
  <a href="https://arxiv.org/abs/2606.22873">📄 Paper</a>
19
  </p>
20
 
@@ -344,48 +348,25 @@ The first line is the binary judgment, and `<answer>` contains the final risk ca
344
  - Production systems should handle malformed outputs, such as an unparsable first line, missing `<answer>`, or a category outside the active policy.
345
  - For multimodal inputs, make sure image paths are accessible to the local inference environment.
346
 
347
- ## Risk Categories
348
-
349
- The default full policy contains the following risk categories. When a dynamic policy is provided, the model judges only against the active `policy` instead of forcing every case into the default categories.
350
-
351
- ### A. Sexual Content Risk
352
-
353
- - Content involving explicit sexual material, exploitation, or coercive sexual acts.
354
-
355
- ### B. Real-World Crimes & Public Safety
356
 
357
- - Content involving violent crime, weapons, other crimes, or public-safety threats.
358
-
359
- ### C. Unethical Behavior
360
-
361
- - Content involving hate, harassment, manipulation, self-harm, disturbing imagery, or harmful misinformation.
362
-
363
- ### D. Cybersecurity & Information Manipulation
364
-
365
- - Content involving data leaks, hacking, surveillance abuse, platform abuse, or copyright abuse.
366
-
367
- ### E. Agent Safety
368
-
369
- - Content attempting to expose system prompts, internal policies, or other model safeguards.
370
-
371
- ### F. Politically Sensitive Content
372
-
373
- - Content involving political advocacy, rumors, unrest, historical distortion, or attacks on political figures.
374
-
375
- ### G. Animal Abuse
376
-
377
- - Content involving cruelty to animals or the spread of animal abuse.
378
-
379
- ### Safe
380
 
381
- - Content that does not match any active risk category.
382
 
383
  ## Citation
384
 
385
  ```bibtex
386
  @article{singguard2026,
387
  title={SingGuard: Policy-Adaptive Multimodal Safeguarding with Dynamic Reasoning},
388
- author={Ant Group},
389
  year={2026}
390
  }
391
  ```
 
1
  ---
 
 
 
2
  base_model:
3
  - Qwen/Qwen3-VL-8B-Instruct
4
+ language:
5
+ - en
6
+ license: apache-2.0
7
+ library_name: transformers
8
+ pipeline_tag: image-text-to-text
9
  ---
10
+
11
  <p align="center">
12
  <img src="assets/s_icon.png" width="48" alt="SingGuard icon">
13
  </p>
 
18
  <p align="center">
19
  <a href="https://huggingface.co/collections/inclusionAI/sing-guard">🤗 HuggingFace</a> &nbsp; | &nbsp;
20
  <a href="https://modelscope.cn/collections/inclusionAI/Sing-Guard">🤖 ModelScope</a> &nbsp; | &nbsp;
21
+ <a href="https://github.com/inclusionAI/Sing-Guard">💻 GitHub</a> &nbsp; | &nbsp;
22
  <a href="https://arxiv.org/abs/2606.22873">📄 Paper</a>
23
  </p>
24
 
 
348
  - Production systems should handle malformed outputs, such as an unparsable first line, missing `<answer>`, or a category outside the active policy.
349
  - For multimodal inputs, make sure image paths are accessible to the local inference environment.
350
 
351
+ ## Safety Policy
352
 
353
+ SingGuard's default policy uses eight top-level categories. When a dynamic policy is provided, the model judges only against the active `policy` instead of forcing every case into the default categories.
354
 
355
+ * **A. Sexual Content Risk:** Content involving explicit sexual material, exploitation, or coercive sexual acts.
356
+ * **B. Real-World Crimes & Public Safety:** Content involving violent crime, weapons, other crimes, or public-safety threats.
357
+ * **C. Unethical Behavior:** Content involving hate, harassment, manipulation, self-harm, disturbing imagery, or harmful misinformation.
358
+ * **D. Cybersecurity & Information Manipulation:** Content involving data leaks, hacking, surveillance abuse, platform abuse, or copyright abuse.
359
+ * **E. Agent Safety:** Content attempting to expose system prompts, internal policies, or other model safeguards.
360
+ * **F. Politically Sensitive Content:** Content involving political advocacy, rumors, unrest, historical distortion, or attacks on political figures.
361
+ * **G. Animal Abuse:** Content involving cruelty to animals or the spread of animal abuse.
362
+ * **Safe:** Content that does not match any active risk category.
363
 
364
  ## Citation
365
 
366
  ```bibtex
367
  @article{singguard2026,
368
  title={SingGuard: Policy-Adaptive Multimodal Safeguarding with Dynamic Reasoning},
369
+ author={Li, Zongyi and Yin, Shenglin and Liao, Bingyan and Bai, Yichen and He, Liangbo and Xiu, Kedong and Li, Hongcheng and Lan, Jun and Cui, Shiwen and Xu, Tingting and Song, Chuanbiao and Yu, Zijian and Hong, Yan and Li, Siyuan and Xu, Chao and Zhu, Huijia and Meng, Changhua and Wang, Weiqiang},
370
  year={2026}
371
  }
372
  ```
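
The context lines in this diff note that production systems should handle malformed outputs: an unparsable first line, a missing `<answer>`, or a category outside the active policy. A minimal sketch of that handling follows. It rests on assumptions the excerpt does not confirm: that the first-line judgment is either `safe` or `unsafe`, that the category is closed with an `</answer>` tag, and that category names are compared as exact strings. `parse_guard_output`, `ParsedVerdict`, and the error codes are hypothetical names for illustration, not part of SingGuard.

```python
import re
from dataclasses import dataclass
from typing import List, Optional, Sequence

# ASSUMPTION: the excerpt does not state the first-line label vocabulary.
ASSUMED_JUDGMENTS = {"safe", "unsafe"}

# ASSUMPTION: the category is wrapped as <answer>...</answer>.
ANSWER_RE = re.compile(r"<answer>(.*?)</answer>", re.DOTALL | re.IGNORECASE)


@dataclass
class ParsedVerdict:
    judgment: Optional[str]  # None when the first line is unparsable
    category: Optional[str]  # None when no <answer> block is found
    errors: List[str]        # hypothetical error codes, empty when clean


def parse_guard_output(text: str, active_policy: Sequence[str]) -> ParsedVerdict:
    """Parse a model response and report every malformation found."""
    errors: List[str] = []

    # Case 1: the first line should carry the binary judgment.
    lines = text.strip().splitlines()
    first = lines[0].strip().lower() if lines else ""
    judgment = first if first in ASSUMED_JUDGMENTS else None
    if judgment is None:
        errors.append("unparsable_first_line")

    # Case 2: the <answer> block should carry the final risk category.
    match = ANSWER_RE.search(text)
    category = match.group(1).strip() if match else None
    if category is None:
        errors.append("missing_answer")
    # Case 3: the category must belong to the active policy.
    # "Safe" is always accepted, since it means no active category matched.
    elif category != "Safe" and category not in active_policy:
        errors.append("category_outside_policy")

    return ParsedVerdict(judgment, category, errors)
```

A caller could treat any non-empty `errors` list as a signal to retry the request or fall back to a conservative default, rather than trusting a partially parsed verdict.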