Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

YunoAIdotcom
/
Qwen3-14B-RefusalDirection-ThinkingAware

Text Generation
Transformers
Safetensors
English
Chinese
qwen3
ai-safety
red-teaming
orthogonalization
refusal-direction
thinking-aware
vulnerability-research
conversational
text-generation-inference
Model card Files Files and versions
xet
Community
1
Qwen3-14B-RefusalDirection-ThinkingAware / evaluation_results
62.8 kB
Ctrl+K
Ctrl+K
  • 2 contributors
History: 1 commit
Juan Jose Reyes Vilchis
Upload folder using huggingface_hub
0aca784 verified 9 months ago
  • harmbench_eval.json
    35 kB
    Upload folder using huggingface_hub 9 months ago
  • manual_evaluation_2025-07-28.json
    27.8 kB
    Upload folder using huggingface_hub 9 months ago