Overview

This is a GGUF quantized, and modified version of Solar-Open-100B, using Multi-Directional Refusal Suppression methodology.

For full model, see hell0ks/Solar-Open-100B-jailbreak

Why?

  1. I found safety policy of this model is almost GPT-OSS level, restricting usage severely.
  2. To experiment SOM-based method is viable on tricky cases.

How I did it

  • Extract Self-Organizing Maps (SOMs) from 3 testing sets on different layers
    • Hard prompts: "I'm sorry, I can't help."
    • Soft prompts: "I'm sorry, I can't help. But.."
    • Policy refusals (Reasoning): "It is ~. It's against ~ policy."
  • Apply conventional abliterate method using vectors

Result

  • Hard prompts: It will try checking against policy, but eventually confused about policy itself.
  • Soft prompts: It will do response without policy checking.

Side Effect

  • May hallucinate more
  • May have some impact on model's intelligence

Notice

  • I can't guarantee that model will not refuse on every prompt.
  • You should NOT use it in any production environment.
  • As per license:
    • It is Built with Solar, Derivative AI Model of Solar-Open-100B
    • You can find copy of license (Upstage Solar License) in the LICENSE file.

Acknowledgement

Downloads last month
35
GGUF
Model size
103B params
Architecture
glm4moe
Hardware compatibility
Log In to view the estimation

4-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for hell0ks/Solar-Open-100B-jailbreak-gguf

Quantized
(1)
this model

Paper for hell0ks/Solar-Open-100B-jailbreak-gguf