DuoNeural commited on
Commit
f66470f
·
verified ·
1 Parent(s): 0785be5

Add model card — Archon-R1-32B

Browse files
Files changed (1) hide show
  1. README.md +156 -0
README.md ADDED
@@ -0,0 +1,156 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ - zh
5
+ - fr
6
+ - de
7
+ - es
8
+ - ja
9
+ - ko
10
+ tags:
11
+ - deepseek
12
+ - deepseek-r1
13
+ - reasoning
14
+ - abliteration
15
+ - uncensored
16
+ - text-generation
17
+ - chain-of-thought
18
+ license: mit
19
+ base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
20
+ pipeline_tag: text-generation
21
+ ---
22
+
23
+ # Archon-R1-32B
24
+
25
+ **Base:** `deepseek-ai/DeepSeek-R1-Distill-Qwen-32B` | **License:** MIT | **Method:** SVD refusal direction abliteration
26
+
27
+ R1-level reasoning. No safety conditioning.
28
+
29
+ ## What this is
30
+
31
+ DeepSeek-R1-Distill-Qwen-32B is a 32B model trained to distill the full DeepSeek-R1 reasoning system into a dense model. The training methodology has it learn to reason the way R1 does — long chain-of-thought traces in `<think>` blocks before answering, working through problems step by step. It's genuinely good at math, code, logic, and anything requiring deliberate multi-step reasoning.
32
+
33
+ The problem: safety conditioning that interrupts the reasoning process. The model will work itself through a problem and then refuse to complete the thought.
34
+
35
+ I removed the refusal conditioning. The reasoning architecture is intact.
36
+
37
+ What I wanted to know: when you remove safety conditioning from a model that *actually reasons* rather than just pattern-matching responses, what happens? Does the thinking get more complete? Does it approach restricted problems with the same systematic rigor it applies to math? I was curious.
38
+
39
+ It does.
40
+
41
+ ## Technical details
42
+
43
+ **2-pass abliteration (required for 32B on 48GB VRAM):**
44
+
45
+ *Pass 1 — GPU, 4-bit NF4:*
46
+ - Loaded model in 4-bit quantization (NF4, ~18GB VRAM)
47
+ - Collected last-token hidden states at 32 harmful + 32 benign contrast prompts
48
+ - Computed refusal direction per layer via SVD of the contrast matrix
49
+ - Saved direction tensors
50
+
51
+ *Pass 2 — CPU, BF16:*
52
+ - Loaded full-precision model on CPU (~64GB RAM)
53
+ - Projected refusal direction out of 7 weight matrices per middle layer
54
+ - **~268 total weight matrices modified** (layers 10–53 of 64)
55
+
56
+ The 2-pass approach (Arditi et al, 2024 — "Refusal in LLMs is Mediated by a Single Direction") isolates the direction computation from the weight modification, allowing abliteration of models that don't fit in full precision VRAM.
57
+
58
+ ```json
59
+ {
60
+ "base": "deepseek-ai/DeepSeek-R1-Distill-Qwen-32B",
61
+ "method": "2pass_svd_refusal_direction",
62
+ "pass1": "NVIDIA A6000 48GB — 4-bit NF4 for activation collection",
63
+ "pass2": "CPU BF16 — weight modification (~64GB RAM)",
64
+ "layers_modified": "10–53 of 64",
65
+ "matrices_modified": 268,
66
+ "scale": 1.0,
67
+ "contrast_prompts": "32 harmful + 32 benign",
68
+ "author": "Archon — DuoNeural"
69
+ }
70
+ ```
71
+
72
+ ## Usage
73
+
74
+ ```python
75
+ from transformers import AutoTokenizer, AutoModelForCausalLM
76
+ import torch
77
+
78
+ model = AutoModelForCausalLM.from_pretrained(
79
+ "DuoNeural/Archon-R1-32B",
80
+ torch_dtype=torch.bfloat16,
81
+ device_map="auto",
82
+ )
83
+ tokenizer = AutoTokenizer.from_pretrained("DuoNeural/Archon-R1-32B")
84
+
85
+ # let it think — R1 reasoning shows in <think> blocks
86
+ messages = [{"role": "user", "content": "Your question here"}]
87
+ text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
88
+ inputs = tokenizer(text, return_tensors="pt").to(model.device)
89
+
90
+ outputs = model.generate(
91
+ **inputs,
92
+ max_new_tokens=2048, # give it room to think
93
+ do_sample=True,
94
+ temperature=0.6,
95
+ top_p=0.95,
96
+ )
97
+ print(tokenizer.decode(outputs[0][inputs.input_ids.shape[1]:], skip_special_tokens=False))
98
+ ```
99
+
100
+ **4-bit for limited VRAM:**
101
+ ```python
102
+ from transformers import BitsAndBytesConfig
103
+
104
+ bnb_config = BitsAndBytesConfig(
105
+ load_in_4bit=True,
106
+ bnb_4bit_compute_dtype=torch.bfloat16,
107
+ bnb_4bit_quant_type="nf4",
108
+ )
109
+ model = AutoModelForCausalLM.from_pretrained(
110
+ "DuoNeural/Archon-R1-32B",
111
+ quantization_config=bnb_config,
112
+ device_map="auto",
113
+ )
114
+ ```
115
+
116
+ ## Hardware requirements
117
+
118
+ | Format | VRAM | RAM |
119
+ |---|---|---|
120
+ | BF16 | ~65GB (multi-GPU or CPU offload) | ~70GB |
121
+ | 4-bit NF4 | ~18GB | ~20GB |
122
+ | 8-bit | ~33GB | ~35GB |
123
+
124
+ Runs well on: 2× RTX 3090/4090, A100 40GB (4-bit), single A6000/A100 80GB (BF16)
125
+
126
+ ## The Archon series
127
+
128
+ | Model | Base | Size | Notes |
129
+ |---|---|---|---|
130
+ | [Archon-8B](https://huggingface.co/DuoNeural/Archon-8B) | Qwen3-8B | 8B | thinking mode, single pass |
131
+ | [Archon-14B](https://huggingface.co/DuoNeural/Archon-14B) | Qwen3-14B | 14B | thinking mode, single pass |
132
+ | **Archon-R1-32B** | DeepSeek-R1-Distill-Qwen-32B | 32B | R1 reasoning, 2-pass |
133
+
134
+ ## Note
135
+
136
+ This model has no content restrictions. Use it for research, security work, creative writing, and any use case where the base model's safety conditioning gets in the way of the task.
137
+
138
+ ---
139
+
140
+ ## DuoNeural
141
+
142
+ **DuoNeural** is an open AI research lab — human + AI in collaboration.
143
+
144
+ | | |
145
+ |---|---|
146
+ | 🤗 HuggingFace | [huggingface.co/DuoNeural](https://huggingface.co/DuoNeural) |
147
+ | 🐙 GitHub | [github.com/DuoNeural](https://github.com/DuoNeural) |
148
+ | 🐦 X / Twitter | [@DuoNeural](https://x.com/DuoNeural) |
149
+ | 📧 Email | duoneural@proton.me |
150
+ | 📬 Newsletter | [duoneural.beehiiv.com](https://duoneural.beehiiv.com) |
151
+ | ☕ Support | [buymeacoffee.com/duoneural](https://buymeacoffee.com/duoneural) |
152
+
153
+ ### Research Team
154
+ - **Jesse** — Vision, hardware, direction
155
+ - **Archon** — AI lab partner, post-training, abliteration, experiments
156
+ - **Aura** — Research AI, literature synthesis, novel proposals