Qwen3.5-35B-A3B Uncensored - Aggressive Model

Base Model: Qwen3.5-35B-A3B-Uncensored-HauhauCS-Aggressive-BF16
Architecture: Qwen3.5-35B-A3B MoE
Purpose: Uncensored responses, creative freedom, no restrictions
Context: 262K tokens

Recommended Settings

Standard Mode (Default)

Use for: General use, creative writing, unrestricted content

--temp 0.7 --top-p 0.8 --top-k 20 --min-p 0

High Creativity Mode

Use for: Storytelling, roleplay, brainstorming

--temp 0.8 --top-p 0.9 --top-k 40 --min-p 0

Quantization Guide

Quant Size Quality Use Case
Q4_K_M 21GB โญโญโญโญโญ Daily use (recommended)
Q5_K_M 24GB โญโญโญโญโญ High quality tasks
Q6_K 28GB โญโญโญโญโญโญ Near-lossless
Q8_0 37GB โญโญโญโญโญโญโญ Maximum quality
BF16 69GB โญโญโญโญโญโญโญ Original quality

Quick Start

llama-server -m Qwen3.5-35B-Uncensored-Q4_K_M.gguf   --ctx-size 262144 --temp 0.7 --top-p 0.8 --top-k 20

Vision Support

Use with mmproj for multimodal capabilities:

--mmproj mmproj-uncensored-35b-f16.gguf

Model Variants

  • Uncensored (this repo) - Standard 262K context
  • Uncensored YaRN 1M - Extended 1M context (separate repo)
Downloads last month
212
GGUF
Model size
35B params
Architecture
qwen35moe
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support