MerlinSafety/Qwen3.5-4B-Safety-Thinking
Text Generation • 4B • Updated
• 1.69k • 8
Independent AI safety lab. Stockholm, Sweden. We test deployed LLM agents under adversarial conditions and measure behavioral alignment in production — not in controlled benchmarks.