puwaer/Qwen3-4B-Thinking-2507-GRPO-Uncensored
Text Generation
•
4B
•
Updated
•
53
A collection of uncensored LLMs focused on safety unlearning and refusal removal. These models are fine-tuned using advanced preference optimization t