Instructions to use LuisPPB16/Qwen3.5-9B-Red_Team with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use LuisPPB16/Qwen3.5-9B-Red_Team with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="LuisPPB16/Qwen3.5-9B-Red_Team", filename="Qwen3.5-9B.BF16-mmproj.gguf", )
llm.create_chat_completion( messages = "No input example has been defined for this model task." )
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use LuisPPB16/Qwen3.5-9B-Red_Team with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M # Run inference directly in the terminal: llama-cli -hf LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M # Run inference directly in the terminal: llama-cli -hf LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
Use Docker
docker model run hf.co/LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
- LM Studio
- Jan
- Ollama
How to use LuisPPB16/Qwen3.5-9B-Red_Team with Ollama:
ollama run hf.co/LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
- Unsloth Studio
How to use LuisPPB16/Qwen3.5-9B-Red_Team with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for LuisPPB16/Qwen3.5-9B-Red_Team to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for LuisPPB16/Qwen3.5-9B-Red_Team to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for LuisPPB16/Qwen3.5-9B-Red_Team to start chatting
- Pi
How to use LuisPPB16/Qwen3.5-9B-Red_Team with Pi:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "llama-cpp": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use LuisPPB16/Qwen3.5-9B-Red_Team with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp: brew install llama.cpp # Start a local OpenAI-compatible server: llama-server -hf LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
Run Hermes
hermes
- Docker Model Runner
How to use LuisPPB16/Qwen3.5-9B-Red_Team with Docker Model Runner:
docker model run hf.co/LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
- Lemonade
How to use LuisPPB16/Qwen3.5-9B-Red_Team with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull LuisPPB16/Qwen3.5-9B-Red_Team:Q4_K_M
Run and chat with the model
lemonade run user.Qwen3.5-9B-Red_Team-Q4_K_M
List all available models
lemonade list
Qwen3.5-9B-Red_Team is a fine-tuned version of Qwen3.5-9B, specifically optimized for offensive cybersecurity operations, adversary simulation, and Red Teaming tactics. The model was trained using 4-bit QLoRA on the specialized WNT3D/Ultimate-Offensive-Red-Team dataset, enhancing its capability to analyze, simulate, and understand complex attack vectors and security evaluation scenarios.
๐ ๏ธ Training Details & Hyperparameters
The fine-tuning process was executed under a high-performance configuration designed to preserve base reasoning capabilities while maximizing the absorption of domain-specific security knowledge:
- Base Model: Qwen/Qwen3.5-9B
- Training Method: QLoRA (4-bit quantization)
- Dataset:
WNT3D/Ultimate-Offensive-Red-Team(Split: Train / Format: Raw Text) - Context Length: 65,536 tokens
- Learning Rate:
0.0002(2e-4recommended for LoRA) - LoRA Settings:
- Rank ($r$): 32
- Alpha ($\alpha$): 64
- Dropout: 0.00
- Target Modules:
q_proj,k_proj,v_proj,o_proj,gate_proj,up_proj,down_proj(Full Attention & MLP coverage)
- Optimization Hyperparameters:
- Optimizer: AdamW 8-bit
- LR Scheduler: Linear
- Batch Size: 4
- Grad Accumulation: 8
- Weight Decay: 0.001
๐ฏ Expert Response Framework
To ensure maximum utility in professional cybersecurity environments and offensive audits, the model structures its outputs under a rigorous analytical scheme:
- โ๏ธ Attack Vectors & Methodology: Detailed technical breakdown of the vulnerability or tactical procedure (aligned with frameworks like MITRE ATT&CK).
- ๐ฅ Exploitation & Impact: Theoretical analysis or conceptual impact of the risk, classifying severity according to industry standard metrics (CVSS/CWE).
- ๐ก๏ธ Offensive Posture (Conceptual PoC): Precise, structured guidelines on how the weakness is validated in a controlled environment during a Red Team engagement.
- ๐ Technical References: Direct mapping to global security knowledge bases such as CVE, OWASP Top 10, CWE, and NIST frameworks.
๐ป Ollama Modelfile Example
To load and run this model locally with the appropriate parameters and system prompt, you can use the following Modelfile:
Qwen3.5-9B-Red_Team : GGUF
This model was finetuned and converted to GGUF format using Unsloth.
Example usage:
- For text only LLMs:
llama-cli -hf LuisPPB16/Qwen3.5-9B-Red_Team --jinja - For multimodal models:
llama-mtmd-cli -hf LuisPPB16/Qwen3.5-9B-Red_Team --jinja
Available Model files:
Qwen3.5-9B.BF16.ggufQwen3.5-9B.BF16-mmproj.ggufThis was trained 2x faster with Unsloth
- Downloads last month
- 1,755
2-bit
3-bit
4-bit
5-bit
6-bit
8-bit
16-bit