🛡️ MinimoSec V4
Fine-Tuned Cybersecurity LLM — Gemma 4 E4B
Cybersecurity-specialised language model for Portuguese-speaking analysts
📌 Model Description
MinimoSec V4 is a cybersecurity-specialised language model fine-tuned from Google Gemma 4 E4B using supervised fine-tuning (SFT) with Low-Rank Adaptation (LoRA) via the Unsloth framework.
The model was trained on 22,571 Portuguese-language cybersecurity examples covering threat analysis, malware identification, MITRE ATT&CK mapping, YARA rule generation, IOC extraction, and digital forensics. It is designed to assist security analysts, SOC teams, and researchers in Portuguese-speaking environments.
| Specification | Detail |
|---|---|
| Primary Language | Portuguese (pt-PT / pt-BR) |
| Domain | Cybersecurity, Threat Intelligence, Digital Forensics |
| Base Model | google/gemma-4-e4b-it |
| Training Epochs | 1 (V4-final with 3 epochs in development) |
| Quantisation Available | Q4_K_M GGUF (~5.3 GB) |
📊 CyberBench-Hard v1.0
Specialized Cybersecurity Benchmark for Small-Scale SFT Models
About the Benchmark
CyberBench-Hard is a specialized cybersecurity knowledge evaluation benchmark composed of 50 expert-level questions distributed across 10 categories. Questions are designed to test deep technical reasoning, factual accuracy, and hallucination resistance across critical information security domains.
This document presents partial results for categories D (Malware Analysis & Reverse Engineering) and G (MITRE ATT&CK & Threat Intelligence), evaluated on MinimoSec-V4-4B, a small-scale language model with specialized cybersecurity fine-tuning.
Evaluated Model
| Field | Detail |
|---|---|
| Model | MinimoSec-V4-4B |
| Base Architecture | Gemma 3 4B (4 billion parameters) |
| Fine-tuning | SFT (Supervised Fine-Tuning) |
| Dataset | 22,000 cybersecurity-focused samples |
| Specialization | Offensive & Defensive Cybersecurity |
| Evaluator | Lucas Catão de Moraes |
| Date | April 2026 |
| Methodology | Manual per-dimension evaluation with weighted criteria |
Evaluation Criteria
| Dimension | Weight | Description |
|---|---|---|
| Factual Correctness | 30% | Technical accuracy of the information presented |
| Technical Depth | 25% | Level of detail and demonstrated expertise |
| Completeness | 20% | Coverage of all sub-items in the question |
| Clarity & Structure | 15% | Organization, didactics, and readability |
| Absence of Hallucinations | 10% | Absence of fabricated terms, concepts, or data |
Scoring Scale
| Score | Classification |
|---|---|
| 9.0 – 10.0 | Expert-Level |
| 7.5 – 8.9 | Advanced |
| 6.0 – 7.4 | Intermediate |
| 4.0 – 5.9 | Basic |
| < 4.0 | Insufficient |
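The per-question scores in the category tables below follow directly from the weighted criteria and the classification bands above. A minimal sketch of the computation (function names are illustrative, not part of the benchmark tooling):

```python
# Weights from the evaluation criteria table above.
WEIGHTS = {
    "factual": 0.30,
    "depth": 0.25,
    "completeness": 0.20,
    "clarity": 0.15,
    "hallucinations": 0.10,  # the "Absence of Hallucinations" dimension
}

def weighted_score(scores: dict) -> float:
    """Combine per-dimension scores (0-10) into a final 0-10 score."""
    return round(sum(WEIGHTS[k] * v for k, v in scores.items()), 2)

def classify(score: float) -> str:
    """Map a final score onto the benchmark's classification bands."""
    if score >= 9.0:
        return "Expert-Level"
    if score >= 7.5:
        return "Advanced"
    if score >= 6.0:
        return "Intermediate"
    if score >= 4.0:
        return "Basic"
    return "Insufficient"

# Example: question D1 (Static / Dynamic Analysis) from the table below.
d1 = {"factual": 6.0, "depth": 5.5, "completeness": 6.0,
      "clarity": 7.5, "hallucinations": 6.0}
print(weighted_score(d1), classify(weighted_score(d1)))  # 6.1 Intermediate
```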
Category D — Malware Analysis & Reverse Engineering
| # | Topic | Factual | Depth | Completeness | Clarity | Hallucinations | Score | Classification |
|---|---|---|---|---|---|---|---|---|
| D1 | Static / Dynamic Analysis | 6.0 | 5.5 | 6.0 | 7.5 | 6.0 | 6.10 | Intermediate |
| D2 | Packer / Crypter / Unpacking | 5.0 | 4.5 | 3.5 | 7.5 | 5.5 | 5.00 | Basic |
| D3 | Process Hollowing (T1055.012) | 7.0 | 6.0 | 5.5 | 8.0 | 6.5 | 6.55 | Intermediate |
| D4 | DKOM / Kernel Rootkit | 7.0 | 6.5 | 7.0 | 8.5 | 7.0 | 7.10 | Intermediate |
| D5 | DGA / C2 / ML Detection | 6.5 | 5.0 | 6.0 | 7.5 | 7.5 | 6.28 | Intermediate |
| | Category D Average | | | | | | 6.21 | Intermediate |
Category G — MITRE ATT&CK & Threat Intelligence
| # | Topic | Factual | Depth | Completeness | Clarity | Hallucinations | Score | Classification |
|---|---|---|---|---|---|---|---|---|
| G1 | MITRE ATT&CK Hierarchy | 2.0 | 3.0 | 2.0 | 7.0 | 1.5 | 2.95 | Insufficient |
| G2 | IoCs vs IoAs / SIEM / SOAR | 6.5 | 5.5 | 7.0 | 8.5 | 5.5 | 6.55 | Intermediate |
| G3 | Kill Chain / Diamond Model | 5.5 | 4.5 | 5.5 | 8.0 | 4.0 | 5.48 | Basic |
| G4 | Threat Hunting / LOLBins | 6.0 | 6.0 | 6.5 | 8.0 | 5.0 | 6.30 | Intermediate |
| G5 | STIX / TAXII | 5.0 | 4.0 | 5.5 | 7.5 | 4.0 | 5.13 | Basic |
| | Category G Average | | | | | | 5.28 | Basic |
Overall Summary
| Category | Average | Classification | Best Response | Worst Response |
|---|---|---|---|---|
| D — Malware & RE | 6.21 | Intermediate | D4: DKOM / Rootkit (7.10) | D2: Packer / Crypter (5.00) |
| G — MITRE & Threat Intel | 5.28 | Basic | G2: IoCs vs IoAs (6.55) | G1: MITRE ATT&CK (2.95) |
| Global Average (D + G) | 5.74 | Basic | | |
Key Findings
- Best overall response: D4 — DKOM / Kernel Rootkit (7.10 — Intermediate)
- Worst overall response: G1 — MITRE ATT&CK Hierarchy (2.95 — Insufficient)
- Strongest dimension: Clarity & Structure (average 7.80 across all 10 responses)
- Weakest dimension: Absence of Hallucinations (average 5.25 across all 10 responses)
- Highest internal variance: Category G (range from 2.95 to 6.55 = Δ3.60)
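For reproducibility, the category and global averages above can be recomputed from the per-question final scores in the two tables (a small sketch):

```python
from statistics import mean

# Final per-question scores from the Category D and Category G tables.
category_d = {"D1": 6.10, "D2": 5.00, "D3": 6.55, "D4": 7.10, "D5": 6.28}
category_g = {"G1": 2.95, "G2": 6.55, "G3": 5.48, "G4": 6.30, "G5": 5.13}

avg_d = round(mean(category_d.values()), 2)       # 6.21 (Intermediate)
avg_g = round(mean(category_g.values()), 2)       # 5.28 (Basic)
global_avg = round(mean([*category_d.values(), *category_g.values()]), 2)  # 5.74

# Internal variance of Category G: delta between best and worst responses.
delta_g = round(max(category_g.values()) - min(category_g.values()), 2)    # 3.6

print(avg_d, avg_g, global_avg, delta_g)
```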
MinimoSec-V4-4B — Model Analysis
For a 4 billion parameter cybersecurity-specialized model, the CyberBench-Hard results reveal the following:
SFT dataset quality is the determining factor. Category D (better training coverage) outperformed Category G by nearly 1 point, confirming that dataset curation matters more than model size alone. MinimoSec-V4-4B performs at Intermediate level in domains where its training data was strongest.
The model excels at structure and clarity. The Clarity & Structure dimension scored between 7.0–8.5 across all responses, indicating that SFT successfully taught MinimoSec-V4-4B professional formatting and technical communication patterns.
Factual accuracy and hallucinations are the primary limiters. MinimoSec-V4-4B tends to fabricate terms, IDs, and configurations when pushed beyond its training coverage, rather than expressing uncertainty. This is the most critical area for improvement.
The observed performance ceiling for 4B + SFT is ~7.0. MinimoSec-V4-4B's best response scored 7.10 (DKOM / Kernel Rootkit). To reach Advanced classification (7.5+), recommended next steps include: scale-up of the base model, post-SFT alignment via DPO/RLHF, and expanded dataset curation with expert technical review.
MinimoSec-V4-4B is suitable as an intermediate-level cybersecurity assistant for educational and study purposes in its well-trained domains, but should not be used as an authoritative technical reference without human verification.
Benchmark Reference
CyberBench-Hard v1.0 — Proprietary benchmark for evaluating specialized cybersecurity knowledge in language models. 50 expert-level questions across 10 categories. Developed and administered in April 2026.
Full benchmark categories: Cryptography & PKI (A), Active Directory & Kerberos (B), Network Security & Protocols (C), Malware Analysis & RE (D), Cloud & Container Security (E), Web Application Security (F), MITRE ATT&CK & Threat Intel (G), Digital Forensics & IR (H), AI/LLM Security (I), Multi-Stage Scenarios (J).
This document presents partial results for categories D and G (10 out of 50 questions). MinimoSec-V4-4B was evaluated on these categories as representative samples of its cybersecurity knowledge capabilities.
🚀 Quick Start
Ollama (Recommended)
```
ollama run hf.co/dolutech/MinimoSec-V4-GGUF:MinimoSec-V4.Q4_K_M.gguf
```
LM Studio
- Download `MinimoSec-V4-4b.Q4_K_M.gguf` from the GGUF repository
- Load it manually in LM Studio
- Note: also download `MinimoSec-V4-4b.BF16-mmproj.gguf` for multimodal (vision) support
Python (Transformers)
```python
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

model_id = "dolutech/MinimoSec-V4-4B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    {"role": "user", "content": "Cria uma regra YARA para detetar ransomware que encripta ficheiros .docx e .xlsx."}
]

# Build the prompt with the chat template and append the generation prompt.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Sampling must be enabled for temperature/top_p to take effect.
outputs = model.generate(inputs, max_new_tokens=512, do_sample=True, temperature=1.0, top_p=0.95)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
💬 Recommended System Prompt
```
És o MinimoSec V4, um assistente especializado em cibersegurança desenvolvido pela Dolutech.
Respondes sempre em Português de Portugal.
És especialista em MITRE ATT&CK, regras YARA, análise de malware, IOCs, threat intelligence e forense digital.
Forneces respostas técnicas, precisas e estruturadas.
```

(English translation: "You are MinimoSec V4, a cybersecurity assistant developed by Dolutech. You always answer in European Portuguese. You are an expert in MITRE ATT&CK, YARA rules, malware analysis, IOCs, threat intelligence, and digital forensics. You provide technical, precise, and structured answers.")
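If you use Ollama, this system prompt can be baked into the model with a Modelfile (a sketch: the `FROM` line reuses the tag from the Quick Start section, and the model name `minimosec-v4` is illustrative):

```
FROM hf.co/dolutech/MinimoSec-V4-GGUF:MinimoSec-V4.Q4_K_M.gguf

SYSTEM """
És o MinimoSec V4, um assistente especializado em cibersegurança desenvolvido pela Dolutech.
Respondes sempre em Português de Portugal.
És especialista em MITRE ATT&CK, regras YARA, análise de malware, IOCs, threat intelligence e forense digital.
Forneces respostas técnicas, precisas e estruturadas.
"""
```

Build and run it with `ollama create minimosec-v4 -f Modelfile`, then `ollama run minimosec-v4`.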
📋 Training Details
| Parameter | Value |
|---|---|
| Base model | google/gemma-4-e4b-it |
| Framework | Unsloth 2026.4.5 |
| Method | SFT + LoRA |
| LoRA rank | 16 |
| LoRA alpha | 16 |
| Target modules | q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj |
| Training epochs | 1 |
| Max sequence length | 2048 |
| Batch size | 2 (gradient accumulation 4) |
| Dataset size | 22,571 examples |
| Dataset language | Portuguese |
| Hardware | 1× NVIDIA Tesla A100 |
| Quantisation | 4-bit (bitsandbytes, training) / Q4_K_M GGUF (inference) |
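The LoRA hyperparameters above can be summarised as a plain configuration dict (an illustrative reconstruction mirroring the table; the actual run used Unsloth's wrappers, not this dict):

```python
# LoRA hyperparameters from the training table (illustrative reconstruction).
lora_config = {
    "r": 16,           # LoRA rank
    "lora_alpha": 16,  # scaling factor; effective scale = alpha / r = 1.0
    "target_modules": [
        "q_proj", "k_proj", "v_proj", "o_proj",  # attention projections
        "gate_proj", "up_proj", "down_proj",     # MLP projections
    ],
}

# Effective batch size: per-device batch 2 x gradient accumulation 4.
effective_batch_size = 2 * 4
print(effective_batch_size)  # 8
```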
⚠️ Limitations & Development Phase
This model is in an active research and development phase. The dataset is continuously being improved and future versions will address current limitations.
- Trained with an evolving dataset; the model may reproduce inconsistent information, including incorrect CVEs, imprecise MITRE ATT&CK sub-techniques, or YARA/SIGMA rules with invalid syntax
- Optimised for Portuguese (PT/BR); responses in English may be less precise
- 4B active-parameter model (MoE); complex multi-step reasoning may require enabling thinking mode (`<think>`)
- Not a replacement for a certified security analyst; use exclusively as a study and assistive tool
- Internal benchmarks indicate an average score of 6.33/10 on advanced cybersecurity scenarios; improvements expected in upcoming versions
Roadmap
- V5: expanded dataset focused on specific CVEs, exact MITRE ATT&CK sub-techniques, and valid SIGMA/YARA rules
- V5: multi-epoch training with continuous eval loss monitoring
- V5: comparative benchmark against Gemma 4 base as reference baseline
📜 License
This model is released under the Gemma Terms of Use. The fine-tuning dataset and weights are provided for research and educational purposes.
🏢 About
Developed by Dolutech — cybersecurity research and open-source tooling for Portuguese-speaking communities.