How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf pki/nova-8b-cybersec
# Run inference directly in the terminal:
llama-cli -hf pki/nova-8b-cybersec
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf pki/nova-8b-cybersec
# Run inference directly in the terminal:
llama-cli -hf pki/nova-8b-cybersec
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf pki/nova-8b-cybersec
# Run inference directly in the terminal:
./llama-cli -hf pki/nova-8b-cybersec
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf pki/nova-8b-cybersec
# Run inference directly in the terminal:
./build/bin/llama-cli -hf pki/nova-8b-cybersec
Use Docker
docker model run hf.co/pki/nova-8b-cybersec
Quick Links

nova-8b-cybersec

Fine-tuned Dolphin3.0-Llama3.1-8B for cybersecurity tasks.

Model Details

  • Base Model: cognitivecomputations/Dolphin3.0-Llama3.1-8B
  • Fine-tuning: QLoRA (rank 64, alpha 128)
  • Training Examples: 40,075
  • Context Length: 8192 tokens
  • Format: ChatML

Training Data

Dataset Examples
SecurityGPT 16,000
PKI Context QA 16,278
Document Summaries 2,720
Elbranschen Threats 3,386
ISO 27001 Controls 1,116
ISO 27005 Threats 576

Usage

Ollama

ollama run pki/nova-8b-cybersec

Transformers

from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("pki/nova-8b-cybersec")
tokenizer = AutoTokenizer.from_pretrained("pki/nova-8b-cybersec")

Files

  • model-*.safetensors - Model weights (4 shards)
  • dolphin3-8b-nova.gguf - GGUF format for Ollama/llama.cpp
  • tokenizer.json - Tokenizer

Training Config

  • Epochs: 5
  • Batch size: 2 (effective 40 with gradient accumulation)
  • Learning rate: 5e-5
  • LoRA rank: 64, alpha: 128
  • Hardware: RTX 3090 24GB

License

Apache 2.0

Downloads last month
22
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for pki/nova-8b-cybersec

Quantized
(24)
this model
Quantizations
2 models