Transformers
GGUF
English
conversational
How to use from
llama.cpp
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf krgl/Llama-Primus-Base_8bit-gguf
# Run inference directly in the terminal:
llama-cli -hf krgl/Llama-Primus-Base_8bit-gguf
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf krgl/Llama-Primus-Base_8bit-gguf
# Run inference directly in the terminal:
llama-cli -hf krgl/Llama-Primus-Base_8bit-gguf
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf krgl/Llama-Primus-Base_8bit-gguf
# Run inference directly in the terminal:
./llama-cli -hf krgl/Llama-Primus-Base_8bit-gguf
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf krgl/Llama-Primus-Base_8bit-gguf
# Run inference directly in the terminal:
./build/bin/llama-cli -hf krgl/Llama-Primus-Base_8bit-gguf
Use Docker
docker model run hf.co/krgl/Llama-Primus-Base_8bit-gguf
Quick Links

Model Card for 8Bit GGUF version of TrendMicro-Llama-Primus-Base-8bit-gguf

This model is a 8bit Quantized GGUF model of trendmicro-ailab/Llama-Primus-Base For original model and documentation visit

https://huggingface.co/trendmicro-ailab/Llama-Primus-Base

Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training

TL;DR: Llama-Primus-Base is a foundation model based on Llama-3.1-8B-Instruct, continually pre-trained on Primus-Seed (0.2B) and Primus-FineWeb (2.57B). Primus-Seed is a high-quality, manually curated cybersecurity text dataset, while Primus-FineWeb consists of cybersecurity texts filtered from FineWeb, a refined version of Common Crawl. By pretraining on such a large-scale cybersecurity corpus, it achieves a 🚀15.88% improvement in aggregated scores across multiple cybersecurity benchmarks, demonstrating the effectiveness of cybersecurity-specific pretraining.

🔥 For more details, please refer to the paper: [📄Paper].

License

This model is based on the MIT license, but you must also comply with the Llama 3.1 Community License Agreement.

Downloads last month
8
GGUF
Model size
8B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for krgl/Llama-Primus-Base_8bit-gguf

Dataset used to train krgl/Llama-Primus-Base_8bit-gguf

Paper for krgl/Llama-Primus-Base_8bit-gguf