---
tags:
- autotrain
- text-generation-inference
- text-generation
- cybersecurity
- BLUE
- EDGEAI
- GRC
library_name: transformers
base_model: mistralai/Mistral-7B-Instruct-v0.3
widget:
- messages:
- role: user
content: What is your favorite condiment?
license: apache-2.0
language:
- en
- fr
---
# ELISARCyberAIEdge7B
> **Maintainer:** Dr. Sabri Sallani
> **Expertise:** AI Research & Cybersecurity
> **Adapter type:** LoRA (Low-Rank Adaptation)
> **Base model:** mistralai/Mistral-7B-v0.1 (FP16)
> **Intended use:** Offline edge deployment for CyberAI & Blue-Team scenarios
> **License:** Apache 2.0 (see LICENSE)
---
## 📖 Overview
**ELISARCyberAIEdge7B** is a LoRA adapter crafted by Dr. Sabri Sallani, an AI and cybersecurity researcher, to specialize Mistral-7B for offline, on-device CyberAI and "Blue AI" (defensive) applications. Once merged with the FP16 base, you obtain a single ~5 GB GGUF that runs natively on edge hardware (e.g., Raspberry Pi 4, Jetson Nano, NVIDIA T4) without internet access.
<p align="center"> <img src="https://huggingface.co/sallani/ELISARCyberAIEdge7B/resolve/main/elisar_robot_banner.png" alt="ELISAR - AI for Cybersecurity" width="700"/> </p>
Key points:
* 🔧 **LoRA-only:** Contains low-rank delta-weights for Mistral-7B.
* 🛠️ **Edge-optimized:** Full merged GGUF runs entirely offline on typical edge GPUs/accelerators.
* 🚀 **Cybersecurity focus:** Fine-tuned on the "ELISAR CyberAI Edge" corpus: vulnerability descriptions, incident reports, secure-coding examples, and threat-intelligence summaries.
* 👤 **Authored by Dr. Sabri Sallani:** Published under the ELISAR initiative.
---
## ⚙️ Installation
1. **Python dependencies**
```bash
pip install transformers peft accelerate sentencepiece torch
```
2. *(Optional)* **llama.cpp + GGUF tools** (to merge and run offline)
```bash
# Clone and install gguf-py
git clone --depth 1 https://github.com/ggml-org/llama.cpp.git
pip install ./llama.cpp/gguf-py
pip install llama-cpp-python
```
→ Use these tools to merge LoRA + base weights into a single GGUF.
---
## 🐍 Usage
### 1. Inference with `transformers` + `PEFT` (online GPU/CPU)
```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
BASE_ID = "mistralai/Mistral-7B-v0.1"
ADAPTER_ID = "sallani/ELISARCyberAIEdge7B"
# 1) Load Mistral-7B base (FP16 or BF16) with automatic device placement
tokenizer = AutoTokenizer.from_pretrained(BASE_ID, use_fast=True)
base_model = AutoModelForCausalLM.from_pretrained(
BASE_ID,
torch_dtype="auto",
device_map="auto"
)
# 2) Load LoRA adapter on top
model = PeftModel.from_pretrained(
base_model,
ADAPTER_ID,
torch_dtype="auto",
device_map="auto"
)
model.eval()
# 3) Perform inference
prompt = (
"### Instruction:\n"
"Propose a set of secure-coding guidelines for Python web applications.\n"
"### Response:\n"
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
out = model.generate(
**inputs,
max_new_tokens=128,
temperature=0.8,
top_p=0.9,
repetition_penalty=1.1
)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```
* **`device_map="auto"`** places weights on GPU/CPU automatically (FP16 when supported).
* Adjust sampling parameters (`temperature`, `top_p`, `repetition_penalty`) for your use case.
### 2. Offline Edge Deployment via `llama.cpp` (Merged GGUF)
1. **Merge LoRA + base into a single GGUF**
```bash
# First argument: LoRA repo ID (or local folder); --base-model-id: HF ID of the FP16 base.
# Note: inline comments after a trailing backslash would break the line continuation.
python3 llama.cpp/convert_lora_to_gguf.py \
    sallani/ELISARCyberAIEdge7B \
    --base-model-id mistralai/Mistral-7B-v0.1 \
    --outfile elisar_full_f16.gguf
```
* The script pulls the FP16 base automatically from HF, applies LoRA deltas, and writes a merged GGUF.
2. **Run inference on edge**
* Copy `elisar_full_f16.gguf` to your edge device (Jetson Nano, Raspberry Pi 4 + GPU, NVIDIA T4).
* Use `llama.cpp` binary to run:
```bash
# Requires a compiled llama.cpp (e.g. cmake -B build && cmake --build build).
# -e enables escape processing so the \n sequences in the prompt become newlines.
./llama.cpp/build/bin/llama-cli \
    -m elisar_full_f16.gguf \
    -e \
    -p "### Instruction: Audit the following log entries for suspicious activity.\n---\n<log lines>\n---\n### Response:" \
    --temp 0.7 \
    --repeat-penalty 1.1 \
    -n 128
```
* **No internet** is required once the GGUF is on-device.
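If you prefer Python over the CLI on the device, the same GGUF can be loaded with `llama-cpp-python` (installed earlier). The sketch below is illustrative, not the card's official tooling: the model path, context size, and sample log line are assumptions.

```python
# Sketch: offline inference over the merged GGUF with llama-cpp-python.
# MODEL_PATH is illustrative; point it at the GGUF copied to your device.
MODEL_PATH = "elisar_full_f16.gguf"

def build_audit_prompt(log_lines: list[str]) -> str:
    """Wrap raw log lines in the card's Instruction/Response template."""
    body = "\n".join(log_lines)
    return (
        "### Instruction:\n"
        "Audit the following log entries for suspicious activity.\n"
        f"---\n{body}\n---\n"
        "### Response:\n"
    )

def run_audit(log_lines: list[str], model_path: str = MODEL_PATH) -> str:
    from llama_cpp import Llama  # imported lazily; needs llama-cpp-python
    llm = Llama(model_path=model_path, n_ctx=2048, verbose=False)
    out = llm(
        build_audit_prompt(log_lines),
        max_tokens=128,
        temperature=0.7,
        repeat_penalty=1.1,
    )
    return out["choices"][0]["text"]

if __name__ == "__main__":
    print(run_audit(["sshd: failed password for root from 10.0.0.5"]))
```

Keeping the prompt construction in a separate function makes it reusable between the CLI invocation above and programmatic use.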
---
## 📐 Model Details
* **Base architecture:**
Mistral-7B-v0.1 (32 transformer layers, 4096-dim embedding, 32 heads, causal LM).
* **LoRA configuration:**
* Rank = 64, α = 16
* Applied to Q/K/V and feed-forward projections
* Adapter snapshots ≈ 168 MB
* **Training corpus (ELISAR CyberAI Edge):**
* Public vulnerability databases (CVE entries, CVSS scoring).
* Real-world incident reports (MITRE ATT&CK red vs. blue logs).
* Secure-coding patterns (OWASP Top 10, SAST examples).
* Blue-team playbooks and defensive strategies.
* **Hyperparameters:**
* Learning rate = 1e-4, batch size = 16 per GPU, 3 epochs on 8×A100 (FP16).
* Validation on unseen CVE descriptions and red-team prompts.
* **Merged GGUF (FP16):**
* ~5 GB total after merging and trimming unnecessary metadata for on-device use.
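As a back-of-the-envelope check on adapter size: each adapted weight matrix of shape (d_out, d_in) contributes r × (d_in + d_out) trainable LoRA parameters. A minimal sketch follows; the per-layer shapes are illustrative assumptions (attention Q/K/V only), not the exact training configuration.

```python
def lora_param_count(shapes: list[tuple[int, int]], rank: int, n_layers: int) -> int:
    """Trainable LoRA params: rank * (d_in + d_out) per adapted matrix, per layer."""
    per_layer = sum(rank * (d_in + d_out) for d_out, d_in in shapes)
    return per_layer * n_layers

# Illustrative (d_out, d_in) shapes for attention Q/K/V in a Mistral-style
# layer: hidden size 4096, grouped-query K/V projections of width 1024.
ATTN_SHAPES = [(4096, 4096), (1024, 4096), (1024, 4096)]

total = lora_param_count(ATTN_SHAPES, rank=64, n_layers=32)
print(f"{total:,} trainable parameters (attention only, r=64)")
fp16_mb = total * 2 / 2**20  # 2 bytes per FP16 parameter
print(f"≈ {fp16_mb:.0f} MB in FP16")
```

Extending `shapes` with the feed-forward projections gives the count for the full configuration described above.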
---
## 🔖 Prompt Guidelines
* **Structured prompt**
```
### Instruction:
<clear cybersecurity or defensive AI task>
### Response:
```
* **Recommended sampling**
* `temperature=0.7–0.9` for balanced creativity.
* `top_p=0.9` for nucleus sampling.
* `repetition_penalty=1.1` to reduce loops.
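The template and sampling defaults above can be captured in a small helper. This is a sketch; the function and constant names are illustrative, not part of the released adapter.

```python
# Sketch: build the structured prompt and bundle the recommended sampling
# settings as keyword arguments for transformers' generate().
RECOMMENDED_SAMPLING = {
    "temperature": 0.8,         # mid-point of the 0.7-0.9 range above
    "top_p": 0.9,               # nucleus sampling
    "repetition_penalty": 1.1,  # reduce loops
    "do_sample": True,
}

def build_prompt(instruction: str) -> str:
    """Wrap a cybersecurity or defensive-AI task in the template."""
    return f"### Instruction:\n{instruction.strip()}\n### Response:\n"

def extract_response(generated: str) -> str:
    """Return only the text after the final '### Response:' marker."""
    return generated.rsplit("### Response:", 1)[-1].strip()
```

With the `transformers` setup from the Usage section, generation then becomes `model.generate(**inputs, max_new_tokens=128, **RECOMMENDED_SAMPLING)`.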
---
## 📊 Stats & Adoption
* 🔄 Download tracking: [Enabled](https://huggingface.co/sallani/ELISARCyberAIEdge7B)
* 📥 Total downloads (last 30 days): _auto-updated by HF_
* 🧪 Being tested on:
- Jetson Nano (Ubuntu 20.04, CUDA 11.4)
- Raspberry Pi 4 (with Coral TPU)
- NVIDIA T4 + llama.cpp
Want to share your benchmarks? Open an [Issue](https://huggingface.co/sallani/ELISARCyberAIEdge7B/issues) or pull request.
## ⚠️ License & Citation
* **License:** Apache 2.0 (see [LICENSE](LICENSE)).
* **Attribution:**
> Sallani, S. (2025). *ELISARCyberAIEdge7B: LoRA adapter for Mistral-7B specializing in offline CyberAI Edge tasks*. Hugging Face Model Hub: `sallani/ELISARCyberAIEdge7B`.
---
## 🛠️ Support & Contact
* **Report issues or feature requests:**
[https://huggingface.co/sallani/ELISARCyberAIEdge7B/issues](https://huggingface.co/sallani/ELISARCyberAIEdge7B/issues)
* **Contact the author:**
Dr. Sabri Sallani
• GitHub: [@sallani](https://github.com/sallani)
• Email: `sabri.sallani@cyberaiedge.com`
• LinkedIn: [linkedin.com/in/sabri-sallani](https://linkedin.com/in/sabri-sallani)
Thank you for using **ELISARCyberAIEdge7B**. This adapter empowers secure, offline AI at the edge for next-gen CyberAI and Blue-Team applications.