---
tags:
- text-classification
- recruitment
- forensics
- security
license: mit
datasets:
- dcata004/recruiter-harvesting-dataset-v1
pipeline_tag: text-classification
---

# 🐍 V.I.P.E.R. Classification Engine (v1.0)
**Maintainer:** [Cata Risk Lab](https://huggingface.co/Cata-Risk-Lab)

## 🧠 Model Overview
This repository contains the configuration and architecture definitions for the **V.I.P.E.R.** recruitment auditing system. It defines the risk thresholds and vectorization parameters used to detect "Resume Harvesting" attacks.

## 🛠️ Configuration
The model operates on a `TfidfVectorizer` pipeline optimized for short-text classification of email subjects and bodies.

- **Risk Threshold:** 0.75 (minimum confidence score required to flag a message as `harvesting`)
- **Labels:** `['harvesting', 'legitimate']`
- **Dataset:** Trained on forensic recruitment data from Switzerland, the US, and the UK (`dcata004/recruiter-harvesting-dataset-v1`).
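
The settings above can be sketched as a small scikit-learn pipeline. This is a minimal illustration, not the shipped implementation: the `LogisticRegression` classifier, the n-gram range, the helper names (`build_pipeline`, `is_flagged`, `score_message`), and the toy training messages are all assumptions made for the example. Only the `TfidfVectorizer` stage, the 0.75 threshold, and the two labels come from this card.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

RISK_THRESHOLD = 0.75                   # from the configuration above
LABELS = ["harvesting", "legitimate"]   # from the configuration above

# Hypothetical toy messages, for illustration only (not the real dataset).
TOY_TEXTS = [
    "Urgent role, send your full CV and passport copy today",
    "Send resume now, no interview needed, many openings",
    "We collect CVs for future roles, reply with your resume",
    "Interview confirmed for Tuesday at 10:00 with the hiring manager",
    "Thanks for applying, here is the job description and salary band",
    "Following up on our call about the backend engineer position",
]
TOY_LABELS = ["harvesting"] * 3 + ["legitimate"] * 3


def build_pipeline():
    """TF-IDF features followed by a classifier (the classifier is an assumption)."""
    return make_pipeline(
        TfidfVectorizer(lowercase=True, ngram_range=(1, 2)),
        LogisticRegression(),
    )


def is_flagged(p_harvesting, threshold=RISK_THRESHOLD):
    """Flag only when the 'harvesting' probability reaches the threshold."""
    return bool(p_harvesting >= threshold)


def score_message(pipeline, text, threshold=RISK_THRESHOLD):
    """Return (flagged, probability of 'harvesting') for one message."""
    classes = list(pipeline.classes_)
    proba = pipeline.predict_proba([text])[0]
    p_harvesting = float(proba[classes.index("harvesting")])
    return is_flagged(p_harvesting, threshold), p_harvesting


if __name__ == "__main__":
    model = build_pipeline().fit(TOY_TEXTS, TOY_LABELS)
    print(score_message(model, "Please send your CV and ID copy immediately"))
```

Comparing the `harvesting` probability against the threshold, rather than taking the most likely label, means a message scored at 0.6 is left unflagged even though `harvesting` is the more probable class.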

## ⚖️ Sovereign AI
V.I.P.E.R. is designed for local inference, so email content can be classified without leaving the user's machine.