--- tags: - text-classification - recruitment - forensics - security license: mit datasets: - dcata004/recruiter-harvesting-dataset-v1 pipeline_tag: text-classification --- # 🐍 V.I.P.E.R. Classification Engine (v1.0) **Maintainer:** [Cata Risk Lab](https://huggingface.co/Cata-Risk-Lab) ## 🧠 Model Overview This repository contains the configuration and architecture definitions for the **V.I.P.E.R.** recruitment auditing system. It defines the risk thresholds and vectorization parameters used to detect "Resume Harvesting" attacks. ## 🛠️ Configuration The model operates on a `TfidfVectorizer` pipeline optimized for short-text classification of email subjects and bodies. - **Risk Threshold:** 0.75 (Confidence score required to flag as SPAM) - **Labels:** `['harvesting', 'legitimate']` - **Dataset:** Trained on forensic recruitment data (Swiss/US/UK). ## ⚖️ Sovereign AI Designed for local inference to protect user data privacy.