Open Privacy Policy Taxonomy (OPPT)
AI & ML interests
Open Privacy Policy Taxonomy (OPPT) is a research initiative creating open, annotated datasets for privacy policy analysis, dark pattern detection, and NLP research. We extend the foundational OPP-115 corpus (Wilson et al., ACL 2016) with modern regulatory categories reflecting GDPR, CCPA/CPRA, and emerging AI governance frameworks.

Current Release: OPPT-T1_C1.0_Section_Jan2026
• 123 privacy policies from major tech, finance, healthcare, and data broker companies
• 3,651 annotated segments with 14 taxonomy categories
• Three-model consensus methodology (Claude Haiku 4.5, GPT-5.2, Gemini-3-flash-preview)
• CC BY-NC 4.0 license (commercial licensing available)

Paper: https://arxiv.org/abs/2601.20792

Research Focus:
• Privacy policy classification and NLP
• Dark pattern detection in privacy disclosures
• Regulatory compliance analysis (GDPR, CCPA, state privacy laws)
• Longitudinal policy tracking

Contact: tebrackin@outlook.com (commercial licensing inquiries)