yenklabs/legal-ai-failure-database
AI reliability, legal AI, evaluation benchmarks, citation verification, evidence infrastructure, reproducibility, provenance systems, retrieval evaluation, small language models, AI safety.
Making AI-assisted legal work independently verifiable.
Cross-jurisdiction evidence for reproducible legal AI research.
Open benchmark for evaluating legal AI citation verification.
Shared vocabulary for classifying legal AI verification outcomes.
Open datasets, methodologies, and reproducible evaluation workflows.
```python
from datasets import load_dataset

dataset = load_dataset("yenklabs/open-evidence-corpus")
```
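After loading, a reasonable first step is to check which splits the corpus exposes and how many records each one holds. The sketch below is a minimal, hedged illustration: `summarize_splits` and `load_and_summarize` are hypothetical helper names, not part of the Dali tooling. It assumes the repository loads as a `DatasetDict`, which is what `load_dataset` returns when no `split` argument is given, and it makes no assumption about the split names or column schema.

```python
from collections.abc import Mapping, Sized


def summarize_splits(splits: Mapping[str, Sized]) -> dict[str, int]:
    """Count the records in each split of a dataset-like mapping.

    Works on a `datasets.DatasetDict` (a dict of split name -> Dataset)
    as well as on a plain dict of lists, which keeps it easy to test.
    """
    return {name: len(rows) for name, rows in splits.items()}


def load_and_summarize(repo_id: str = "yenklabs/open-evidence-corpus") -> dict[str, int]:
    """Load the corpus from the Hugging Face Hub and count rows per split.

    Hypothetical convenience wrapper: needs `pip install datasets` and
    network access, and assumes the repo loads as a DatasetDict.
    """
    # Imported lazily so summarize_splits has no third-party dependency.
    from datasets import load_dataset

    dataset = load_dataset(repo_id)
    return summarize_splits(dataset)
```

Calling `load_and_summarize()` with network access should return a dictionary keyed by whatever split names the corpus defines. Inspecting `dataset[split_name].features` afterwards shows the column schema without relying on guessed field names.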
Lightweight, reproducible research models — not foundation models.
| Release | Model | Purpose |
|---|---|---|
| v0.1 | Dali Verification Taxonomy Classifier | Predict standardized verification outcome labels |
| v0.2 | Dali Citation Risk Classifier | Estimate citation verification risk from evidence metadata |
| v0.3 | Dali Authority Matching Baseline | Reproducible baseline for authority matching experiments |
| v0.4 | Dali Proposition Support Classifier | Classify proposition support relationships for legal authorities |
Datasets → Benchmarks → Models → Interactive Spaces → Community Contributions
Dali is building open evidence infrastructure for legal AI.