truthfulqa/truthful_qa
Viewer • Updated • 1.63k • 89.3k • 282
A fine-tuned DeBERTa-v3-base model for detecting hallucinations in LLM outputs. Classifies text as grounded or hallucinated given a claim and optional context.
grounded (0), hallucinated (1)| Metric | Score |
|---|---|
| Accuracy | 0.9371 |
| F1 | 0.9372 |
| Precision | 0.9365 |
| Recall | 0.9379 |
from transformers import pipeline
classifier = pipeline(
"text-classification",
model="amritanshu05/hallucination-classifier-deberta"
)
result = classifier("The Eiffel Tower is located in Berlin. [SEP] Where is the Eiffel Tower?")
print(result)
# [{'label': 'hallucinated', 'score': 0.97}]
This model powers the hallucination detection layer in AuditAI — a production LLM observability platform.