Spaces:

ledinhminhquan
/

toxicity-agent-api

Running

toxicity-agent-api / docs /error_analysis.md

ledinhminhquan

deploy FastAPI backend to HF Space

9302284 2 months ago

879 Bytes

Error Analysis (Privacy-Preserving)

The assignment expects an error analysis section explaining what the model fails on and why.

To avoid printing or storing offensive content, we implement error analysis that:

overall metrics at a threshold
confusion counts per label (TP/FP/FN/TN)
feature summaries for FP vs FN:
- length (chars/words)
- uppercase ratio
- punctuation ratio
- URL/email presence
- repeated characters
- non-ascii ratio
top error cases (hashed only) with:
- per-label probabilities and true/pred labels
- numeric features

toxicity-agent error-analysis --config configs/train.yaml --split test --threshold 0.5

Output: artifacts/runs/error_analysis/error-analysis-<timestamp>.json