repro path

README.md CHANGED

@@ -167,10 +167,16 @@ Key Findings:
 All calibration metrics can be reproduced using the included evaluation script:
 
 ```bash
+# Auto-detect mode (uses defaults)
 python eval_calibration.py --probert
+
+# Explicit paths (for custom locations)
+python eval_calibration.py \
+    --model_dir probert_model \
+    --csv probert_training_20260131_004706.csv
 ```
 
-
+The `--probert` flag auto-detects the model directory and latest predictions CSV. The script computes ECE, confidence gaps, and high-confidence error rates. Full source included in the model repository for transparency.
 
 **Evaluation Transparency:**
 
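The paragraph added in this commit says the script reports ECE, confidence gaps, and high-confidence error rates. As a rough illustration of what those metrics mean (this is not `eval_calibration.py`'s actual implementation — the 10-bin equal-width scheme, the gap definition, and the 0.9 confidence threshold are all assumptions), they can be computed as:

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """ECE: bin predictions by confidence, then take the weighted mean of
    |accuracy - mean confidence| across bins (equal-width bins assumed)."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.mean() * gap  # weight by fraction of samples in bin
    return ece

# Toy predictions: confidence of each prediction and whether it was correct.
confs = np.array([0.95, 0.95, 0.6, 0.6])
hits = np.array([1, 1, 1, 0])

print(expected_calibration_error(confs, hits))  # ECE over 10 bins
print(confs.mean() - hits.mean())               # confidence gap (assumed: mean confidence - accuracy)
print(1 - hits[confs >= 0.9].mean())            # error rate among high-confidence (>= 0.9, assumed) predictions
```

Running the actual script against the shipped predictions CSV is the authoritative way to reproduce the README's numbers; the sketch above only shows the shape of each metric.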