drond0174 commited on
Commit
dbb0190
·
verified ·
1 Parent(s): e77eaae

Document LookBack NPZ feature caches

Browse files
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -20,6 +20,8 @@ Checkpoints and test predictions for span-level hallucination detection in tool-
20
  | `deberta_contradiction_tuned/` | Tool-aware DeBERTa fine-tuned on mixed train (contradiction oversample ×3) |
21
  | `predictions/` | `mixed_test` span predictions (DeBERTa, LookBack, Lettuce) |
22
  | `lookback/lookback_mixed_classifier.joblib` | Sklearn head for LookBackLens (TinyLlama features) |
 
 
23
 
24
  Dataset: [drond0174/RAGTruth-Hallucinations](https://huggingface.co/datasets/drond0174/RAGTruth-Hallucinations)
25
 
@@ -37,6 +39,6 @@ model = AutoModelForTokenClassification.from_pretrained(
37
 
38
  See `deberta_contradiction_tuned/run_meta.json` for threshold, best epoch, and validation F1.
39
 
40
- ## Note
41
 
42
- LookBack feature `.npz` caches are not included (large; regenerable). Only the classifier head is stored.
 
20
  | `deberta_contradiction_tuned/` | Tool-aware DeBERTa fine-tuned on mixed train (contradiction oversample ×3) |
21
  | `predictions/` | `mixed_test` span predictions (DeBERTa, LookBack, Lettuce) |
22
  | `lookback/lookback_mixed_classifier.joblib` | Sklearn head for LookBackLens (TinyLlama features) |
23
+ | `lookback/lookback_mixed_train_features.npz` | Cached train attention features (~1.1 GB) |
24
+ | `lookback/lookback_mixed_val_features.npz` | Cached validation attention features (~164 MB) |
25
 
26
  Dataset: [drond0174/RAGTruth-Hallucinations](https://huggingface.co/datasets/drond0174/RAGTruth-Hallucinations)
27
 
 
39
 
40
  See `deberta_contradiction_tuned/run_meta.json` for threshold, best epoch, and validation F1.
41
 
42
+ ## LookBack feature caches
43
 
44
+ Download `lookback/*_features.npz` to skip re-running TinyLlama feature extraction. Point `train_cache_path` / `val_cache_path` in `lookback_baseline.py` to the downloaded files.