Evaluation token corpus
eval_tokens.pt — the multi-script held-out evaluation corpus used by the
temporal-localization / intervention experiments (--eval-tokens input).
Tokenized (Pythia BPE) short slices per language, built by streaming
wikimedia/wikipedia (CC-BY-SA 4.0; text attribution: Wikipedia contributors).
The builder script ships with the code release
(experiments/causal/temporal_localization_patching/; see docs/DATA.md).