Using 80/20 split, HuggingFace dataset method, seed 42