Training dataset and LoRA checkpoints for the arXiv 2026 preprint Controllable Reasoning Models are Private Thinkers