Training datasets for the CoT Trajectory Oracle. Includes the v5 corpus and conversational QA pairs.
Note Position-count-balanced reasoning termination training data (15K examples)