Preprocessed datasets for training and evaluating DDRO generative retrieval models on MS MARCO and Natural Questions.