Trained Transformer model checkpoints from my experiments, including saved variants for benchmarking, analysis, and further training.