Model checkpoints generated during an ongoing research effort into the acceleration potential and tuning quality of LLMs with RL fine tuning.