Upload jokegen RL checkpoint (Kimi-K2-Thinking LoRA fine-tune) 522e8bf verified sdan commited on Jan 26