---
license: apache-2.0
datasets:
- stepfun-ai/Step-3.5-Flash-SFT
- ianncity/KIMI-K2.5-1000000x
language:
- zh
- en
base_model:
- huihui-ai/Huihui-gemma-4-31B-it-abliterated
pipeline_tag: text-generation
library_name: transformers
tags:
- RL
- text-generation-inference
- Gemma4
- grpo
- r1-zero
---

This is an experimental model trained with SFT and GRPO-ZERO, and it comes with absolutely no guarantee that it won’t break, collapse, or otherwise fail spectacularly.