K1-v6-zero / README.md
win10's picture
Create README.md
4822136 verified
---
license: apache-2.0
datasets:
- stepfun-ai/Step-3.5-Flash-SFT
- ianncity/KIMI-K2.5-1000000x
language:
- zh
- en
base_model:
- huihui-ai/Huihui-gemma-4-31B-it-abliterated
pipeline_tag: text-generation
library_name: transformers
tags:
- RL
- text-generation-inference
- Gemma4
- grpo
- r1-zero
---
This is an experimental model trained with SFT and GRPO-ZERO, and it comes with absolutely no guarantee that it won’t break, collapse, or otherwise fail spectacularly.