K1-v6-zero / README.md
win10's picture
Create README.md
4822136 verified
metadata
license: apache-2.0
datasets:
  - stepfun-ai/Step-3.5-Flash-SFT
  - ianncity/KIMI-K2.5-1000000x
language:
  - zh
  - en
base_model:
  - huihui-ai/Huihui-gemma-4-31B-it-abliterated
pipeline_tag: text-generation
library_name: transformers
tags:
  - RL
  - text-generation-inference
  - Gemma4
  - grpo
  - r1-zero

This is an experimental model trained with SFT and GRPO-ZERO, and it comes with absolutely no guarantee that it won’t break, collapse, or otherwise fail spectacularly.