win10 commited on
Commit
4822136
·
verified ·
1 Parent(s): 7bf6b6b

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -0
README.md ADDED
@@ -0,0 +1,21 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - stepfun-ai/Step-3.5-Flash-SFT
5
+ - ianncity/KIMI-K2.5-1000000x
6
+ language:
7
+ - zh
8
+ - en
9
+ base_model:
10
+ - huihui-ai/Huihui-gemma-4-31B-it-abliterated
11
+ pipeline_tag: text-generation
12
+ library_name: transformers
13
+ tags:
14
+ - RL
15
+ - text-generation-inference
16
+ - Gemma4
17
+ - grpo
18
+ - r1-zero
19
+ ---
20
+
21
+ This is an experimental model trained with SFT and GRPO-ZERO, and it comes with absolutely no guarantee that it won’t break, collapse, or otherwise fail spectacularly.