PixN/MY_FIRST_RL
GGUF
English
PixN committed on Dec 27, 2025
Commit 95790fa · verified · 1 parent: efd517d
OMG I HATE JAPANESE
Files changed (1)
README.md +2 -1
@@ -3,4 +3,5 @@ language:
 - en
 base_model:
 - unsloth/Qwen3-4B-Base
----
+---
+A Qwen3-4B-Base model trained with Unsloth's GRPO as a winter-break independent research project. It was meant to specialize in mathematical reasoning, but in practice it does not work very well.