Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
starsfriday
/
Qwen2.5-7B-Instruct-RZB-1M
like
0
Text Generation
Transformers
Safetensors
qwen2
unsloth
trl
grpo
conversational
text-generation-inference
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
starsfriday
commited on
Mar 19, 2025
Commit
39dfe40
路
verified
路
1 Parent(s):
80a9e66
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-1
README.md
CHANGED
Viewed
@@ -8,7 +8,7 @@ tags:
8
9
# Model Card for Model ID
10
11
-
<!-- Provide a quick summary of what the model is/does. -->
12
13
14
8
9
# Model Card for Model ID
10
11
+
鏍规嵁寮辨櫤鍚ф暟鎹井璋冪殑GRPO闂瓟妯″瀷
12
13
14