Atotti/TinySwallow-GRPO-TMethod-experimental
Text Generation
Transformers
Safetensors
Japanese
qwen2
text-generation-inference
unsloth
trl
grpo
conversational
License: apache-2.0
Atotti committed on Mar 2, 2025
Commit 128e1a0 · verified · 1 Parent(s): 764ddd7
Update README.md
Files changed (1)
README.md (+1 -1)
@@ -9,7 +9,7 @@ tags:
 - grpo
 license: apache-2.0
 language:
-- en
+- ja
 ---
 
 # Uploaded model
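
The change above only touches the `language` entry in the model card's YAML front matter. As a rough illustration of how that field could be checked after such an edit, here is a minimal sketch in plain Python. The helper name `front_matter_languages` and the `SAMPLE` text are hypothetical: the sample is assembled only from the lines visible in this diff and is not the full README, and the parser handles just this simple list layout rather than general YAML.

```python
def front_matter_languages(readme_text: str) -> list[str]:
    """Return the entries listed under `language:` in the YAML front matter.

    Hypothetical helper: handles only the simple layout shown in the diff,
    not arbitrary YAML.
    """
    lines = readme_text.splitlines()
    # Front matter must open with a `---` line at the very top of the file.
    if not lines or lines[0].strip() != "---":
        return []

    languages = []
    in_language = False
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter of the front matter
            break
        if stripped.startswith("language:"):
            in_language = True
            inline = stripped[len("language:"):].strip()
            if inline:  # scalar form, e.g. `language: ja`
                languages.append(inline)
                in_language = False
            continue
        if in_language:
            if stripped.startswith("- "):  # list item under `language:`
                languages.append(stripped[2:].strip())
            elif stripped:  # any other key ends the list
                in_language = False
    return languages


# Hypothetical sample built only from the lines visible in the diff.
SAMPLE = """---
tags:
- grpo
license: apache-2.0
language:
- ja
---

# Uploaded model
"""

print(front_matter_languages(SAMPLE))
```

For real model cards a full YAML parser would be the safer choice; this sketch avoids third-party dependencies so it stays self-contained.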