Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
akashmaggon
/
Qwen-4B-GRPO-RedditModerator
like
0
Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Deploy
Use this model
main
Qwen-4B-GRPO-RedditModerator
Commit History
Training in progress, step 200
9a067d9
verified
akashmaggon
commited on
Oct 19, 2025
Training in progress, step 150
197fb7b
verified
akashmaggon
commited on
Oct 19, 2025
Training in progress, step 100
688d93d
verified
akashmaggon
commited on
Oct 19, 2025
Training in progress, step 50
7972ee0
verified
akashmaggon
commited on
Oct 19, 2025
Training in progress, step 200
a5fabd8
verified
akashmaggon
commited on
Oct 17, 2025
Training in progress, step 150
3d34727
verified
akashmaggon
commited on
Oct 17, 2025
Training in progress, step 100
62e3e0e
verified
akashmaggon
commited on
Oct 17, 2025
Training in progress, step 50
c192c9b
verified
akashmaggon
commited on
Oct 17, 2025
Training in progress, step 818
8072c66
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 800
1a4a23a
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 750
9bdcb30
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 700
10cddc2
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 650
ba03e36
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 600
1e09076
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 550
afe4ef3
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 500
ddd8d71
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 450
9fe5282
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 400
0cf8e54
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 350
d037908
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 300
93c3f3e
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 250
7921300
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 200
8d28e6c
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 150
e862640
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 100
15fa174
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 50
474e1ab
verified
akashmaggon
commited on
Oct 15, 2025
Training in progress, step 120
c052ea9
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 110
1286d1a
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 100
e430d6e
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 90
dac4dda
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 80
d3cb920
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 70
3b2a186
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 60
74cef2e
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 50
5fec0f3
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 40
70e018b
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 30
1b08a1c
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 20
9ccc1ed
verified
akashmaggon
commited on
Oct 14, 2025
Training in progress, step 10
a1c3ca9
verified
akashmaggon
commited on
Oct 14, 2025
initial commit
68b3069
verified
akashmaggon
commited on
Oct 14, 2025