Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
varsunk
/
Qwen2-0.5B-Instruct-GRPO-test
like
0
Transformers
TensorBoard
Safetensors
AI-MO/NuminaMath-TIR
Generated from Trainer
grpo
trl
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Deploy
Use this model
main
Qwen2-0.5B-Instruct-GRPO-test
Commit History
End of training
b3b6da7
verified
varsunk
commited on
Jul 7, 2025
Model save
ada7114
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 440
dee91ff
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 430
77cd708
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 420
54037c6
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 410
7439ac8
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 400
a7107d7
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 390
f2a7c3a
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 380
75e6d9c
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 370
9f47f86
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 360
1c5fb01
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 350
f8121a6
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 340
d69d529
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 330
8ed82b8
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 320
e65563d
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 310
60972c7
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 300
ace59bc
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 290
fb2f759
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 280
fbf2715
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 270
5f4ac26
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 260
9797d6c
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 250
c7ef225
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 240
833ddaf
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 230
e6db71e
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 220
50cd579
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 210
bb84387
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 200
71f712b
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 190
8fd557d
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 180
8745d78
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 170
f47e854
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 160
65ade4b
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 150
471b204
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 140
6532321
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 130
39633f6
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 120
585251d
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 110
97ba42c
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 100
b33209e
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 90
631aacf
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 80
b2341d9
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 70
4e0c454
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 60
827df42
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 50
b593870
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 40
6321263
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 30
be70c9c
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 20
2af3cb6
verified
varsunk
commited on
Jul 7, 2025
Training in progress, step 10
fea63ac
verified
varsunk
commited on
Jul 7, 2025
initial commit
251e55d
verified
varsunk
commited on
Jul 7, 2025