·
AI & ML interests
None yet
Organizations
None yet
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-RP-0425
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-RP-0425-2
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-default
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-RP
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-noRW-RP
8B
•
Updated
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RP-v3
Text Generation
•
3B
•
Updated
•
2
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RPinline
3B
•
Updated
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RP-v2
Text Generation
•
3B
•
Updated
•
2
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW-RP
Text Generation
•
3B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-noRW-noformat
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-selected-cosine-noRW-noformat
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-noRW
8B
•
Updated
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-default
Text Generation
•
3B
•
Updated
•
6
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW
Text Generation
•
3B
•
Updated
•
3
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-v2
Text Generation
•
3B
•
Updated
•
1
Lansechen/Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine
Text Generation
•
3B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-v6
8B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-only
Text Generation
•
8B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine-v4
Text Generation
•
8B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-olympiads-aime-unique-cosine
Text Generation
•
8B
•
Updated
•
2
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-log
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-cosine
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-v2
Text Generation
•
8B
•
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat
Text Generation
•
8B
•
Updated
•
2
Lansechen/Qwen2.5-7B-Openq-R1-GRPO-math-lighteval-2th-epoch-withoutformat
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-2th-epoch-withoutformat
Updated
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-1epochstop-withformat
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted-sync
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-nonscale-weighted
Text Generation
•
8B
•
Updated
•
1
Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-weighted
Text Generation
•
8B
•
Updated
•
1