MoeReward/rl_checkpoints
Project of MoE reward model
Safetensors
rl_checkpoints at d2c0d5d, 143 GB
1 contributor, History: 4 commits
Latest commit: shengyi-qian, "nq checkpoint" (d2c0d5d), about 1 year ago
Name                                            Last commit           Updated
qwen1.5_base_rule_base_arc_heavy_grpo_naive     three checkpoints     about 1 year ago
qwen1.5_base_rule_base_equal_dist_grpo_naive    three checkpoints     about 1 year ago
qwen1.5_base_rule_base_grpo_naive               qwen1.5 rule based    about 1 year ago
qwen1.5_base_rule_base_imdb_heavy_grpo_naive    three checkpoints     about 1 year ago
qwen1.5_base_rule_base_nq_heavy_grpo_naive      nq checkpoint         about 1 year ago
.gitattributes (1.56 kB)                        qwen1.5 rule based    about 1 year ago