YUNTA88
/

rl4phyx-backup

Model card Files Files and versions

rl4phyx-backup / ZeroSearch /One-Shot-RLVR /examples /ppo_trainer

279 kB

Ctrl+K

Ctrl+K

1 contributor

History: 1 commit

YUNTA88's picture

Upload folder using huggingface_hub

9a71cb6 verified about 2 months ago

run_deepseek7b_llm.sh

1.8 kB
Upload folder using huggingface_hub about 2 months ago
run_deepseek7b_llm_sp2.sh

1.91 kB
Upload folder using huggingface_hub about 2 months ago
run_deepseek_full_hh_rlhf.sh

1.78 kB
Upload folder using huggingface_hub about 2 months ago
run_deepseek_math_gsm8k_megatron.sh

1.7 kB
Upload folder using huggingface_hub about 2 months ago
run_deepseek_megatron.sh

1.98 kB
Upload folder using huggingface_hub about 2 months ago
run_gemma.sh

1.7 kB
Upload folder using huggingface_hub about 2 months ago
run_qwen2-7b.sh

2.04 kB
Upload folder using huggingface_hub about 2 months ago
run_qwen2-7b_math_gsm8k_megatron.sh

1.69 kB
Upload folder using huggingface_hub about 2 months ago
run_qwen2-7b_rm.sh

3.12 kB
Upload folder using huggingface_hub about 2 months ago
run_qwen2-7b_rm_seq_balance.sh

2.55 kB
Upload folder using huggingface_hub about 2 months ago
run_qwen2-7b_seq_balance.sh

2.15 kB
Upload folder using huggingface_hub about 2 months ago
run_qwen2.5-32b.sh

2.12 kB
Upload folder using huggingface_hub about 2 months ago
verl_getting_started.ipynb

254 kB
Upload folder using huggingface_hub about 2 months ago