Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
YUNTA88
/
rl4phyx-backup
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
rl4phyx-backup
/
ZeroSearch
/
One-Shot-RLVR
/
examples
/
ppo_trainer
279 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
YUNTA88
Upload folder using huggingface_hub
9a71cb6
verified
about 2 months ago
run_deepseek7b_llm.sh
1.8 kB
Upload folder using huggingface_hub
about 2 months ago
run_deepseek7b_llm_sp2.sh
1.91 kB
Upload folder using huggingface_hub
about 2 months ago
run_deepseek_full_hh_rlhf.sh
1.78 kB
Upload folder using huggingface_hub
about 2 months ago
run_deepseek_math_gsm8k_megatron.sh
1.7 kB
Upload folder using huggingface_hub
about 2 months ago
run_deepseek_megatron.sh
1.98 kB
Upload folder using huggingface_hub
about 2 months ago
run_gemma.sh
1.7 kB
Upload folder using huggingface_hub
about 2 months ago
run_qwen2-7b.sh
2.04 kB
Upload folder using huggingface_hub
about 2 months ago
run_qwen2-7b_math_gsm8k_megatron.sh
1.69 kB
Upload folder using huggingface_hub
about 2 months ago
run_qwen2-7b_rm.sh
3.12 kB
Upload folder using huggingface_hub
about 2 months ago
run_qwen2-7b_rm_seq_balance.sh
2.55 kB
Upload folder using huggingface_hub
about 2 months ago
run_qwen2-7b_seq_balance.sh
2.15 kB
Upload folder using huggingface_hub
about 2 months ago
run_qwen2.5-32b.sh
2.12 kB
Upload folder using huggingface_hub
about 2 months ago
verl_getting_started.ipynb
254 kB
Upload folder using huggingface_hub
about 2 months ago