YUNTA88/rl4phyx-backup
Branch: main
Path: rl4phyx-backup/ZeroSearch/One-Shot-RLVR/tests/e2e (60.9 kB)
1 contributor. History: 1 commit.
Latest commit: YUNTA88, "Upload folder using huggingface_hub" (9a71cb6, verified), 2 months ago
Contents (every entry was last changed by the commit "Upload folder using huggingface_hub", 2 months ago):

arithmetic_sequence (directory)
envs (directory)
__init__.py (600 Bytes)
check_results.py (1.65 kB)
run_deepseek_megatron.sh (1.8 kB)
run_qwen_gsm8k_function_rm.sh (1.75 kB)
run_qwen_gsm8k_function_rm_grpo.sh (1.43 kB)
run_qwen_gsm8k_function_rm_no_rmpad.sh (1.75 kB)
run_qwen_gsm8k_function_rm_remax.sh (1.43 kB)
run_qwen_gsm8k_model_rm.sh (2.12 kB)
run_qwen_gsm8k_model_rm_liger_kernel.sh (2.13 kB)
run_qwen_gsm8k_model_rm_no_rmpad.sh (2.12 kB)
run_qwen_gsm8k_model_rm_seq_balance.sh (2.26 kB)
run_qwen_gsm8k_model_rm_ulysses.sh (2.43 kB)
run_qwen_megatron.sh (1.72 kB)
run_ray_trainer.sh (1.5 kB)
run_ray_trainer_rmpad.sh (598 Bytes)