Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

YUNTA88
/
rl4phyx-backup

Safetensors
Model card Files Files and versions
xet
Community
rl4phyx-backup / ZeroSearch /One-Shot-RLVR /examples /ppo_trainer
279 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
YUNTA88's picture
YUNTA88
Upload folder using huggingface_hub
9a71cb6 verified about 2 months ago
  • run_deepseek7b_llm.sh
    1.8 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_deepseek7b_llm_sp2.sh
    1.91 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_deepseek_full_hh_rlhf.sh
    1.78 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_deepseek_math_gsm8k_megatron.sh
    1.7 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_deepseek_megatron.sh
    1.98 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_gemma.sh
    1.7 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_qwen2-7b.sh
    2.04 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_qwen2-7b_math_gsm8k_megatron.sh
    1.69 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_qwen2-7b_rm.sh
    3.12 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_qwen2-7b_rm_seq_balance.sh
    2.55 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_qwen2-7b_seq_balance.sh
    2.15 kB
    Upload folder using huggingface_hub about 2 months ago
  • run_qwen2.5-32b.sh
    2.12 kB
    Upload folder using huggingface_hub about 2 months ago
  • verl_getting_started.ipynb
    254 kB
    Upload folder using huggingface_hub about 2 months ago