Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

LeTue09
/
arithmetic-grpo

Model card Files Files and versions
xet
Community
arithmetic-grpo / examples /sglang_multiturn /config
18.3 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
LeTue09's picture
LeTue09
initial clean commit
1faccd4 about 1 month ago
  • interaction_config
    initial clean commit about 1 month ago
  • tool_config
    initial clean commit about 1 month ago
  • geo3k_multiturn_grpo.yaml
    6.26 kB
    initial clean commit about 1 month ago
  • geo3k_multiturn_megatron_grpo.yaml
    6.27 kB
    initial clean commit about 1 month ago
  • gsm8k_multiturn_grpo.yaml
    335 Bytes
    initial clean commit about 1 month ago
  • gsm8k_multiturn_grpo_server.yaml
    500 Bytes
    initial clean commit about 1 month ago
  • gsm8k_multiturn_grpo_w_interaction.yaml
    330 Bytes
    initial clean commit about 1 month ago
  • gsm8k_multiturn_megatron_grpo.yaml
    347 Bytes
    initial clean commit about 1 month ago
  • retool_multiturn_grpo.yaml
    414 Bytes
    initial clean commit about 1 month ago
  • search_multiturn_grpo.yaml
    371 Bytes
    initial clean commit about 1 month ago
  • search_multiturn_grpo_one_step_off.yaml
    371 Bytes
    initial clean commit about 1 month ago