Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

work-dwivediishivam
/
runway-zero-training-artifacts

Reinforcement Learning
openenv
grpo
llm-agents
airport-operations
hackathon
Model card Files Files and versions
xet
Community
runway-zero-training-artifacts
4 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 11 commits
work-dwivediishivam's picture
work-dwivediishivam
Update model card with four hosted GRPO runs
5efb60d verified 27 days ago
  • gemma4_31b_it_hf_grpo
    Upload gemma4_31b_it_hf_grpo/gemma4_31b_it_hf_grpo_summary.json with huggingface_hub 27 days ago
  • gpt_oss_120b_hf_grpo
    Upload gpt_oss_120b_hf_grpo/gpt_oss_120b_hf_grpo_summary.json with huggingface_hub 27 days ago
  • qwen25_coder_7b_hf_grpo
    Upload qwen25_coder_7b_hf_grpo/runway_zero_qwen25_coder_7b_hf_grpo.tgz with huggingface_hub 28 days ago
  • qwen3_14b_hf_grpo
    Upload qwen3_14b_hf_grpo/qwen3_14b_hf_grpo_summary.json with huggingface_hub 27 days ago
  • .gitattributes
    1.52 kB
    initial commit 28 days ago
  • README.md
    2.93 kB
    Update model card with four hosted GRPO runs 27 days ago