work-dwivediishivam/runway-zero-training-artifacts
Tags: Reinforcement Learning, openenv, grpo, llm-agents, airport-operations, hackathon
License: mit
main
4 GB
1 contributor
History: 11 commits
Latest commit: "Update model card with four hosted GRPO runs" by work-dwivediishivam (5efb60d, verified), 27 days ago
gemma4_31b_it_hf_grpo/: "Upload gemma4_31b_it_hf_grpo/gemma4_31b_it_hf_grpo_summary.json with huggingface_hub", 27 days ago
gpt_oss_120b_hf_grpo/: "Upload gpt_oss_120b_hf_grpo/gpt_oss_120b_hf_grpo_summary.json with huggingface_hub", 27 days ago
qwen25_coder_7b_hf_grpo/: "Upload qwen25_coder_7b_hf_grpo/runway_zero_qwen25_coder_7b_hf_grpo.tgz with huggingface_hub", 28 days ago
qwen3_14b_hf_grpo/: "Upload qwen3_14b_hf_grpo/qwen3_14b_hf_grpo_summary.json with huggingface_hub", 27 days ago
.gitattributes (1.52 kB, Safe): "initial commit", 28 days ago
README.md (2.93 kB): "Update model card with four hosted GRPO runs", 27 days ago