Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
Nikitasoni22
/
cicd-rl-agent
like
0
Reinforcement Learning
PyTorch
cicd
debugging
automation
License:
mit
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
cicd-rl-agent
113 kB
Ctrl+K
Ctrl+K
1 contributor
History:
13 commits
Nikitasoni22
training issue resolved
048dc4f
2 months ago
cicd_debug_env
updated code
2 months ago
server
initial clean commit
2 months ago
tests
initial clean commit
2 months ago
ui
initial clean commit
2 months ago
.gitattributes
Safe
1.52 kB
initial commit
2 months ago
.gitignore
Safe
70 Bytes
initial clean commit
2 months ago
Dockerfile
Safe
189 Bytes
initial clean commit
2 months ago
README.md
Safe
190 Bytes
add model card metadata
2 months ago
baseline_inference.py
Safe
1.95 kB
training issue resolved
2 months ago
eval_lora.py
Safe
6.33 kB
training issue resolved
2 months ago
inference.py
Safe
5.86 kB
training issue resolved
2 months ago
openenv.yaml
Safe
559 Bytes
initial clean commit
2 months ago
requirements.txt
Safe
116 Bytes
initial clean commit
2 months ago
train.py
Safe
22 kB
training issue resolved
2 months ago
train_cicd_rl_test.py
Safe
23.6 kB
fix GRPOConfig max_completion_length + unsloth import
2 months ago
train_colab.ipynb
Safe
13.4 kB
training issue resolved
2 months ago