Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Nikitasoni22
/
cicd-rl-agent

Reinforcement Learning
PyTorch
cicd
debugging
automation
Model card Files Files and versions
xet
Community
cicd-rl-agent
Ctrl+K
Ctrl+K
  • 1 contributor
History: 13 commits
Nikitasoni22's picture
Nikitasoni22
training issue resolved
048dc4f 16 days ago
  • cicd_debug_env
    updated code 17 days ago
  • server
    initial clean commit 17 days ago
  • tests
    initial clean commit 17 days ago
  • ui
    initial clean commit 17 days ago
  • .gitattributes
    1.52 kB
    initial commit 17 days ago
  • .gitignore
    70 Bytes
    initial clean commit 17 days ago
  • Dockerfile
    189 Bytes
    initial clean commit 17 days ago
  • README.md
    190 Bytes
    add model card metadata 17 days ago
  • baseline_inference.py
    1.95 kB
    training issue resolved 17 days ago
  • eval_lora.py
    6.33 kB
    training issue resolved 16 days ago
  • inference.py
    5.86 kB
    training issue resolved 17 days ago
  • openenv.yaml
    559 Bytes
    initial clean commit 17 days ago
  • requirements.txt
    116 Bytes
    initial clean commit 17 days ago
  • train.py
    22 kB
    training issue resolved 16 days ago
  • train_cicd_rl_test.py
    23.6 kB
    fix GRPOConfig max_completion_length + unsloth import 17 days ago
  • train_colab.ipynb
    13.4 kB
    training issue resolved 16 days ago