Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Ajay00747
/
Demo
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
Demo
/
training
3.17 MB
Ctrl+K
Ctrl+K
2 contributors
History:
5 commits
Ajayyy00
Add self-contained GRPO training notebook for HF Jupyter
1080341
17 days ago
CyberSOC_GRPO_Training.ipynb
Safe
32.9 kB
Add self-contained GRPO training notebook for HF Jupyter
17 days ago
__init__.py
0 Bytes
Initial commit of CyberSOC upgraded RLVR environment
18 days ago
agent_archive.py
Safe
3.02 kB
Add alternating self-play training scaffolding.
17 days ago
collect_sft.py
Safe
5.48 kB
Add root health check route to fix HF 404
17 days ago
collect_sft_data.py
Safe
4.28 kB
Initial commit of CyberSOC upgraded RLVR environment
18 days ago
config.py
Safe
2.6 kB
Add GRPO training pipeline + remove shield emoji
18 days ago
eval_harness.py
Safe
1.61 kB
Add alternating self-play training scaffolding.
17 days ago
freeze_alternate.py
Safe
10 kB
Add root health check route to fix HF 404
17 days ago
pfsp_scheduler.py
Safe
1.36 kB
Add alternating self-play training scaffolding.
17 days ago
reward_funcs.py
Safe
5.44 kB
Initial commit of CyberSOC upgraded RLVR environment
18 days ago
sft_data.jsonl
Safe
3.08 MB
Initial commit of CyberSOC upgraded RLVR environment
18 days ago
train_grpo.py
Safe
21.4 kB
Add alternating self-play training scaffolding.
17 days ago