Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
shivam2k3
/
opensoc-env
like
0
openenv
cybersecurity
rlvr
self-play
License:
bsd-3-clause
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
opensoc-env
/
train
Ctrl+K
Ctrl+K
1 contributor
History:
10 commits
shivam2k3
training: push adapters to HF Hub after SFT + each GRPO stage
dc2d89f
25 days ago
__init__.py
0 Bytes
OpenSOC v1
25 days ago
grpo_rewards.py
Safe
4.9 kB
grpo: skip-SFT continuation script + completion-shape fix
25 days ago
make_sft_dataset.py
Safe
3.79 kB
OpenSOC v1
25 days ago
prompt_format.py
Safe
4.31 kB
OpenSOC v1
25 days ago
sft_warmstart.py
Safe
5.19 kB
sft_warmstart: import unsloth first; batched formatting_func
25 days ago
train_grpo.py
Safe
8.57 kB
training: push adapters to HF Hub after SFT + each GRPO stage
25 days ago