Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
SaiManish123
/
Janus
like
0
Reinforcement Learning
Safetensors
openenv
security
cybersecurity
License:
mit
Model card
Files
Files and versions
xet
Community
main
Janus
Ctrl+K
Ctrl+K
1 contributor
History:
11 commits
SaiManish123
assets: headline chart, architecture overview, training pipeline (for README)
7001779
verified
13 days ago
assets
assets: headline chart, architecture overview, training pipeline (for README)
13 days ago
grpo_polymorphic_zero_day_1_5b
Upload folder using huggingface_hub
13 days ago
grpo_worldsplit_1_5b
Upload folder using huggingface_hub
13 days ago
logs
Add HF Job stdout log for grpo_polymorphic_zero_day_1_5b
13 days ago
sft_worldsplit_1_5b
Replace SFT reward curve with baseline-anchored learning curve (tool-aware baseline → checkpoint-40 … final)
13 days ago
.gitattributes
Safe
2.78 kB
Upload folder using huggingface_hub
13 days ago
README.md
20.4 kB
readme: project description, results, training logs, links
13 days ago