Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
SaiManish123
/
Janus
like
0
Reinforcement Learning
Safetensors
openenv
security
cybersecurity
License:
mit
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
Janus
/
sft_worldsplit_1_5b
1.32 GB
Ctrl+K
Ctrl+K
1 contributor
History:
3 commits
SaiManish123
Replace SFT reward curve with baseline-anchored learning curve (tool-aware baseline → checkpoint-40 … final)
10fa2de
verified
27 days ago
checkpoint-120
Upload folder using huggingface_hub
27 days ago
checkpoint-160
Upload folder using huggingface_hub
27 days ago
checkpoint-200
Upload folder using huggingface_hub
27 days ago
checkpoint-240
Upload folder using huggingface_hub
27 days ago
checkpoint-280
Upload folder using huggingface_hub
27 days ago
checkpoint-320
Upload folder using huggingface_hub
27 days ago
checkpoint-360
Upload folder using huggingface_hub
27 days ago
checkpoint-378
Upload folder using huggingface_hub
27 days ago
checkpoint-40
Upload folder using huggingface_hub
27 days ago
checkpoint-80
Upload folder using huggingface_hub
27 days ago
final
Upload folder using huggingface_hub
27 days ago
README.md
1.54 kB
Upload folder using huggingface_hub
27 days ago
adaptshield_sft_worldsplit.summary.json
211 Bytes
Upload sft_worldsplit_1_5b/adaptshield_sft_worldsplit.summary.json with huggingface_hub
27 days ago
loss_curve.png
51.7 kB
Upload folder using huggingface_hub
27 days ago
reward_curve.png
52.5 kB
Replace SFT reward curve with baseline-anchored learning curve (tool-aware baseline → checkpoint-40 … final)
27 days ago
sft_metrics.json
92 kB
Upload folder using huggingface_hub
27 days ago