Sovereign-Hardened-V1

Multi-benchmark GRPO hardening with 6-signal reward stack and self-healing callback.

Benchmarks Covered

Benchmark Dataset Reward Type
GPQA Diamond Wanfq/gpqa (198 PhD-level MCQs) Letter-match MCQ
ARC-Challenge allenai/ai2_arc (1172 MCQs) Letter-match MCQ
IFEval google/IFEval (541 IF prompts) Rule-based constraint verification
PHYBench Eureka-Lab/PHYBench (500 physics) Numeric/symbolic match
Adversarial Generated (injection + IF + tools) Injection defense + graceful recovery

Self-Healing

Plateau → β×1.5 + LR×0.5 | Collapse → checkpoint + stop

Launch

pip install trl transformers torch datasets accelerate trackio
python benchmark_hardening.py
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for moro72842/Sovereign-Hardened-V1

Base model

Qwen/Qwen2.5-3B
Finetuned
(1277)
this model

Datasets used to train moro72842/Sovereign-Hardened-V1