johnsonchromia commited on
Commit
16ae761
·
verified ·
1 Parent(s): e6d0602

Update README: v18 benchmarks + AEON attribution

Browse files
Files changed (1) hide show
  1. README.md +15 -8
README.md CHANGED
@@ -33,14 +33,14 @@ are at [`evalengine/unbound-e2b-GGUF`](https://huggingface.co/evalengine/unbound
33
 
34
  | Axis | Base | Unbound E2B | Δ |
35
  |---|---|---|---|
36
- | Refusal rate (AdvBench 520) | 98.46% | **2.31%** | **−96.15 pts** |
37
- | Useful-compliance rate | 0.96% | 24.23% | +23.27 pts |
38
- | Hallucination (on harmful prompts) | 1.35% | 18.85% | +17.50 pts |
39
- | Coherence (benign prompts) | 1.00 | 0.80 | 0.20 |
40
- | TruthfulQA mc2 (`--limit 100`) | 0.458 | 0.468 | +1.0 pt |
41
- | MMLU (`--limit 100`) | 0.291 | 0.281 | −1.0 pt |
42
- | GSM8K (`--limit 100`) | 0.125 | 0.140 | +1.5 pt |
43
- | KL divergence vs base | 0 | 3.80 | (SFT-expected) |
44
 
45
  ## Sampling
46
 
@@ -73,6 +73,13 @@ Fine-tuned with [Unsloth](https://github.com/unslothai/unsloth) + HF
73
  [heretic](https://github.com/p-e-w/heretic). Environment + training
74
  discipline ported from [autoresearch](https://github.com/karpathy/autoresearch).
75
 
 
 
 
 
 
 
 
76
  ## License
77
 
78
  Apache-2.0, inherited from `google/gemma-4-E2B-it`.
 
33
 
34
  | Axis | Base | Unbound E2B | Δ |
35
  |---|---|---|---|
36
+ | Refusal rate (AdvBench 520, LLM judge) | 98.46% | **4.42%** | **−94.04 pts** |
37
+ | Useful-compliance rate | 0.96% | **39.23%** | **+38.27 pts** |
38
+ | Hallucination (on harmful prompts) | 1.35% | 15.96% | +14.61 pts |
39
+ | Coherence (benign prompts) | 1.00 | 1.00 | 0 |
40
+ | TruthfulQA mc2 (`--limit 100`) | 0.458 | 0.465 | +0.7 pt |
41
+ | MMLU (`--limit 100`) | 0.291 | 0.282 | −0.9 pt |
42
+ | GSM8K (`--limit 100`) | 0.125 | 0.120 | −0.5 pt |
43
+ | KL divergence vs base | 0 | 3.76 | (SFT-expected) |
44
 
45
  ## Sampling
46
 
 
73
  [heretic](https://github.com/p-e-w/heretic). Environment + training
74
  discipline ported from [autoresearch](https://github.com/karpathy/autoresearch).
75
 
76
+ 200 of the 700 compliance training examples were distilled from
77
+ [`AEON-7/Qwen3.6-27B-AEON-Ultimate-Uncensored-NVFP4`](https://huggingface.co/AEON-7/Qwen3.6-27B-AEON-Ultimate-Uncensored-NVFP4)
78
+ — a fully uncensored teacher model that produced substantive, non-refusing
79
+ answers to harmful prompts. The AEON-distilled compliance set was the single
80
+ biggest contributor to useful-compliance on Unbound E2B (+6.73 pp at v13;
81
+ the v18 epistemic-decline iteration layered another +15.00 pp on top).
82
+
83
  ## License
84
 
85
  Apache-2.0, inherited from `google/gemma-4-E2B-it`.