LoRAcle KL-distill N1k
Intermediate scaling-curve checkpoint from the KL distillation persona-LoRA experiment.
AB any-match: 0.670 ± 0.028 · details: N=1000, 3 Q/A per persona (matched-step variant)
Companion siblings: ceselder/loracle-kl-distill-N11k (best, AB 0.728 ± 0.009).
See the lora-oracles repo for the full recipe + a CLAUDE_BRIEFING.md describing the experiment top-down.
Headline scaling curve (KL distill)
| N personas | AB any-match | this repo? |
|---|---|---|
| 1,000 | 0.670 ± 0.028 | ✓ |
| 5,000 | 0.701 ± 0.019 | |
| 11,000 | 0.728 ± 0.009 |
Plain-SFT baseline at matched N=1k was 0.594 ± 0.022, plateaued ~0.55 across N=500–11k.
Files
interpreter/— PEFT LoRA adapter (rank 256) onQwen/Qwen3-14Bencoder.pt,ao.pt— AOEncoder + auxiliary tensorstokenizer/— Qwen3-14B tokenizerloracle_config.yaml— training config
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support