Built a reporter-only confidence adapter on top of SmolLM3-3B

#50
by ajdramos - opened

The main one is a "reporter-only" LoRA: it's active only on a confidence turn, so it teaches the model to report how sure it is without touching any of its answers. The reported confidence ends up tracking whether the answer is actually right (AUROC ~0.87,
vs ~0.75 for the raw signal). There's also a cheap "disagreement cascade" that matches 6-sample voting accuracy at ~47% of the tokens.

Adapter:
huggingface.co/ajdramos/bojador-reporter-smollm3-3b
Code and full lab log:
github.com/ajdramos/bojador

Sign up or log in to comment