modaic
/

Qwen3.5-4B-probe

Text Classification

confidence-probe

sequence-classification

Model card Files Files and versions

qwen3.5-hard-only-r4

Summary

Base model: Qwen/Qwen3.5-4B

OOD Evaluation

benchmark	n	auroc	accuracy
arc_challenge	1000	0.8875	0.8890
judge_bench	278	0.7065	0.6583
mmlu	1000	0.7550	0.7680
mmlu_pro	1000	0.6889	0.7070
rod101_essay_scoring	81	0.7115	0.7407

MMLU AUROC with Tuning (by amount of data used to train)

MMLU Accuracy with tuning (by amount of data used to train)

Downloads last month: -; Downloads are not tracked for this model. How to track

Model tree for modaic/Qwen3.5-4B-probe

Base model

Qwen/Qwen3.5-4B-Base

Finetuned

Qwen/Qwen3.5-4B

Adapter

(259)

this model