qwen35-4b-iconclass-sft-brillfull

Qwen3.5-4B-VL fine-tuned (SFT, 1 epoch) on davanstrien/iconclass-vlm-brillfull — the full-label iconclass dataset (~4.36 codes/image vs the truncated 3.54).

Why this model exists (research finding)

Built to test whether fixing truncated training labels lifts the iconclass classifier past its ~25% recall ceiling. It does not: training converged well (eval_loss 0.47), but on the clean 788-image full-label test it scores H-F1 45.3 / hier-recall 46.4 / code-recall 25.6 — recall unchanged vs models trained on truncated labels.

Conclusion: the 4B is capability-bound (identifying the right codes), not label-bound — neither reward tuning nor label completeness moves it.

The approach that did improve results is anchored fusion: use this model as a precision anchor, then a graded VLM-judge gates in extra codes from semantic retrieval. On the same clean test that lifts results to H-F1 47.5 / hier-recall 57.6, with zero extra training.

  • Base: unsloth/Qwen3.5-4B-Base
  • Recommended use: as the anchor in the anchored-fusion pipeline (best recall).

Trained with Unsloth + TRL.

Downloads last month
82
Safetensors
Model size
5B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for davanstrien/qwen35-4b-iconclass-sft-brillfull

Finetuned
(16)
this model
Finetunes
2 models