Qwen3.5-4B PLTs

Per-layer transcoder (PLT) checkpoints for Qwen/Qwen3.5-4B, trained as same-layer MLP transcoders.

Contents

  • transcoder_L0.pt
  • transcoder_L2.pt ... transcoder_L31.pt
  • layer_metrics.json

The layer 1 checkpoint (transcoder_L1.pt) is currently missing because it was not preserved in the training artifacts.
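The checkpoints are plain PyTorch files. A minimal inspection sketch follows; the state-dict key names and the layout of layer_metrics.json are assumptions, so print them first to confirm what each file actually contains.

```python
import json
import torch

# Load one per-layer transcoder checkpoint and the metrics file.
# Key names and JSON structure are assumptions; inspect the printed output.
state = torch.load("transcoder_L0.pt", map_location="cpu")
print(list(state.keys()))

with open("layer_metrics.json") as f:
    metrics = json.load(f)
print(metrics)
```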

Training recipe

  • Base model: Qwen/Qwen3.5-4B
  • Objective: reconstruct each layer's MLP output from that same layer's MLP input
  • Feature width: 16384
  • Activation regime: ReLU features with an L1 sparsity penalty, plus ghost gradients and dead-feature resampling (see the sketch after this list)
  • Training corpus: a 20k-example image-text subset built from CC3M and CC12M
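
As a rough sketch of the recipe above, a same-layer MLP transcoder is a wide encoder–decoder that maps the MLP input to a prediction of the MLP output through a ReLU feature layer, trained with an MSE reconstruction loss plus an L1 sparsity penalty. The class, parameter names, and the l1_coeff value below are illustrative assumptions, and ghost gradients and dead-feature resampling are omitted.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MLPTranscoder(nn.Module):
    """Sketch of a same-layer MLP transcoder: MLP input -> sparse ReLU
    features -> predicted MLP output. Parameterization is an assumption
    and may differ from the released checkpoints."""

    def __init__(self, d_model: int, d_features: int = 16384):
        super().__init__()
        self.W_enc = nn.Parameter(torch.randn(d_model, d_features) * 0.01)
        self.b_enc = nn.Parameter(torch.zeros(d_features))
        self.W_dec = nn.Parameter(torch.randn(d_features, d_model) * 0.01)
        self.b_dec = nn.Parameter(torch.zeros(d_model))

    def encode(self, mlp_in: torch.Tensor) -> torch.Tensor:
        # Sparse feature activations (ReLU regime).
        return F.relu(mlp_in @ self.W_enc + self.b_enc)

    def forward(self, mlp_in: torch.Tensor):
        feats = self.encode(mlp_in)
        recon = feats @ self.W_dec + self.b_dec  # predicted MLP output
        return recon, feats

def transcoder_loss(recon, target_mlp_out, feats, l1_coeff=1e-3):
    # MSE reconstruction of the MLP output plus an L1 sparsity penalty
    # on feature activations; the coefficient is illustrative.
    mse = F.mse_loss(recon, target_mlp_out)
    l1 = feats.abs().sum(dim=-1).mean()
    return mse + l1_coeff * l1
```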

Intended use

These checkpoints are intended for research on sparse feature discovery, circuit analysis, and PLT-based steering experiments in Qwen3.5-4B.
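For example, feature discovery typically amounts to capturing the MLP input at a given layer of the base model and encoding it with the matching transcoder. The module path (model.model.layers[i].mlp), the state-dict keys, and the layer chosen below are assumptions; adjust them to the actual model and checkpoint layout.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Capture the MLP input at one layer and encode it into sparse features.
model_name = "Qwen/Qwen3.5-4B"
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
model.eval()

LAYER = 12          # example layer; any layer with a released checkpoint works
captured = {}

def hook(module, args, output):
    # args[0] is the hidden state entering the MLP block.
    captured["mlp_in"] = args[0].detach()

# Qwen-style decoder layers usually expose .mlp; verify for this model.
handle = model.model.layers[LAYER].mlp.register_forward_hook(hook)

inputs = tok("The Eiffel Tower is in", return_tensors="pt")
with torch.no_grad():
    model(**inputs)
handle.remove()

state = torch.load(f"transcoder_L{LAYER}.pt", map_location="cpu")
# Assumed key names; inspect the file if these differ.
W_enc, b_enc = state["W_enc"].float(), state["b_enc"].float()
feats = torch.relu(captured["mlp_in"].float() @ W_enc + b_enc)
print("top features at last token:", feats[0, -1].topk(10).indices.tolist())
```

Steering experiments follow the same pattern, except that selected feature directions from the decoder are added back into the residual stream during generation.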

Notes

Checkpoint quality is not uniform across layers; later layers generally performed better than earlier layers in our experiments.
