LoRAcles: Weight-Space Interpretability at Scale
Collection
Training data and LoRAcles and LoRAcles for llama 3.3 70B, qwen3-14b and olmo-3-32B • 7 items • Updated
How to use ceselder/loracle-olmo3-32b-v3 with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("allenai/Olmo-3.1-32B-Instruct")
model = PeftModel.from_pretrained(base_model, "ceselder/loracle-olmo3-32b-v3")A LoRAcle interpreter for allenai/Olmo-3.1-32B-Instruct. It reads tokenized LoRA
weight-deltas (SVD "direction tokens") injected into the frozen base model and describes
the behavior the weight update encodes — without running the fine-tuned model.
[K≤16 ranks × 64 layers × 7 mag-sides, d=5120].h' = h + 2·‖h‖·v̂ at the
placeholder positions of a rank-tagged prefix. Dynamic K ∈ {1..16} per sample.ceselder/loracle-training-data (27,398 train
organisms, 150 heldout). Load on top of allenai/Olmo-3.1-32B-Instruct with the same direction-token + layer-1
norm-match (gain 2.0) injection harness used to train it.
Base model
allenai/Olmo-3-1125-32B