Selective Steering: Norm-Preserving Control Through Discriminative Layer Selection Paper โข 2601.19375 โข Published 3 days ago โข 5
RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search Paper โข 2504.15047 โข Published Apr 21, 2025 โข 6
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper โข 2503.16219 โข Published Mar 20, 2025 โข 52