Steering MoE LLMs via Expert (De)Activation
Paper
•
2509.09660
•
Published
NLP, Representation Learning, Machine Translation
Large Reasoning Models Are (Not Yet) Multilingual Latent Reasoners
How Programming Concepts and Neurons Are Shared in Code Language Models