A Single Layer to Explain Them All: Understanding Massive Values in Large Language Models

Start

This page is the model of WeMask, the github link is ME-Layer. We use Qwen-3-VL-4B as our foundation model for SFT and RL training. You can follow the guideline in this repo to start testing and training.

Citation

If you think our research is helpful, please cite with

@article{me_layer_2026,
  title={A Single Layer to Explain Them All: Understanding Massive Values in Large Language Models},
  author={Your Name and Co-authors},
  journal={Proceedings of the 43rd International Conference on Machine Learning (ICML)},
  year={2026}
}