--- license: mit --- # MoSEs SAR Models: Stylistics-Aware Router This repository contains the trained Stylistics-Aware Router (SAR) models for the MoSEs framework, an uncertainty-aware AI-generated text detection system. The SAR models are used to route input texts to relevant reference samples based on stylistic features. ## Model Overview Two SAR models are provided, trained on different datasets: ### main_1000.pt (Main SAR Model) - **Training Dataset**: Main dataset with 8,000 samples across 8 domains - **Domains**: CMV, SciXGen, WP, Xsum (with human and AI-generated continuations) ### tiny_200.pt (Tiny SAR Model) - **Training Dataset**: Tiny dataset with 1,600 samples across 4 domains - **Domains**: CNN, DialogSum, IMDB, PubMed (with human and GPT-4 generated variants) ## Citation If you use these models in your research, please cite the MoSEs paper: ``` @inproceedings{wu2025moses, title={MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds}, author={Wu, Junxi and Wang, Jinpeng and Liu, Zheng and Chen, Bin and Hu, Dongjian and Wu, Hao and Xia, Shu-Tao}, booktitle={Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing}, year={2025}, publisher={Association for Computational Linguistics} } ``` For the specific SAR models: ``` @model{moses_sar_models, title={MoSEs Stylistics-Aware Router}, author={Wu, Junxi and Wang, Jinpeng and Liu, Zheng and Chen, Bin and Hu, Dongjian and Wu, Hao and Xia, Shu-Tao}, year={2025}, url={https://huggingface.co/zhengliu8/Stylistics_Aware_Router} } ``` ## Related Resources - **MoSEs Paper**: [arXiv:2509.02499](https://arxiv.org/abs/2509.02499) - **MoSEs Code**: [GitHub Repository](https://github.com/creator-xi/MoSEs) - **Stylistics Reference Repository**: [HuggingFace Dataset](https://huggingface.co/datasets/zhengliu8/Stylistics_Reference_Repository) ## License This model is licensed under MIT Licence.