jadohu
/

Qwen3-8B-MASA-efficient

Reinforcement Learning

Model card Files Files and versions

jadohu commited on Nov 26, 2025

Commit

4d802d7

·

verified ·

1 Parent(s): 28c3326

Update README.md

Files changed (1) hide show

README.md +16 -1

README.md CHANGED Viewed

@@ -7,4 +7,19 @@ language:
 base_model:
 - Qwen/Qwen3-8B-Base
 pipeline_tag: reinforcement-learning
----

 base_model:
 - Qwen/Qwen3-8B-Base
 pipeline_tag: reinforcement-learning
+---
+### Description
+This repository contains the model for [Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning](https://huggingface.co/papers/2510.03259).
+### Official Implementation
+https://github.com/akatigre/MASA-RL
+### Citation
+```bibtex
+@article{kim2025meta,
+  title={Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning},
+  author={Kim, Yoonjeon and Jang, Doohyuk and Yang, Eunho},
+  journal={arXiv preprint arXiv:2510.03259},
+  year={2025}
+}
+```