jadohu commited on
Commit
4d802d7
·
verified ·
1 Parent(s): 28c3326

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -1
README.md CHANGED
@@ -7,4 +7,19 @@ language:
7
  base_model:
8
  - Qwen/Qwen3-8B-Base
9
  pipeline_tag: reinforcement-learning
10
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  base_model:
8
  - Qwen/Qwen3-8B-Base
9
  pipeline_tag: reinforcement-learning
10
+ ---
11
+ ### Description
12
+ This repository contains the model for [Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning](https://huggingface.co/papers/2510.03259).
13
+
14
+ ### Official Implementation
15
+ https://github.com/akatigre/MASA-RL
16
+
17
+ ### Citation
18
+ ```bibtex
19
+ @article{kim2025meta,
20
+ title={Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning},
21
+ author={Kim, Yoonjeon and Jang, Doohyuk and Yang, Eunho},
22
+ journal={arXiv preprint arXiv:2510.03259},
23
+ year={2025}
24
+ }
25
+ ```