Improve model card: add metadata, paper link, and description

#1
by nielsr - opened
Files changed (1)
  1. README.md +21 -3
README.md CHANGED
@@ -1,14 +1,32 @@
 ---
-license: apache-2.0
 base_model:
 - mistralai/Mistral-7B-Instruct-v0.2
+license: apache-2.0
+library_name: transformers
+pipeline_tag: text-generation
 ---
 
+# RESM-Mistral-7B
+
+This repository contains the model weights for RESM-Mistral-7B, introduced in the paper [Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging](https://huggingface.co/papers/2502.06876).
+
+## Model Description
+
+RESM-Mistral-7B is an aligned Large Language Model (LLM) based on the Mistral-7B-Instruct-v0.2 architecture. It is designed to achieve a balanced alignment of **Helpfulness, Honesty, and Harmlessness** (3H optimization).
+
+The model was created using a novel parameter-level merging method called **RESM** (**R**eweighting **E**nhanced task **S**ingular **M**erging). RESM addresses the challenges of preference noise accumulation and layer sparsity adaptation through:
+- **Outlier weighting**: Mitigating the impact of preference noise.
+- **Sparsity-aware rank selection**: Adapting to layer-wise variations in parameter importance.
+
+This approach allows the model to better navigate the collaborative and conflicting relationships between the 3H dimensions compared to traditional data mixture strategies.
+
 ## Citation
-```
+
+```bibtex
 @article{yang2025mix,
 title={Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging},
 author={Yang, Jinluan and Jin, Dingnan and Tang, Anke and Shen, Li and Zhu, Didi and Chen, Zhengyu and Wang, Daixin and Cui, Qing and Zhang, Zhiqiang and Zhou, Jun and others},
 journal={arXiv preprint arXiv:2502.06876},
 year={2025}
-}
+}
+```
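
The new metadata declares `library_name: transformers` and `pipeline_tag: text-generation`, so the card implies the weights load through the standard `transformers` API. A minimal loading sketch under that assumption follows; the repo id is a placeholder, not something stated in this PR.

```python
# Minimal sketch assuming the standard transformers text-generation API.
# The repo id below is a placeholder (not given in this PR); substitute the real one.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "<org>/RESM-Mistral-7B"  # placeholder repo id (assumption)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# Mistral-7B-Instruct-v0.2 is the base model, so its chat template should apply.
messages = [{"role": "user", "content": "Explain 3H alignment in one paragraph."}]
input_ids = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)
output_ids = model.generate(input_ids, max_new_tokens=200)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```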
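The Model Description names RESM's two ingredients (outlier weighting and sparsity-aware rank selection) without showing code. Purely as an illustration of what a parameter-level merge with those two steps can look like, and not the paper's implementation, here is a toy sketch over per-layer state dicts; `resm_style_merge`, the 0.5 down-weight, and the energy threshold are all invented for the example.

```python
import torch

def resm_style_merge(base, finetuned, energy_keep=0.90, outlier_z=3.0):
    """Toy sketch of a parameter-level merge with outlier reweighting and
    per-layer singular truncation. Illustrative only; not the paper's code."""
    merged = {}
    for name, w_base in base.items():
        # Task vectors: how each aligned model (e.g. helpful / honest / harmless)
        # moved away from the shared base weights.
        deltas = torch.stack([ft[name] - w_base for ft in finetuned])
        # "Outlier weighting" (assumed form): down-weight entries that deviate
        # strongly across models, treating them as accumulated preference noise.
        z = (deltas - deltas.mean(0)) / (deltas.std(0) + 1e-8)
        w = torch.where(z.abs() > outlier_z,
                        0.5 * torch.ones_like(deltas),
                        torch.ones_like(deltas))
        delta = (w * deltas).sum(0) / w.sum(0)
        # "Sparsity-aware rank selection" (assumed form): per layer, keep the
        # smallest rank whose singular values capture `energy_keep` of the energy.
        if delta.ndim == 2:
            U, S, Vh = torch.linalg.svd(delta, full_matrices=False)
            energy = torch.cumsum(S ** 2, dim=0) / (S ** 2).sum()
            r = int((energy < energy_keep).sum().item()) + 1
            delta = U[:, :r] @ torch.diag(S[:r]) @ Vh[:r, :]
        merged[name] = w_base + delta
    return merged
```

Called with the base model's state dict and a list of fine-tuned state dicts, this returns a merged state dict; the per-layer rank adapts to each layer's spectrum, which is the intuition behind sparsity-aware selection.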