Add model card metadata and description

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +23 -4
README.md CHANGED
@@ -1,13 +1,32 @@
1
  ---
2
- license: apache-2.0
3
  base_model:
4
  - meta-llama/Meta-Llama-3-8B-Instruct
 
 
 
5
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6
  ## Citation
7
- ```
 
8
  @article{yang2025mix,
9
  title={Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging},
10
- author={Yang, Jinluan and Jin, Dingnan and Tang, Anke and Shen, Li and Zhu, Didi and Chen, Zhengyu and Wang, Daixin and Cui, Qing and Zhang, Zhiqiang and Zhou, Jun and others},
11
  journal={arXiv preprint arXiv:2502.06876},
12
  year={2025}
13
- }
 
 
1
  ---
 
2
  base_model:
3
  - meta-llama/Meta-Llama-3-8B-Instruct
4
+ license: apache-2.0
5
+ library_name: transformers
6
+ pipeline_tag: text-generation
7
  ---
8
+
9
+ # Mix Data or Merge Models? RESM-Llama-3-8B
10
+
11
+ This repository contains the model weights for the 3H-aligned Large Language Model (LLM) presented in the paper [Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging](https://huggingface.co/papers/2502.06876).
12
+
13
+ ## Description
14
+
15
+ Achieving a balanced alignment across Helpfulness, Honesty, and Harmlessness (the 3H dimensions) is critical for responsible AI. This model was developed using **RESM** (**R**eweighting **E**nhanced task **S**ingular **M**erging), a novel model merging method that utilizes outlier weighting and sparsity-aware rank selection strategies.
16
+
17
+ RESM is designed to address challenges inherent in 3H-aligned merging, such as preference noise accumulation and layer sparsity adaptation. By working at the parameter level, it provides a conflict-resolution strategy that outperforms traditional data mixture methods in achieving balanced LLM alignment.
18
+
19
+ - **Base Model:** [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
20
+ - **Method:** RESM (Reweighting Enhanced task Singular Merging)
21
+ - **Optimization Goals:** Helpfulness, Honesty, and Harmlessness (3H)
22
+
23
  ## Citation
24
+
25
+ ```bibtex
26
  @article{yang2025mix,
27
  title={Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging},
28
+ author={Yang, Jinluan and Jin, Dingnan and Tang, Anke and Shen, Li and Zhu, Didi + and Chen, Zhengyu and Wang, Daixin and Cui, Qing and Zhang, Zhiqiang and Zhou, Jun and Fei Wu and Kun Kuang},
29
  journal={arXiv preprint arXiv:2502.06876},
30
  year={2025}
31
+ }
32
+ ```