Add model card metadata and paper link

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +17 -3
README.md CHANGED
@@ -1,13 +1,27 @@
1
  ---
2
- license: apache-2.0
3
  base_model:
4
  - meta-llama/Meta-Llama-3-8B-Instruct
 
 
 
5
  ---
 
 
 
 
 
 
 
 
 
 
 
6
  ## Citation
7
- ```
8
  @article{yang2025mix,
9
  title={Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging},
10
  author={Yang, Jinluan and Jin, Dingnan and Tang, Anke and Shen, Li and Zhu, Didi and Chen, Zhengyu and Wang, Daixin and Cui, Qing and Zhang, Zhiqiang and Zhou, Jun and others},
11
  journal={arXiv preprint arXiv:2502.06876},
12
  year={2025}
13
- }
 
 
1
  ---
 
2
  base_model:
3
  - meta-llama/Meta-Llama-3-8B-Instruct
4
+ license: apache-2.0
5
+ library_name: transformers
6
+ pipeline_tag: text-generation
7
  ---
8
+
9
+ # Mix Data or Merge Models? (RESM)
10
+
11
+ This repository contains the model weights for the 3H-aligned LLM (balancing Helpfulness, Honesty, and Harmlessness) developed in the paper [Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging](https://huggingface.co/papers/2502.06876).
12
+
13
+ ## Description
14
+ This model is constructed using a novel merging method called **RESM** (**R**eweighting **E**nhanced task **S**ingular **M**erging). RESM addresses challenges like preference noise accumulation and layer sparsity adaptation through outlier weighting and sparsity-aware rank selection strategies. The approach aims to achieve a balanced alignment across the 3H dimensions (Helpfulness, Honesty, and Harmlessness) more effectively than traditional data mixture or standard model merging techniques.
15
+
16
+ - **Base Model:** [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct)
17
+ - **Paper:** [Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging](https://huggingface.co/papers/2502.06876)
18
+
19
  ## Citation
20
+ ```bibtex
21
  @article{yang2025mix,
22
  title={Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging},
23
  author={Yang, Jinluan and Jin, Dingnan and Tang, Anke and Shen, Li and Zhu, Didi and Chen, Zhengyu and Wang, Daixin and Cui, Qing and Zhang, Zhiqiang and Zhou, Jun and others},
24
  journal={arXiv preprint arXiv:2502.06876},
25
  year={2025}
26
+ }
27
+ ```