Add pipeline tag, library name and paper link
#1
by nielsr HF Staff - opened

README.md CHANGED

````diff
@@ -1,13 +1,26 @@
 ---
-license: apache-2.0
 base_model:
 - meta-llama/Meta-Llama-3-8B-Instruct
+license: apache-2.0
+library_name: transformers
+pipeline_tag: text-generation
 ---
+
+# RESM-Llama-3-8B-Instruct
+
+This repository contains the weights for **RESM-Llama-3-8B-Instruct**, a model developed using the **RESM** (**R**eweighted **E**nhanced task **S**ingular **M**erging) method to balance Helpfulness, Honesty, and Harmlessness (3H optimization).
+
+The model was introduced in the paper [Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging](https://huggingface.co/papers/2502.06876).
+
+## Description
+Achieving balanced alignment of large language models (LLMs) in terms of Helpfulness, Honesty, and Harmlessness constitutes a cornerstone of responsible AI. The authors propose a novel **R**eweighted **E**nhanced task **S**ingular **M**erging method, **RESM**, with outlier weighting and sparsity-aware rank selection strategies to address the challenges of preference noise accumulation and layer sparsity adaptation inherent in 3H-aligned LLM merging.
+
 ## Citation
-```
+```bibtex
 @article{yang2025mix,
 title={Mix Data or Merge Models? Balancing the Helpfulness, Honesty, and Harmlessness of Large Language Model via Model Merging},
-author={Yang, Jinluan and Jin, Dingnan and Tang, Anke and Shen, Li and Zhu, Didi and Chen, Zhengyu and Wang, Daixin and Cui, Qing and Zhang, Zhiqiang and Zhou, Jun and others},
+author={Yang, Jinluan and Jin, Dingnan and Tang, Anke and Shen, Li and Zhu, Didi and Chen, Zhengyu and Wang, Daixin and Cui, Qing and Zhang, Zhiqiang and Zhou, Jun and others},
 journal={arXiv preprint arXiv:2502.06876},
 year={2025}
-}
+}
+```
````