---
base_model:
- sraj/CMB_MARK_CX_LRD
- sraj/CMB_FWEdu_V2_FastTxt_CX_LRD
- sraj/CMB_WX_SYN_CX_LRD
library_name: transformers
tags:
- mergekit
- merge
---
# merge_linear_normbalanced

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Merge Method

This model was merged using the [Linear](https://arxiv.org/abs/2203.05482) merge method.
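
As a rough illustration of what a linear merge does, the sketch below averages corresponding parameters across checkpoints using per-model weights. It is not mergekit's implementation: `linear_merge` is a hypothetical helper, the checkpoints are toy dictionaries of flat Python lists rather than real tensors, and the assumption that normalization means dividing by the sum of the weights is stated here for illustration only.

```python
def linear_merge(state_dicts, weights, normalize=True):
    """Weighted average of parameters that share the same name.

    state_dicts: list of {parameter_name: list of floats} (toy stand-in for tensors)
    weights:     one scalar weight per checkpoint
    normalize:   if True, divide by the sum of the weights (assumed behaviour)
    """
    if len(state_dicts) != len(weights):
        raise ValueError("need exactly one weight per checkpoint")

    total = sum(weights) if normalize else 1.0
    merged = {}
    for name in state_dicts[0]:
        # Walk the same parameter in every checkpoint, element by element.
        columns = zip(*(sd[name] for sd in state_dicts))
        merged[name] = [
            sum(w * v for w, v in zip(weights, column)) / total
            for column in columns
        ]
    return merged


# Toy checkpoints with a single two-element parameter each.
model_a = {"layer.weight": [1.0, 2.0]}
model_b = {"layer.weight": [3.0, 4.0]}

merged = linear_merge([model_a, model_b], weights=[1.0, 3.0])
print(merged)
```

With normalization on, only the ratio between the weights matters, so weights of `1.0` and `3.0` behave the same as `0.25` and `0.75`.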

### Models Merged

The following models were included in the merge:

* [sraj/CMB_MARK_CX_LRD](https://huggingface.co/sraj/CMB_MARK_CX_LRD)
* [sraj/CMB_FWEdu_V2_FastTxt_CX_LRD](https://huggingface.co/sraj/CMB_FWEdu_V2_FastTxt_CX_LRD)
* [sraj/CMB_WX_SYN_CX_LRD](https://huggingface.co/sraj/CMB_WX_SYN_CX_LRD)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
# Norm-balanced: weights set inversely proportional to avg task vector L2 norm
# A ≈ 16, F ≈ 43, S ≈ 16 → weights ∝ 1/16, 1/43, 1/16 → normalized ≈ 2.7, 1.0, 2.7
models:
  - model: sraj/CMB_MARK_CX_LRD
    parameters:
      weight: 2.7
  - model: sraj/CMB_FWEdu_V2_FastTxt_CX_LRD
    parameters:
      weight: 1.0
  - model: sraj/CMB_WX_SYN_CX_LRD
    parameters:
      weight: 2.7
merge_method: linear
parameters:
  normalize: true
dtype: bfloat16
```
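
The arithmetic in the configuration comment can be reproduced with a short sketch. It assumes the three norms quoted in the comment (roughly 16, 43 and 16) and reads "normalized" as rescaling so that the smallest weight equals 1.0. The function name `norm_balanced_weights` and the keys `A`, `F`, `S` are illustrative labels taken from the comment, not part of mergekit; which label corresponds to which model is not stated in the card.

```python
def norm_balanced_weights(norms, ndigits=1):
    """Weights inversely proportional to each norm, scaled so the smallest is 1.0."""
    inverse = {name: 1.0 / norm for name, norm in norms.items()}
    smallest = min(inverse.values())
    return {name: round(value / smallest, ndigits) for name, value in inverse.items()}


# Approximate average task-vector L2 norms quoted in the YAML comment.
norms = {"A": 16.0, "F": 43.0, "S": 16.0}

weights = norm_balanced_weights(norms)
print(weights)

# Assuming `normalize: true` divides by the sum of the weights,
# these are the effective fractions each model contributes.
total = sum(weights.values())
fractions = {name: w / total for name, w in weights.items()}
print(fractions)
```

Under that assumption, the two models with the smaller norm each contribute about 42% of the merged parameters and the model with the larger norm contributes about 16%.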