70B_smrt / README.md
schonsense's picture
Add files using upload-large-folder tool
96eb260 verified
metadata
base_model:
  - schonsense/70B_Triage
  - schonsense/70B_neolithic_rabbit
  - schonsense/schonsense_70B_thinkthonk
library_name: transformers
tags:
  - mergekit
  - merge

smrt

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the SCE merge method using D:\mergekit\yamls\IPOplectic as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

merge_method: sce
select_topk: 0.25

models:


  - model: "D:\\mergekit\\yamls\\IPOplectic"
  - model: "D:\\mergekit\\yamls\\sce_galaxy_brain"
  - model: schonsense/70B_Triage
  - model: schonsense/70B_neolithic_rabbit
  - model: schonsense/schonsense_70B_thinkthonk


base_model: "D:\\mergekit\\yamls\\IPOplectic"

parameters:
  normalize: false
  int8_mask: true

dtype: float32
out_dtype: bfloat16

tokenizer:
  source: base
  pad_to_multiple_of: 8