---
base_model: []
library_name: transformers
tags:
- mergekit
- merge
---
# prototype-0.4x204

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using /workspace/prototype-0.4x200 as a base.

### Models Merged

The following models were included in the merge:
* /workspace/cache/models--Delta-Vector--Austral-70B-Winton/snapshots/daa4ccd49dcd55300b7bde4a31c50e10331e2605
* /workspace/prototype-0.4x203
* /workspace/prototype-0.4x201

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: /workspace/cache/models--Delta-Vector--Austral-70B-Winton/snapshots/daa4ccd49dcd55300b7bde4a31c50e10331e2605
  - model: /workspace/prototype-0.4x203
  - model: /workspace/prototype-0.4x201
base_model: /workspace/prototype-0.4x200
merge_method: model_stock
tokenizer:
  source: base
int8_mask: true
dtype: float32
out_dtype: bfloat16
```
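
For readers unfamiliar with `merge_method: model_stock`, the following is a minimal sketch of the per-layer interpolation described in the Model Stock paper, written in plain Python. It is not mergekit's implementation. The function name `model_stock_layer`, the flat-list weight representation, and the choice to average the pairwise cosines between task vectors are all assumptions made for illustration.

```python
import math
from itertools import combinations


def _cosine(u, v):
    """Cosine similarity of two vectors; returns 0.0 if either has zero norm."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def model_stock_layer(base, finetuned):
    """Merge one flattened layer (hypothetical helper, not a mergekit API).

    base:      list of floats, the base model's weights for this layer
    finetuned: list of k >= 2 lists of floats, the fine-tuned models' weights
    """
    k = len(finetuned)
    if k < 2:
        raise ValueError("need at least two fine-tuned models")

    # Task vectors: each fine-tuned model's offset from the base.
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    # Assumption: estimate cos(theta) as the mean pairwise cosine.
    pairs = list(combinations(deltas, 2))
    cos_theta = sum(_cosine(u, v) for u, v in pairs) / len(pairs)

    # Interpolation ratio from the paper: t = k*cos / (1 + (k - 1)*cos).
    t = k * cos_theta / (1 + (k - 1) * cos_theta)

    # Blend the fine-tuned average with the base weights.
    avg = [sum(col) / k for col in zip(*finetuned)]
    return [t * a + (1 - t) * b for a, b in zip(avg, base)]
```

When the task vectors agree in direction, `t` approaches 1 and the result is close to the plain average of the fine-tuned weights. When they are orthogonal, `t` is 0 and the layer falls back to the base weights.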