base_model: []
library_name: transformers
tags:
  - mergekit
  - merge

prototype-0.4x261

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Model Stock merge method, with /workspace/cache/models--Ppoyaa--MythoNemo-L3.1-70B-v1.0/snapshots/faaa4e992764eb4667b8f541dcf75ce8b7aaadcc as the base model.
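
For readers unfamiliar with Model Stock, the sketch below illustrates the idea on a single weight tensor: the fine-tuned weights are averaged, then pulled back toward the base by a ratio derived from the angle between their offsets from the base. This is a simplified illustration using NumPy, not mergekit's actual implementation. The function name `model_stock_merge` and the choice to average pairwise cosine similarities are assumptions made for this example.

```python
import numpy as np

def model_stock_merge(base, finetuned):
    """Merge one weight tensor with a simplified Model Stock rule.

    base: np.ndarray, the base model's tensor.
    finetuned: list of np.ndarray, the same tensor from each fine-tuned model.
    Returns (merged_tensor, t), where t is the interpolation ratio.
    """
    n = len(finetuned)
    if n < 2:
        raise ValueError("Model Stock needs at least two fine-tuned tensors")

    # Offsets ("task vectors") of each fine-tuned tensor from the base.
    deltas = [(w - base).ravel() for w in finetuned]

    # Assumption: estimate cos(theta) as the mean pairwise cosine similarity.
    cosines = []
    for i in range(n):
        for j in range(i + 1, n):
            denom = np.linalg.norm(deltas[i]) * np.linalg.norm(deltas[j])
            cosines.append(float(deltas[i] @ deltas[j]) / denom if denom > 0 else 0.0)
    cos_theta = float(np.mean(cosines))

    # Interpolation ratio: t = N cos(theta) / (1 + (N - 1) cos(theta)).
    t = n * cos_theta / (1 + (n - 1) * cos_theta)

    # Interpolate between the fine-tuned average and the base.
    w_avg = np.mean(finetuned, axis=0)
    return t * w_avg + (1 - t) * base, t
```

When the fine-tuned offsets are nearly orthogonal, t approaches 0 and the result stays close to the base; when they agree closely, t approaches 1 and the result approaches the plain average.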

Models Merged

The following models were included in the merge:

  • /workspace/cache/models--ArliAI--Llama-3.3-70B-ArliAI-RPMax-v1.4/snapshots/4288519ba279872651e29e430a85c728277cb71b
  • /workspace/cache/models--deepcogito--cogito-v1-preview-llama-70B/snapshots/1d624e2293b5b35f9cfd2349f8e02c7ebf32ca83
  • /workspace/cache/models--tdrussell--Llama-3-70B-Instruct-Storywriter/snapshots/19be2a7c6382a9150e126cf144e2b2964e700d3c
  • /workspace/cache/models--NousResearch--Hermes-3-Llama-3.1-70B/snapshots/1fc86da0ce9cdb14cd775ad270bc7d1b4bf70ede
  • /workspace/cache/models--watt-ai--watt-tool-70B/snapshots/dbe19344ec6ee4b9e1636e9e6ce24fc6a85a725e
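
The paths above are local Hugging Face cache locations, which follow the layout models--{org}--{name}/snapshots/{revision}. As a convenience, the sketch below maps such a path back to a Hub repo ID and revision hash. The helper name `parse_cache_path` is hypothetical and introduced only for this example.

```python
import re

# Matches the Hugging Face cache layout: models--{org}--{name}/snapshots/{revision}
CACHE_PATTERN = re.compile(
    r"models--(?P<org>[^/]+?)--(?P<name>[^/]+)/snapshots/(?P<rev>[0-9a-f]+)"
)

def parse_cache_path(path):
    """Return (repo_id, revision) for a Hugging Face cache snapshot path."""
    match = CACHE_PATTERN.search(path)
    if match is None:
        raise ValueError(f"not a Hugging Face cache snapshot path: {path}")
    return f"{match['org']}/{match['name']}", match["rev"]

if __name__ == "__main__":
    example = (
        "/workspace/cache/models--NousResearch--Hermes-3-Llama-3.1-70B"
        "/snapshots/1fc86da0ce9cdb14cd775ad270bc7d1b4bf70ede"
    )
    repo_id, revision = parse_cache_path(example)
    print(repo_id, revision)
```

Applied to the list above, this yields repo IDs such as NousResearch/Hermes-3-Llama-3.1-70B, each pinned to the snapshot revision that was used in the merge.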

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: /workspace/cache/models--NousResearch--Hermes-3-Llama-3.1-70B/snapshots/1fc86da0ce9cdb14cd775ad270bc7d1b4bf70ede
  - model: /workspace/cache/models--watt-ai--watt-tool-70B/snapshots/dbe19344ec6ee4b9e1636e9e6ce24fc6a85a725e
  - model: /workspace/cache/models--tdrussell--Llama-3-70B-Instruct-Storywriter/snapshots/19be2a7c6382a9150e126cf144e2b2964e700d3c
  - model: /workspace/cache/models--ArliAI--Llama-3.3-70B-ArliAI-RPMax-v1.4/snapshots/4288519ba279872651e29e430a85c728277cb71b
  - model: /workspace/cache/models--deepcogito--cogito-v1-preview-llama-70B/snapshots/1d624e2293b5b35f9cfd2349f8e02c7ebf32ca83
base_model: /workspace/cache/models--Ppoyaa--MythoNemo-L3.1-70B-v1.0/snapshots/faaa4e992764eb4667b8f541dcf75ce8b7aaadcc
merge_method: model_stock
tokenizer:
  source: base
chat_template: llama3
int8_mask: true
pad_to_multiple_of: 8
dtype: float32