| --- |
| library_name: transformers |
| tags: |
| - mergekit |
| - merge |
| - conversational |
| - chat |
| - instruct |
| base_model: |
| - meta-llama/Llama-3.1-8B-Instruct |
| - sequelbox/Llama3.1-8B-MOTH |
| - ValiantLabs/Llama3.1-8B-ShiningValiant2 |
| license: llama3.1 |
| model-index: |
| - name: Llama3.1-8B-PlumChat |
| results: |
| - task: |
| type: text-generation |
| name: Text Generation |
| dataset: |
| name: Winogrande (5-Shot) |
| type: Winogrande |
| args: |
| num_few_shot: 5 |
| metrics: |
| - type: acc |
| value: 72.22 |
| name: acc |
| - task: |
| type: text-generation |
| name: Text Generation |
| dataset: |
| name: IFEval (0-Shot) |
| type: HuggingFaceH4/ifeval |
| args: |
| num_few_shot: 0 |
| metrics: |
| - type: inst_level_strict_acc and prompt_level_strict_acc |
| value: 42.43 |
| name: strict accuracy |
| source: |
| url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sequelbox/Llama3.1-8B-PlumChat |
| name: Open LLM Leaderboard |
| - task: |
| type: text-generation |
| name: Text Generation |
| dataset: |
| name: BBH (3-Shot) |
| type: BBH |
| args: |
| num_few_shot: 3 |
| metrics: |
| - type: acc_norm |
| value: 13.94 |
| name: normalized accuracy |
| source: |
| url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sequelbox/Llama3.1-8B-PlumChat |
| name: Open LLM Leaderboard |
| - task: |
| type: text-generation |
| name: Text Generation |
| dataset: |
| name: MATH Lvl 5 (4-Shot) |
| type: hendrycks/competition_math |
| args: |
| num_few_shot: 4 |
| metrics: |
| - type: exact_match |
| value: 3.1 |
| name: exact match |
| source: |
| url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sequelbox/Llama3.1-8B-PlumChat |
| name: Open LLM Leaderboard |
| - task: |
| type: text-generation |
| name: Text Generation |
| dataset: |
| name: GPQA (0-shot) |
| type: Idavidrein/gpqa |
| args: |
| num_few_shot: 0 |
| metrics: |
| - type: acc_norm |
| value: 2.01 |
| name: acc_norm |
| source: |
| url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sequelbox/Llama3.1-8B-PlumChat |
| name: Open LLM Leaderboard |
| - task: |
| type: text-generation |
| name: Text Generation |
| dataset: |
| name: MuSR (0-shot) |
| type: TAUR-Lab/MuSR |
| args: |
| num_few_shot: 0 |
| metrics: |
| - type: acc_norm |
| value: 4.77 |
| name: acc_norm |
| source: |
| url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sequelbox/Llama3.1-8B-PlumChat |
| name: Open LLM Leaderboard |
| - task: |
| type: text-generation |
| name: Text Generation |
| dataset: |
| name: MMLU-PRO (5-shot) |
| type: TIGER-Lab/MMLU-Pro |
| config: main |
| split: test |
| args: |
| num_few_shot: 5 |
| metrics: |
| - type: acc |
| value: 12.52 |
| name: accuracy |
| source: |
| url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=sequelbox/Llama3.1-8B-PlumChat |
| name: Open LLM Leaderboard |
| --- |
| # PlumChat |
|
|
| This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). |
|
|
| ## Merge Details |
| ### Merge Method |
|
|
| This model was merged using the della merge method using [meta-llama/Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) as a base. |
|
|
| ### Models Merged |
|
|
| The following models were included in the merge: |
| * [ValiantLabs/Llama3.1-8B-ShiningValiant2](https://huggingface.co/ValiantLabs/Llama3.1-8B-ShiningValiant2) |
| * [sequelbox/Llama3.1-8B-MOTH](https://huggingface.co/sequelbox/Llama3.1-8B-MOTH) |
|
|
| ### Configuration |
|
|
| The following YAML configuration was used to produce this model: |
|
|
| ```yaml |
| merge_method: della |
| dtype: bfloat16 |
| parameters: |
| normalize: true |
| models: |
| - model: ValiantLabs/Llama3.1-8B-ShiningValiant2 |
| parameters: |
| density: 0.5 |
| weight: 0.3 |
| - model: sequelbox/Llama3.1-8B-MOTH |
| parameters: |
| density: 0.5 |
| weight: 0.21 |
| base_model: meta-llama/Llama-3.1-8B-Instruct |
| |
| ``` |
|
|
| # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard) |
| Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_sequelbox__Llama3.1-8B-PlumChat) |
|
|
| | Metric |Value| |
| |-------------------|----:| |
| |Avg. |13.13| |
| |IFEval (0-Shot) |42.43| |
| |BBH (3-Shot) |13.94| |
| |MATH Lvl 5 (4-Shot)| 3.10| |
| |GPQA (0-shot) | 2.01| |
| |MuSR (0-shot) | 4.77| |
| |MMLU-PRO (5-shot) |12.52| |
|
|
|
|