Merge method from the paper [Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch](https://huggingface.co/papers/2311.03099) (arXiv:2311.03099).
NOTES: This model seems to be overly confident, which leads to hallucinations, and normalization also appears to have broken long-context chaining. I do not recommend this model.
Thanks to @Epiculous for the dope model, the help with LLM backends, and the support overall.
I'd also like to thank @kalomaze for the dope sampler additions to ST.
@SanjiWatsuki Thank you very much for the help, and the model!

Quants here, thanks to @Lewdiculus: [Lewdiculous/Kunocchini-1.2-7b-longtext-GGUF-Imatrix](https://huggingface.co/Lewdiculous/Kunocchini-1.2-7b-longtext-GGUF-Imatrix)
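For local inference on the GGUF quants, something like the following should work with llama-cpp-python. This is a minimal sketch: the quant filename below is an assumption, so check the quant repo linked above for the actual file names.

```python
# Minimal llama-cpp-python sketch for running a GGUF quant locally.
# The filename below is hypothetical; pick an actual file from the
# Lewdiculous/Kunocchini-1.2-7b-longtext-GGUF-Imatrix repo.
from llama_cpp import Llama

llm = Llama(
    model_path="Kunocchini-1.2-7b-longtext.Q4_K_M.gguf",  # hypothetical filename
    n_ctx=8192,       # raise for long-context use, at the cost of memory
    n_gpu_layers=-1,  # offload all layers to GPU if one is available
)

out = llm("Write a short greeting.", max_tokens=64)
print(out["choices"][0]["text"])
```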
This model was merged using the DARE TIES merge method, with Test157t/Kunocchini-1.1-7b as the base.
The following models were included in the merge:
- NousResearch/Yarn-Mistral-7b-128k
- Test157t/Kunocchini-1.1-7b
The following YAML configuration was used to produce this model:

```yaml
merge_method: dare_ties
base_model: Test157t/Kunocchini-1.1-7b
parameters:
  normalize: true
models:
  - model: NousResearch/Yarn-Mistral-7b-128k
    parameters:
      weight: 1
  - model: Test157t/Kunocchini-1.1-7b
    parameters:
      weight: 1
dtype: float16
```
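To reproduce the merge, mergekit can be run directly on the config above. Here is a minimal sketch using mergekit's Python API (`MergeConfiguration` / `run_merge`); the config path and output directory are placeholders, and option names may differ between mergekit versions.

```python
# Sketch of reproducing the merge with mergekit's Python API.
# Assumes the YAML above is saved as config.yml; option names follow
# mergekit's documented example and may vary across versions.
import yaml
from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

with open("config.yml", encoding="utf-8") as f:
    config = MergeConfiguration.model_validate(yaml.safe_load(f))

run_merge(
    config,
    out_path="./Kunocchini-1.2-7b-longtext",  # output directory (placeholder)
    options=MergeOptions(
        cuda=False,           # set True to merge on GPU
        copy_tokenizer=True,  # copy the base model's tokenizer into the output
    ),
)
```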
Open LLM Leaderboard evaluation results:
| Metric | Value |
|---|---|
| Avg. | 59.57 |
| AI2 Reasoning Challenge (25-shot) | 59.90 |
| HellaSwag (10-shot) | 82.51 |
| MMLU (5-shot) | 63.05 |
| TruthfulQA (0-shot) | 41.72 |
| Winogrande (5-shot) | 77.35 |
| GSM8k (5-shot) | 32.90 |
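These are Open LLM Leaderboard-style benchmarks, so scores in this range can be approximated locally with lm-evaluation-harness. Below is a sketch for one row of the table (ARC, 25-shot), assuming lm-eval v0.4+ and its `simple_evaluate` API; the model ID is inferred from the quant repo name, and harness version differences can shift scores relative to the leaderboard.

```python
# Sketch: re-run one row of the table (ARC, 25-shot) with
# lm-evaluation-harness (v0.4+ API assumed). The pretrained model ID is
# an assumption inferred from the quant repo name, and leaderboard
# numbers come from a pinned harness version, so local scores may differ.
from lm_eval import simple_evaluate

results = simple_evaluate(
    model="hf",
    model_args="pretrained=Test157t/Kunocchini-1.2-7b-longtext,dtype=float16",
    tasks=["arc_challenge"],
    num_fewshot=25,
)
print(results["results"]["arc_challenge"])
```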