morriszms committed on
Commit 3a7a54a · verified · 1 Parent(s): fa5e434

Upload folder using huggingface_hub

.gitattributes CHANGED
@@ -33,3 +33,15 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ NeuralMonarch-7B-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
37
+ NeuralMonarch-7B-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
38
+ NeuralMonarch-7B-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
39
+ NeuralMonarch-7B-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
40
+ NeuralMonarch-7B-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
41
+ NeuralMonarch-7B-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
42
+ NeuralMonarch-7B-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
43
+ NeuralMonarch-7B-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
44
+ NeuralMonarch-7B-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
45
+ NeuralMonarch-7B-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
46
+ NeuralMonarch-7B-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
47
+ NeuralMonarch-7B-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
NeuralMonarch-7B-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1e5fd1fecbdf57088f0c222bbc35cd24db36adef811c3bacea037c62e93e2cc7
3
+ size 2719243008
NeuralMonarch-7B-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6dea4d7cf2c8dce7ce772be056a69f1b3ec308cd0993080546ee6021930c3de5
3
+ size 3822025472
NeuralMonarch-7B-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:48ba65c169ba471155449c05d2300710e20095999f7565d564710e1929bef932
3
+ size 3518987008
NeuralMonarch-7B-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:344a046f3ea8ddcc9c3a3eac1de7d691320cf131aebbcc771dff8fa2085472d2
3
+ size 3164568320
NeuralMonarch-7B-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:99b0b3f20be4fffd9bc14d983c87222266e4737a16c890b80c84ed21620070e5
3
+ size 4108917504
NeuralMonarch-7B-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:848ba0785eaf7a2494b202d9b92da7d192c1bd454002cc23f1cfa3093b3db68a
3
+ size 4368440064
NeuralMonarch-7B-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1c29ba03b30857bad3a8fbf06ab07d14842706c2485a394d518351ad54fda1cc
3
+ size 4140374784
NeuralMonarch-7B-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:69949e0b73b6313ee54d9e7fb0f4f03a38b1e7a14f7717afee1f136e9c65f682
3
+ size 4997716736
NeuralMonarch-7B-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ba44606cf09d66dc450e4ed9d20f17c50a17f5128f6eb71ed87b07ab5265ca7d
3
+ size 5131410176
NeuralMonarch-7B-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:814e73b7e5812cb5c82ea5d51035b362aa1b5ea69db6642ab92b69c0b42efe68
3
+ size 4997716736
NeuralMonarch-7B-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eb3dcbd25abf0191a6e3028a9621f0b063223fb2552c703246019cead22f1f96
3
+ size 5942065920
NeuralMonarch-7B-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9e63e8dc502aa8783e3c050f5731a2fcb424e3628be834abc8fa05b6800798ca
3
+ size 7695858432
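
Each pointer file above records a `sha256` object ID and a byte size for the real GGUF file stored in Git LFS. As a minimal sketch (not part of this commit), a downloaded file could be checked against its pointer as follows. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers introduced for illustration, and the sketch assumes the `oid` always uses SHA-256, as it does in every pointer in this diff.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and SHA-256 digest match the pointer."""
    # Cheap check first: a size mismatch means we can skip hashing entirely.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.sha256()  # assumption: pointer["algo"] == "sha256"
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte files are never loaded whole.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents for NeuralMonarch-7B-Q2_K.gguf, copied from the diff above.
Q2_K_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:1e5fd1fecbdf57088f0c222bbc35cd24db36adef811c3bacea037c62e93e2cc7
size 2719243008
"""

pointer = parse_lfs_pointer(Q2_K_POINTER)
```

After a download, `matches_pointer(Path("MY_LOCAL_DIR/NeuralMonarch-7B-Q2_K.gguf"), pointer)` would report whether the local copy is intact. The size comparison runs first because it is far cheaper than hashing several gigabytes.
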
README.md ADDED
@@ -0,0 +1,186 @@
1
+ ---
2
+ language:
3
+ - en
4
+ license: cc-by-nc-4.0
5
+ tags:
6
+ - merge
7
+ - lazymergekit
8
+ - dpo
9
+ - rlhf
10
+ - TensorBlock
11
+ - GGUF
12
+ datasets:
13
+ - mlabonne/truthy-dpo-v0.1
14
+ - mlabonne/distilabel-intel-orca-dpo-pairs
15
+ base_model: mlabonne/NeuralMonarch-7B
16
+ model-index:
17
+ - name: NeuralMonarch-7B
18
+ results:
19
+ - task:
20
+ type: text-generation
21
+ name: Text Generation
22
+ dataset:
23
+ name: AI2 Reasoning Challenge (25-Shot)
24
+ type: ai2_arc
25
+ config: ARC-Challenge
26
+ split: test
27
+ args:
28
+ num_few_shot: 25
29
+ metrics:
30
+ - type: acc_norm
31
+ value: 73.21
32
+ name: normalized accuracy
33
+ source:
34
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=mlabonne/NeuralMonarch-7B
35
+ name: Open LLM Leaderboard
36
+ - task:
37
+ type: text-generation
38
+ name: Text Generation
39
+ dataset:
40
+ name: HellaSwag (10-Shot)
41
+ type: hellaswag
42
+ split: validation
43
+ args:
44
+ num_few_shot: 10
45
+ metrics:
46
+ - type: acc_norm
47
+ value: 89.09
48
+ name: normalized accuracy
49
+ source:
50
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=mlabonne/NeuralMonarch-7B
51
+ name: Open LLM Leaderboard
52
+ - task:
53
+ type: text-generation
54
+ name: Text Generation
55
+ dataset:
56
+ name: MMLU (5-Shot)
57
+ type: cais/mmlu
58
+ config: all
59
+ split: test
60
+ args:
61
+ num_few_shot: 5
62
+ metrics:
63
+ - type: acc
64
+ value: 64.41
65
+ name: accuracy
66
+ source:
67
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=mlabonne/NeuralMonarch-7B
68
+ name: Open LLM Leaderboard
69
+ - task:
70
+ type: text-generation
71
+ name: Text Generation
72
+ dataset:
73
+ name: TruthfulQA (0-shot)
74
+ type: truthful_qa
75
+ config: multiple_choice
76
+ split: validation
77
+ args:
78
+ num_few_shot: 0
79
+ metrics:
80
+ - type: mc2
81
+ value: 77.79
82
+ source:
83
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=mlabonne/NeuralMonarch-7B
84
+ name: Open LLM Leaderboard
85
+ - task:
86
+ type: text-generation
87
+ name: Text Generation
88
+ dataset:
89
+ name: Winogrande (5-shot)
90
+ type: winogrande
91
+ config: winogrande_xl
92
+ split: validation
93
+ args:
94
+ num_few_shot: 5
95
+ metrics:
96
+ - type: acc
97
+ value: 84.61
98
+ name: accuracy
99
+ source:
100
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=mlabonne/NeuralMonarch-7B
101
+ name: Open LLM Leaderboard
102
+ - task:
103
+ type: text-generation
104
+ name: Text Generation
105
+ dataset:
106
+ name: GSM8k (5-shot)
107
+ type: gsm8k
108
+ config: main
109
+ split: test
110
+ args:
111
+ num_few_shot: 5
112
+ metrics:
113
+ - type: acc
114
+ value: 67.78
115
+ name: accuracy
116
+ source:
117
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=mlabonne/NeuralMonarch-7B
118
+ name: Open LLM Leaderboard
119
+ ---
120
+
121
+ <div style="width: auto; margin-left: auto; margin-right: auto">
122
+ <img src="https://i.imgur.com/jC7kdl8.jpeg" alt="TensorBlock" style="width: 100%; min-width: 400px; display: block; margin: auto;">
123
+ </div>
124
+ <div style="display: flex; justify-content: space-between; width: 100%;">
125
+ <div style="display: flex; flex-direction: column; align-items: flex-start;">
126
+ <p style="margin-top: 0.5em; margin-bottom: 0em;">
127
+ Feedback and support: TensorBlock's <a href="https://x.com/tensorblock_aoi">Twitter/X</a>, <a href="https://t.me/TensorBlock">Telegram Group</a> and <a href="https://x.com/tensorblock_aoi">Discord server</a>
128
+ </p>
129
+ </div>
130
+ </div>
131
+
132
+ ## mlabonne/NeuralMonarch-7B - GGUF
133
+
134
+ This repo contains GGUF format model files for [mlabonne/NeuralMonarch-7B](https://huggingface.co/mlabonne/NeuralMonarch-7B).
135
+
136
+ The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).
137
+
138
+ ## Prompt template
139
+
140
+ ```
141
+ <s>system
142
+ {system_prompt}</s>
143
+ <s>user
144
+ {prompt}</s>
145
+ <s>assistant
146
+ ```
147
+
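
The template above can be filled in programmatically. The following is a minimal sketch, not code from this repository: `build_prompt` and its default system prompt are hypothetical, and the trailing newline after `<s>assistant` is an assumption, since the README does not show what follows that line.

```python
# Template copied from the README; the final "\n" is an assumption.
PROMPT_TEMPLATE = (
    "<s>system\n"
    "{system_prompt}</s>\n"
    "<s>user\n"
    "{prompt}</s>\n"
    "<s>assistant\n"
)


def build_prompt(prompt: str, system_prompt: str = "You are a helpful assistant.") -> str:
    """Fill the README's prompt template with a system and a user message."""
    # str.format only interprets braces in the template itself, so braces
    # inside the user-supplied strings are inserted unchanged.
    return PROMPT_TEMPLATE.format(system_prompt=system_prompt, prompt=prompt)


print(build_prompt("Summarize GGUF in one sentence."))
```

The returned string is meant to be passed as the raw prompt to whatever runtime loads the GGUF file. Some runtimes add a beginning-of-sequence token on their own, so check the tokenized prompt for a duplicated `<s>`.
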
148
+ ## Model file specification
149
+
150
+ | Filename | Quant type | File Size | Description |
151
+ | -------- | ---------- | --------- | ----------- |
152
+ | [NeuralMonarch-7B-Q2_K.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q2_K.gguf) | Q2_K | 2.532 GB | smallest, significant quality loss - not recommended for most purposes |
153
+ | [NeuralMonarch-7B-Q3_K_S.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q3_K_S.gguf) | Q3_K_S | 2.947 GB | very small, high quality loss |
154
+ | [NeuralMonarch-7B-Q3_K_M.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q3_K_M.gguf) | Q3_K_M | 3.277 GB | very small, high quality loss |
155
+ | [NeuralMonarch-7B-Q3_K_L.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q3_K_L.gguf) | Q3_K_L | 3.560 GB | small, substantial quality loss |
156
+ | [NeuralMonarch-7B-Q4_0.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q4_0.gguf) | Q4_0 | 3.827 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
157
+ | [NeuralMonarch-7B-Q4_K_S.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q4_K_S.gguf) | Q4_K_S | 3.856 GB | small, greater quality loss |
158
+ | [NeuralMonarch-7B-Q4_K_M.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q4_K_M.gguf) | Q4_K_M | 4.068 GB | medium, balanced quality - recommended |
159
+ | [NeuralMonarch-7B-Q5_0.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q5_0.gguf) | Q5_0 | 4.654 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
160
+ | [NeuralMonarch-7B-Q5_K_S.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q5_K_S.gguf) | Q5_K_S | 4.654 GB | large, low quality loss - recommended |
161
+ | [NeuralMonarch-7B-Q5_K_M.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q5_K_M.gguf) | Q5_K_M | 4.779 GB | large, very low quality loss - recommended |
162
+ | [NeuralMonarch-7B-Q6_K.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q6_K.gguf) | Q6_K | 5.534 GB | very large, extremely low quality loss |
163
+ | [NeuralMonarch-7B-Q8_0.gguf](https://huggingface.co/tensorblock/NeuralMonarch-7B-GGUF/tree/main/NeuralMonarch-7B-Q8_0.gguf) | Q8_0 | 7.167 GB | very large, extremely low quality loss - not recommended |
164
+
165
+
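
The "File Size" column can be cross-checked against the byte counts in the Git LFS pointers from the same commit. The sketch below is illustrative only (`size_in_gib` is a hypothetical helper). It suggests that the table's "GB" values are binary gigabytes (GiB, 1024³ bytes) rather than decimal gigabytes (10⁹ bytes).

```python
GIB = 1024 ** 3  # bytes per binary gigabyte (GiB)

# Byte counts copied from the Git LFS pointers in this commit.
LFS_SIZES = {
    "Q2_K": 2719243008,
    "Q4_K_M": 4368440064,
    "Q8_0": 7695858432,
}


def size_in_gib(num_bytes: int) -> float:
    """Convert a byte count to GiB, rounded to three decimals like the table."""
    return round(num_bytes / GIB, 3)


for quant, num_bytes in LFS_SIZES.items():
    print(f"{quant}: {size_in_gib(num_bytes):.3f} GiB")
```

When budgeting disk space or memory, keep in mind that a tool reporting decimal gigabytes shows a larger number for the same file, for example about 2.72 GB for the Q2_K file.
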
166
+ ## Download instructions
167
+
168
+ ### Command line
169
+
170
+ First, install the Hugging Face Hub CLI:
171
+
172
+ ```shell
173
+ pip install -U "huggingface_hub[cli]"
174
+ ```
175
+
176
+ Then, download an individual model file to a local directory:
177
+
178
+ ```shell
179
+ huggingface-cli download tensorblock/NeuralMonarch-7B-GGUF --include "NeuralMonarch-7B-Q2_K.gguf" --local-dir MY_LOCAL_DIR
180
+ ```
181
+
182
+ To download multiple model files that match a pattern (e.g., `*Q4_K*gguf`), you can try:
183
+
184
+ ```shell
185
+ huggingface-cli download tensorblock/NeuralMonarch-7B-GGUF --local-dir MY_LOCAL_DIR --local-dir-use-symlinks False --include='*Q4_K*gguf'
186
+ ```
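
Before starting a multi-gigabyte download, it can help to preview which files a pattern selects. The sketch below is an illustration, not part of the README. `preview_matches` and `download_matching` are hypothetical helpers, and the preview assumes that the Hub matches patterns with `fnmatch`-style globbing, which is my understanding of `allow_patterns` but is not stated in this commit. The download path uses `snapshot_download` from `huggingface_hub` and is imported lazily so the preview runs without that package installed.

```python
from fnmatch import fnmatchcase

REPO_ID = "tensorblock/NeuralMonarch-7B-GGUF"

# Quant types added by this commit, in .gitattributes order.
QUANTS = ["Q2_K", "Q3_K_L", "Q3_K_M", "Q3_K_S", "Q4_0", "Q4_K_M",
          "Q4_K_S", "Q5_0", "Q5_K_M", "Q5_K_S", "Q6_K", "Q8_0"]
FILENAMES = [f"NeuralMonarch-7B-{quant}.gguf" for quant in QUANTS]


def preview_matches(pattern: str) -> list[str]:
    """List the known filenames that a glob pattern would select."""
    return [name for name in FILENAMES if fnmatchcase(name, pattern)]


def download_matching(pattern: str, local_dir: str) -> str:
    """Download matching files; needs `pip install huggingface_hub` and network access."""
    from huggingface_hub import snapshot_download  # lazy import, optional dependency

    return snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=[pattern],
        local_dir=local_dir,
    )


print(preview_matches("*Q4_K*gguf"))
```

Calling `download_matching("*Q4_K*gguf", "MY_LOCAL_DIR")` would then fetch only the previewed files. The filename list is hard-coded from this commit, so it will not reflect files added to the repository later.
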