Update README.md
README.md (CHANGED)
@@ -24,12 +24,16 @@ model-index:
   metrics:
   - name: pass@1
     type: pass@1
-    value:
+    value: 50.0
     verified: false
 ---

 <p><h1> speechless-code-mistral-7b-v1.0 </h1></p>

+* [AWQ model(s) for GPU inference.](https://huggingface.co/TheBloke/speechless-code-mistral-7B-v1.0-AWQ)
+* [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/speechless-code-mistral-7B-v1.0-GPTQ)
+* [2, 3, 4, 5, 6 and 8-bit GGUF models for CPU+GPU inference](https://huggingface.co/TheBloke/speechless-code-mistral-7B-v1.0-GGUF)
+
 Use the following dataset to fine-tune mistralai/Mistral-7B-v0.1 in order to improve the model's reasoning and planning abilities.

 Total 201,981 samples.
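The links added above point to TheBloke's quantised builds, which need their own loaders (autoawq, auto-gptq, or llama.cpp respectively). The base checkpoint loads with standard 🤗 Transformers; a minimal fp16 inference sketch, assuming the repo id `uukuguy/speechless-code-mistral-7b-v1.0` (inferred from the model name, not stated in this diff):

```python
# Minimal inference sketch. The repo id below is an assumption based on the
# model name; the quantised AWQ/GPTQ/GGUF repos linked above load differently.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "uukuguy/speechless-code-mistral-7b-v1.0"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

prompt = "Write a Python function that checks whether a string is a palindrome."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```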
@@ -41,6 +45,40 @@ Total 201,981 samples.
 - Spider: 8,659 samples


+
+## HumanEval
+
+| Metric | Value |
+| --- | --- |
+| humaneval-python | 50.0 |
+
+[Big Code Models Leaderboard](https://huggingface.co/spaces/bigcode/bigcode-models-leaderboard)
+
+CodeLlama-34B-Python: 53.29
+
+CodeLlama-34B-Instruct: 50.79
+
+CodeLlama-13B-Instruct: 50.6
+
+CodeLlama-34B: 45.11
+
+CodeLlama-13B-Python: 42.89
+
+CodeLlama-13B: 35.07
+
+## lm-evaluation-harness
+
+[Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+| Metric | Value |
+| --- | --- |
+| ARC | 59.64 |
+| HellaSwag | 82.25 |
+| MMLU | 61.33 |
+| TruthfulQA | 48.45 |
+| Average | 62.92 |
+
+## Parameters
+
 | Parameter | Value |
 |------ | ------ |
 | lr | 2e-4 |
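The `pass@1` value of 50.0 now recorded both in the metadata and in the new HumanEval table is conventionally computed with the unbiased estimator from the Codex paper (Chen et al., 2021). A sketch of that formula for reference; this is not the evaluation code behind the numbers above:

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimator (Chen et al., 2021).

    n: completions sampled per problem; c: completions passing the tests.
    """
    if n - c < k:
        return 1.0
    return 1.0 - np.prod(1.0 - k / np.arange(n - c + 1, n + 1))

# Example: 20 samples per problem, 10 of which pass -> pass@1 = 0.5
print(pass_at_k(n=20, c=10, k=1))  # 0.5
```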
@@ -75,32 +113,3 @@ A40-48G x 2
 | eval_runtime | 0:00:25.04 |
 | eval_samples_per_second | 7.985 |
 | eval_steps_per_second | |
-
-| Metric | Value |
-| --- | --- |
-| humaneval-python | |
-
-[Big Code Models Leaderboard](https://huggingface.co/spaces/bigcode/bigcode-models-leaderboard)
-
-CodeLlama-34B-Python: 53.29
-
-CodeLlama-34B-Instruct: 50.79
-
-CodeLlama-13B-Instruct: 50.6
-
-CodeLlama-34B: 45.11
-
-CodeLlama-13B-Python: 42.89
-
-CodeLlama-13B: 35.07
-
-[Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
-| Metric | Value |
-| --- | --- |
-| ARC | |
-| HellaSwag | |
-| MMLU | |
-| TruthfulQA | |
-| Average | |
-
-
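The ARC/HellaSwag/MMLU/TruthfulQA scores added above come from lm-evaluation-harness, per the new section heading. A hedged sketch of how such numbers are typically produced with the harness's Python entry point; the repo id, task name, and few-shot count are assumptions (25-shot ARC follows Open LLM Leaderboard convention), and the exact API varies between harness versions:

```python
# Sketch only: the lm-evaluation-harness API and task names differ across
# versions; nothing here is taken from the model card itself.
from lm_eval import evaluator

results = evaluator.simple_evaluate(
    model="hf-causal",  # HF autoregressive backend in the older harness API
    model_args="pretrained=uukuguy/speechless-code-mistral-7b-v1.0",  # assumed repo id
    tasks=["arc_challenge"],
    num_fewshot=25,  # Open LLM Leaderboard convention for ARC
    batch_size=8,
)
print(results["results"]["arc_challenge"])
```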
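The Parameters table records `lr 2e-4`, and its eval rows imply roughly 200 evaluation samples (25.04 s at 7.985 samples/s), on the 2 x A40-48G setup the card mentions. For orientation, a 🤗 `TrainingArguments` sketch using that learning rate; every other value is an illustrative assumption, not taken from the card:

```python
# Illustrative only: learning_rate matches the Parameters table; all other
# values are assumptions made to keep the example runnable.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="speechless-code-mistral-7b-v1.0",
    learning_rate=2e-4,              # from the Parameters table
    num_train_epochs=3,              # assumption
    per_device_train_batch_size=4,   # assumption (card: 2 x A40-48G GPUs)
    gradient_accumulation_steps=8,   # assumption
    bf16=True,                       # assumption
)
```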