TitleOS
/

Eve-4b_LoRA

Text Generation

eve-secure-coder

text-generation-inference

Model card Files Files and versions

TitleOS commited on Feb 27

Commit

0fae950

·

verified ·

1 Parent(s): a0cde84

Update README.md

Files changed (1) hide show

README.md +19 -0

README.md CHANGED Viewed

@@ -71,6 +71,25 @@ It is ideal for:
 * Users seeking a coding assistant that offers conversation and personality alongside technical utility.
 * Coding without safety or moral refusals.
 ## Limitations & Warning
 * **No Guardrails:** As a result of the Heretic process, this model has no safety filters. It will generate output for any request. Users are solely responsible for how they utilize the model's output.

 * Users seeking a coding assistant that offers conversation and personality alongside technical utility.
 * Coding without safety or moral refusals.
+## Benchmarking
+Benchmarking is on-going, with a number of evaluations runs. So far, the following score are available:
+1. LiveCodeBench (Code Generation Lite - Release v2)
+Pass@1 (Quantization Q8_0): 26.22% (Passed 134 out of 511 problems)
+| Comparable Model | Parameter Size / Tier | Approximate Pass@1 |
+| :--- | :--- | :--- |
+| LLama-3-70b-Instruct | 70B | ~28.3% |
+| GPT-4o-mini (2024-07) | Small Proprietary | ~27.7% |
+| Claude 3 Sonnet (Original) | Large Proprietary | ~26.9% |
+| Mixtral-8x22B-Instruct | 141B (MoE) | ~26.4% |
+| **Eve-4B (Q8_0)** | 4B (Quantized) | 26.22% |
+| Mistral-Large | Large Proprietary | ~26.0% |
+| GPT-3.5-Turbo-0125 | Mid Proprietary | ~24.6% |
+| Claude 3 Haiku | Small Proprietary | ~24.5% |
+| Codestral-Latest | 22B | ~23.8% |
+| Llama-3-8b-Instruct | 8B | ~15.3% |
 ## Limitations & Warning
 * **No Guardrails:** As a result of the Heretic process, this model has no safety filters. It will generate output for any request. Users are solely responsible for how they utilize the model's output.