Update README.md
README.md CHANGED
@@ -205,7 +205,7 @@ All models are evaluated in chat mode (e.g. with the respective conversation tem
 
 | Model                       | Size   | HumanEval+ pass@1 |
 |-----------------------------|--------|-------------------|
-| **OpenChat
+| **OpenChat-3.5-0106**       | **7B** | **65.9**          |
 | ChatGPT (December 12, 2023) | -      | 64.6              |
 | WizardCoder-Python-34B-V1.0 | 34B    | 64.6              |
 | OpenChat 3.5 1210           | 7B     | 63.4              |
@@ -215,7 +215,7 @@ All models are evaluated in chat mode (e.g. with the respective conversation tem
 <h3>OpenChat-3.5 vs. Grok</h3>
 </div>
 
-🔥 OpenChat-3.5
+🔥 OpenChat-3.5-0106 (7B) now outperforms Grok-0 (33B) on **all 4 benchmarks** and Grok-1 (???B) on average and **3/4 benchmarks**.
 
 | | License | # Param | Average | MMLU | HumanEval | MATH | GSM8k |
 |-----------------------|-------------|---------|----------|--------|-----------|----------|----------|