Commit Β·
9603be0
1
Parent(s): b9a70f8
Update README.md
Browse files
README.md
CHANGED
|
@@ -71,36 +71,25 @@ pinned: false
|
|
| 71 |
|
| 72 |
|
| 73 |
<hr>
|
| 74 |
-
<
|
| 75 |
-
<a href="https://huggingface.co/openchat/openchat_3.5" style="text-decoration: none; color:
|
| 76 |
-
<span style="font-size: 1.
|
| 77 |
-
|
| 78 |
-
<
|
| 79 |
-
|
| 80 |
-
|
| 81 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
| 82 |
</span>
|
| 83 |
</a>
|
| 84 |
-
<
|
| 85 |
-
<button class="common-button">Model Repo</button>
|
| 86 |
-
</a>
|
| 87 |
-
<a href="https://openchat.team">
|
| 88 |
-
<button class="common-button">OpenChatUI Demo</button>
|
| 89 |
-
</a>
|
| 90 |
-
<a href="https://huggingface.co/spaces/openchat/openchat_3.5">
|
| 91 |
-
<button class="common-button">HuggingFace Space</button>
|
| 92 |
-
</a>
|
| 93 |
-
<a href="https://arxiv.org/pdf/2309.11235.pdf">
|
| 94 |
-
<button class="common-button">Paper</button>
|
| 95 |
-
</a>
|
| 96 |
-
-->
|
| 97 |
-
</p>
|
| 98 |
-
|
| 99 |
|
| 100 |
-
|
| 101 |
-
<img src="https://github.com/alpayariyak/openchat/blob/master/assets/
|
| 102 |
-
|
| 103 |
-
</p>
|
| 104 |
|
| 105 |
<h1 style="vertical-align: middle;">
|
| 106 |
<img src="https://github.com/alpayariyak/openchat/blob/master/assets/logo_nobg.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 0px; margin-top: 0px; margin-bottom: 0px;"/>About OpenChat
|
|
@@ -120,26 +109,24 @@ pinned: false
|
|
| 120 |
|
| 121 |
# π Benchmarks
|
| 122 |
|
| 123 |
-
| Model | # Params | Average | MT-Bench |
|
| 124 |
-
|--------------------|----------|----------|--------------|----------
|
| 125 |
-
| OpenChat-3.5
|
| 126 |
-
|
|
| 127 |
-
|
|
| 128 |
-
|
|
| 129 |
-
|
|
| 130 |
-
|
|
| 131 |
-
|
|
| 132 |
-
|
|
| 133 |
-
| | | | WizardLM 70B | Orca 13B | Orca 13B | Platypus2 70B | WizardLM 70B | WizardCoder 34B | Flan-T5 11B | MetaMath 70B |
|
| 134 |
-
|
| 135 |
-
|
| 136 |
## π Comparison with [X.AI Grok](https://x.ai/)
|
| 137 |
|
| 138 |
-
|
|
| 139 |
-
|--------------|-------------|---------|----------|------|-----------|----------|----------|
|
| 140 |
-
| OpenChat 3.5 | Apache-2.0 | 7B
|
| 141 |
-
|
|
| 142 |
-
| Grok-
|
|
|
|
| 143 |
|
| 144 |
# πContact
|
| 145 |
|
|
|
|
| 71 |
|
| 72 |
|
| 73 |
<hr>
|
| 74 |
+
<div style="background-color: white; padding: 0.7em; border-radius: 0.5em; color: black; display: flex; flex-direction: column; justify-content: center; text-align: center; ont-size: 0.5em;">
|
| 75 |
+
<a href="https://huggingface.co/openchat/openchat_3.5" style="text-decoration: none; color: black;">
|
| 76 |
+
<span style="font-size: 1.7em; font-family: 'Helvetica'; letter-spacing: 0.1em; font-weight: bold; color: black;">OPENCHAT</span><span style="font-size: 1.8em; font-family: 'Helvetica'; color: #3c72db; ">3.5</span>
|
| 77 |
+
<span style="font-size: 0.7em; font-family: 'Helvetica'; color: white; vertical-align: top; background-color:red; border-radius: 6em; padding: 0.066em 0.4em; letter-spacing: 0.1em; font-weight: bold;">1210</span>
|
| 78 |
+
<span style="font-size: 0.85em; font-family: 'Helvetica'; color: black;">
|
| 79 |
+
<br> π The Overall Best Performing Open Source 7B Model π
|
| 80 |
+
<br> π€ Outperforms <span style="font-weight: bold;">ChatGPT</span> (March) and <span style="font-weight: bold;">Grok-1</span> π€
|
| 81 |
+
<br> π<span style="font-size: 1em; font-family: 'Helvetica'; color: black; font-weight: bold;">15</span>-point improvement in Coding over <span style="font-size: 0.9em;
|
| 82 |
+
font-family: 'Helvetica'; color: black; font-weight: bold;">OpenChat-3.5π</span>
|
| 83 |
+
<br><br><span style="font-size: 1em; font-family: 'Helvetica'; color: #3c72db; font-weight: bold;">New Features</span>
|
| 84 |
+
<br> π‘ 2 Modes: Coding + Generalist, Mathematical Reasoning π‘
|
| 85 |
+
<br> π§ββοΈ Experimental support for Evaluator and Feedback capabilities π§ββοΈ
|
| 86 |
</span>
|
| 87 |
</a>
|
| 88 |
+
</div>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 89 |
|
| 90 |
+
<div style="display: flex; justify-content: center; align-items: center">
|
| 91 |
+
<img src="https://github.com/alpayariyak/openchat/blob/master/assets/1210bench.png?raw=true" style="width: 100%; border-radius: 1em">
|
| 92 |
+
</div>
|
|
|
|
| 93 |
|
| 94 |
<h1 style="vertical-align: middle;">
|
| 95 |
<img src="https://github.com/alpayariyak/openchat/blob/master/assets/logo_nobg.png?raw=true" alt="OpenChat Logo" style="width:20px; vertical-align: middle; display: inline-block; margin-right: 5px; margin-left: 0px; margin-top: 0px; margin-bottom: 0px;"/>About OpenChat
|
|
|
|
| 109 |
|
| 110 |
# π Benchmarks
|
| 111 |
|
| 112 |
+
| Model | # Params | Average | MT-Bench | HumanEval | BBH MC | AGIEval | TruthfulQA | MMLU | GSM8K | BBH CoT |
|
| 113 |
+
|--------------------|----------|----------|--------------|-----------------|----------|----------|---------------|--------------|--------------|-------------|
|
| 114 |
+
| OpenChat-3.5-1210 | **7B** | **63.8** | 7.76 | **68.9** | **49.5** | **48.0** | **61.8** | 65.3 | **77.3** | 61.8 |
|
| 115 |
+
| OpenChat-3.5 | **7B** | 61.6 | 7.81 | 55.5 | 47.6 | 47.4 | 59.1 | 64.3 | **77.3** | 63.5 |
|
| 116 |
+
| ChatGPT (March)* | ? | 61.5 | **7.94** | 48.1 | 47.6 | 47.1 | 57.7 | **67.3** | 74.9 | **70.1** |
|
| 117 |
+
| | | | | | | | | | | |
|
| 118 |
+
| OpenHermes 2.5 | 7B | 59.3 | 7.54 | 48.2 | 49.4 | 46.5 | 57.5 | 63.8 | 73.5 | 59.9 |
|
| 119 |
+
| OpenOrca Mistral | 7B | 52.7 | 6.86 | 38.4 | 49.4 | 42.9 | 45.9 | 59.3 | 59.1 | 58.1 |
|
| 120 |
+
| Zephyr-Ξ²^ | 7B | 34.6 | 7.34 | 22.0 | 40.6 | 39.0 | 40.8 | 39.8 | 5.1 | 16.0 |
|
| 121 |
+
| Mistral | 7B | - | 6.84 | 30.5 | 39.0 | 38.0 | - | 60.1 | 52.2 | - |
|
|
|
|
|
|
|
|
|
|
| 122 |
## π Comparison with [X.AI Grok](https://x.ai/)
|
| 123 |
|
| 124 |
+
| | License | # Param | Average | MMLU | HumanEval | MATH | GSM8k |
|
| 125 |
+
|-------------------|-------------|---------|----------|------|-----------|----------|----------|
|
| 126 |
+
| OpenChat 3.5 1210 | Apache-2.0 | **7B** | **60.1** | 65.3 | **68.9** | **28.9** | **77.3** |
|
| 127 |
+
| OpenChat 3.5 | Apache-2.0 | **7B** | 56.4 | 64.3 | 55.5 | 28.6 | **77.3** |
|
| 128 |
+
| Grok-0 | Proprietary | 33B | 44.5 | 65.7 | 39.7 | 15.7 | 56.8 |
|
| 129 |
+
| Grok-1 | Proprietary | ???B | 55.8 | 73 | 63.2 | 23.9 | 62.9 |
|
| 130 |
|
| 131 |
# πContact
|
| 132 |
|