LLM360
/

Crystal

@@ -20,14 +20,14 @@ Despite being trained on a smaller dataset of 1.4 trillion tokens—compared to
 It demonstrates superior performance in benchmarks like MMLU, HumanEval, and MBPP.
 By comparing CrystalCoder with other similar work, CrystalCoder is quite balance on language and coding tasks.
-| Model | Trained Tokens | ARC | HellaSwag | MMLU (5-shot) | TruthfulQA | Language Avg. | HumanEval (pass@1) | MBPP (pass@1) | Coding Avg. | Avg. of Avg.|
-| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
-| Mistral 7B | - | 59.98 | 83.31 | 64.16 | 42.15 | 62.40 | 29.12 | 38.78 | 33.95 | 48.68 |
-| **CrystalCoder 7B** | 1.4T | 47.01 | 71.97 | 48.78 | 35.91 | 50.92 | 28.38 | 36.38 | 32.38 | 41.65 |
-| CodeLlaMA 7B | 2.5T | 39.93 | 60.80 | 31.12 | 37.82 | 42.42 | 33.50 | 41.40 | 37.45 | 39.94 |
-| OpenLLaMA v2 7B | 1T | 43.60 | 72.20 | 41.29 | 35.54 | 48.18 | 15.32 | 12.69 | 28.01 | 38.10 |
-| LLaMA 2 7B | 2T | 53.07 | 77.74 | 43.80 | 38.98 | 53.39 | 13.05 | 20.09 | 16.57 | 34.98 |
-| StarCoder-15B | 1.03 | - | - | - | - | - | 33.63 | 43.28 | 38.46 | - |
 ## About LLM360
 LLM360 is an initiative for comprehensive and fully open-sourced LLMs,

 It demonstrates superior performance in benchmarks like MMLU, HumanEval, and MBPP.
 By comparing CrystalCoder with other similar work, CrystalCoder is quite balance on language and coding tasks.
+|        Model        | Trained Tokens | Avg. of Avg. | Language Avg. | Coding Avg. |  ARC  | HellaSwag | MMLU (5-shot) | TruthfulQA | HumanEval (pass@1) | MBPP (pass@1) |
+|:-------------------:|:--------------:|:------------:|:-------------:|:-----------:|:-----:|:---------:|:-------------:|:----------:|:------------------:|:-------------:|
+| Mistral 7B          | -              | 48.68        | 62.40         | 33.95       | 59.98 | 83.31     | 64.16         | 42.15      | 29.12              | 38.78         |
+| **CrystalCoder 7B** | 1.4T           | 41.65        | 50.92         | 32.38       | 47.01 | 71.97     | 48.78         | 35.91      | 28.38              | 36.38         |
+| CodeLlaMA 7B        | 2.5T           | 39.94        | 42.42         | 37.45       | 39.93 | 60.80     | 31.12         | 37.82      | 33.50              | 41.40         |
+| OpenLLaMA v2 7B     | 1T             | 38.10        | 48.18         | 28.01       | 43.60 | 72.20     | 41.29         | 35.54      | 15.32              | 12.69         |
+| LLaMA 2 7B          | 2T             | 34.98        | 53.39         | 16.57       | 53.07 | 77.74     | 43.80         | 38.98      | 13.05              | 20.09         |
+| StarCoder-15B       | 1.03           | -            | -             | 38.46       | -     | -         | -             | -          | 33.63              | 43.28         |
 ## About LLM360
 LLM360 is an initiative for comprehensive and fully open-sourced LLMs,