Update README.md
Browse files
README.md
CHANGED
|
@@ -29,7 +29,7 @@ K2 was developed as a collaboration between [MBZUAI](https://mbzuai.ac.ae/instit
|
|
| 29 |
The LLM360 Performance and Evaluation Collection is a robust evaluations set consisting of general and domain specific evaluations to assess model knowledge and function.
|
| 30 |
|
| 31 |
|
| 32 |
-
Evaluations include standard best practice benchmarks, medical, math, and coding knowledge. More about the evaluations can be found [here](llm360.ai/
|
| 33 |
|
| 34 |
|
| 35 |
<center><img src="k2_table_of_tables.png" alt="k2 big eval table"/></center>
|
|
|
|
| 29 |
The LLM360 Performance and Evaluation Collection is a robust evaluations set consisting of general and domain specific evaluations to assess model knowledge and function.
|
| 30 |
|
| 31 |
|
| 32 |
+
Evaluations include standard best practice benchmarks, medical, math, and coding knowledge. More about the evaluations can be found [here](https://www.llm360.ai/evaluation.html).
|
| 33 |
|
| 34 |
|
| 35 |
<center><img src="k2_table_of_tables.png" alt="k2 big eval table"/></center>
|