Update README.md
README.md
CHANGED
@@ -13,5 +13,7 @@ license: bsd-3-clause
 # PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models
 
 This application presents the results of several models that we have
-evaluated on verbal reasoning challenge
-
+evaluated on a verbal reasoning challenge
+([Papers](https://huggingface.co/papers/2502.01584),
+[ArXiv](https://arxiv.org/abs/2502.01584)).
+The overall results are below. Use the tabs above to explore the results in more detail.