update metric score
README.md
CHANGED
@@ -72,11 +72,11 @@ As with all language models, it is hard to predict in advance how GPT-NeoX-Ko wi
 
 <figure>
 
-| Model | Public | Training FLOPs |
+| Model | Public | Training FLOPs | kobest_boolq β | kobest_copa β | kobest_wic β | kobest_hellaswag β | kobest_sentineg β | Dataset Size (GB) |
 |--------------------------|-------------|----------------|--- |--- |--- |--- |--- |-------------------|
-| KoGPT-trinity‡ | ✗ | ----- |
-| KoGPT-KakaoBrain‡ | ✗ | ----- |
-| GPT-NeoX-Ko-1.3B(ours)‡ | ✗ | ----- |
+| KoGPT-trinity‡ | ✗ | ----- | 0.6663 | 0.6222 | 0.656 | 0.4011 | 0.3534 | ----- |
+| KoGPT-KakaoBrain‡ | ✗ | ----- | 0.3241 | 0.719 | 0.1356 | 0.4616 | 0.8065 | ----- |
+| GPT-NeoX-Ko-1.3B(ours)‡ | ✗ | ----- | 0.5174 | 0.7072 | 0.6567 | 0.417 | 0.8444 | ----- |
 
 
 <figcaption><p>Models roughly sorted by performance, or by FLOPs if not available.</p>
@@ -111,17 +111,6 @@ To cite this model:
 }
 ```
 
-To cite the codebase that trained this model:
-```bibtex
-@misc{mesh-transformer-jax,
-author = {Wang, Ben},
-title = {{Mesh-Transformer-JAX: Model-Parallel Implementation of Transformer Language Model with JAX}},
-howpublished = {\url{https://github.com/kingoflolz/mesh-transformer-jax}},
-year = 2021,
-month = May
-}
-```
-
 If you use this model, we would love to hear about it! Reach out on [GitHub](https://github.com/kingoflolz/mesh-transformer-jax), Discord, or shoot Ben an email.
 
 ## Acknowledgements