Update README.md
Browse files
README.md
CHANGED
|
@@ -19,6 +19,21 @@ The evaluation dataset is in Chinese, and we used the same language model **RoBE
|
|
| 19 |
| [hellonlp/simcse-base-zh(sup)](https://huggingface.co/hellonlp/simcse-roberta-base-zh) | **80.96**| -| -| -| -| -|
|
| 20 |
|
| 21 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
## Uses
|
| 23 |
You can use our model for encoding sentences into embeddings
|
| 24 |
```python
|
|
|
|
| 19 |
| [hellonlp/simcse-base-zh(sup)](https://huggingface.co/hellonlp/simcse-roberta-base-zh) | **80.96**| -| -| -| -| -|
|
| 20 |
|
| 21 |
|
| 22 |
+
|
| 23 |
+
## Data List
|
| 24 |
+
The following data are all in Chinese.
|
| 25 |
+
| Data | Link | size(train) | size(valid) | size(test) |
|
| 26 |
+
|:-----------------------:|:------------:|:------------:|:------------:|:------------:|
|
| 27 |
+
| STS-B | [STS-B](https://link.zhihu.com/?target=https%3A//pan.baidu.com/s/10yfKfTtcmLQ70-jzHIln1A%3Fpwd%3Dgf8y)| 5231| 1458| 1361|
|
| 28 |
+
| ATEC | [ATEC](https://link.zhihu.com/?target=https%3A//pan.baidu.com/s/1gmnyz9emqOXwaHhSM9CCUA%3Fpwd%3Db17c)| 62477| 20000| 20000|
|
| 29 |
+
| BQ | [BQ](https://link.zhihu.com/?target=https%3A//pan.baidu.com/s/1M-e01yyy5NacVPrph9fbaQ%3Fpwd%3Dtis9)| 100000| 10000| 10000|
|
| 30 |
+
| LCQMC | [LCQMC](https://pan.baidu.com/s/16DfE7fHrCkk4e8a2j3SYUg?pwd=bc8w )| 238766| 8802| 12500|
|
| 31 |
+
| PAWSX | [PAWSX](https://link.zhihu.com/?target=https%3A//pan.baidu.com/s/1ox0tJY3ZNbevHDeAqDBOPQ%3Fpwd%3Dmgjn)| 49401| 2000| 2000|
|
| 32 |
+
| SNLI | [SNLI](https://link.zhihu.com/?target=https%3A//pan.baidu.com/s/1NOgA7JwWghiauwGAUvcm7w%3Fpwd%3Ds75v)| 146828| 2699| 2618|
|
| 33 |
+
| MNLI | [MNLI](https://link.zhihu.com/?target=https%3A//pan.baidu.com/s/1xjZKtWk3MAbJ6HX4pvXJ-A%3Fpwd%3D2kte)| 122547| 2932| 2397|
|
| 34 |
+
|
| 35 |
+
|
| 36 |
+
|
| 37 |
## Uses
|
| 38 |
You can use our model for encoding sentences into embeddings
|
| 39 |
```python
|